There are multiple ways of creating a morph layer.
Duplicate the norm layer. Name the new layer “ignore:morph”.
Split the words you have identified into the requisite number of tokens.
Consider using Google Refine to check your work (see if compound words or words containing mnt-, at-, ref-, etc., are still in the ignore:morph layer).
Complete the steps in the next section to create the morph layer.
Many of these steps are demonstrated in this video.
You need to create a clean morph layer that has only unique data in it; 80-90% of the data in ignore:morph is identical to the data in norm, making it difficult for a human to see when you’ve got compound words or morphs. It’s cluttered. So the morph layer in our published annotated corpora only contains unique data that differs from the word-level layers. (Word-level layers in Coptic SCRIPTORIUM are usually named “orig” and “norm.”)
1. Insert a new column for the morph layer but it should be empty (as in the video)
2. In the first cell of data, type in a conditional function that will look to see if the ignore:morph cell is identical to the norm cell on that row. If they are identical, the formula will make the cell blank; if they’re not identical morph will contain the morphemes found in ignore:morph. The formula should look something like this:
where E2 is the norm layer and F2 is the ignore:morph layer. Hit “return” when you are done typing the formula so that it disappears. Then select the cell.
3. Select the cell with your formula in it and select the rest of the column down to the end of the layer data. Use the “Edit>Fill>Down” menu item to fill in that column with the formula. You should now have a clean morph layer that contains only the relevant morphs when they appear.