basic_annotation_workflow

Differences

This shows you the differences between two versions of the page.

basic_annotation_workflow [2017/04/18 18:17] – small updates to annotation workflow eplatte
basic_annotation_workflow [2020/08/03 16:56] admin
Line 1: Line 1:
 ==== Basic Annotation Workflow ====
  
-[[transcribe_a_text|Text file]]
+=== Transcribe your text ===
 + 
 +Transcribe your text in [[gitdox_workflow|GitDox]]. Alternatively, transcribe your text into a [[transcribe_a_text|text file]]. Be sure the transcription divides the text into bound groups.
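
As a quick sanity check before pasting a text file into GitDox, the division into bound groups can be inspected with a short script. This is only a sketch: it assumes bound groups are separated by whitespace in the transcription, and the function names and the `max_len` threshold are hypothetical, not part of GitDox or the NLP Service.

```python
def split_bound_groups(transcription: str) -> list[str]:
    """Split a transcription into bound groups.

    Assumption: bound groups are separated by whitespace
    (spaces or line breaks) in the transcribed text.
    """
    return transcription.split()


def find_suspect_groups(groups: list[str], max_len: int = 25) -> list[str]:
    """Return groups longer than max_len characters.

    An unusually long group may mean a space was missed during
    transcription. The threshold is an arbitrary, hypothetical default.
    """
    return [g for g in groups if len(g) > max_len]


if __name__ == "__main__":
    # Placeholder text, not real corpus data.
    sample = "abc def\nghijklmnopqrstuvwxyzabcdefghij kl"
    groups = split_bound_groups(sample)
    print(len(groups), "bound groups")
    print("check these:", find_suspect_groups(groups))
```

Any group the script flags still needs to be checked by hand against the manuscript; the script only points to places where a division may be missing.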
  
 At this point, you may follow one of two paths:
Line 11: Line 13:
 === NLP Service Online Workflow ===
  
-[[natural_language_processing_service_online|Run the NLP Service]] on your transcribed text.
+[[natural_language_processing_service_online|Run the NLP Service]] on your transcribed text in GitDox.
 +  * If your text is in a text file, copy and paste it into the GitDox text editor. (See the [[gitdox_workflow|GitDox]] page for more information on using the GitDox text editor.) 
 +  * If your text is already transcribed into the GitDox text editor and validated (see [[gitdox_workflow|GitDox]]), you're ready for the NLP tools. 
 + 
 +You will see an NLP button below the text window. Click it.
  
-//You will generally want to proofread tokenization as part of the NLP Service process as described in the guidelines.//
+//Note for veteran GitDox users: you do not need to proofread tokenization as part of the NLP Service process. The NLP Service now works better without tokenizing first.//
  
 [[import_macro|Import the SGML into a spreadsheet.]]
Line 35: Line 41:
 [[create_a_normalized_bound_group_layer|Reconstruct the norm_group layer]].
  
-Proofread the part of speech (pos), lemma (lemma), and morpheme (morph) layers.
+Proofread the part of speech (pos), lemma (lemma), and morpheme (morph) layers. Part of speech and lemma are annotated on the norm level.
  
 Proofread the language of origin (lang) layer.
basic_annotation_workflow.txt · Last modified: 2020/08/03 18:08 by admin