User Tools

Site Tools


start

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
start [2020/08/03 18:12] – [Natural Language Processing Pipeline] adminstart [2023/05/31 15:36] (current) – [Current Guidelines] updated admin
Line 1: Line 1:
  ====== Coptic SCRIPTORIUM Wiki ======   ====== Coptic SCRIPTORIUM Wiki ====== 
  
-http://www.copticscriptorium.org+https://copticscriptorium.org
  
  ===== Current Guidelines =====       ===== Current Guidelines =====     
Line 7: Line 7:
 [[Annotation layer names]]      [[Annotation layer names]]     
  
-[[http://copticscriptorium.org/download/tools/SCRIPTORIUMDiplTranscriptionGuidelines.pdf|Diplomatic transcription guidelines]]     +See the [[https://copticscriptorium.org/documentation|Documentation Page]] on our main website for the following guidelines: 
 +  * Quick Start Guide for editors and annotators 
 +  * Transcription 
 +  * Part of Speech Tagging 
 +  * Lemmatization 
 +  * Entity Annotations
  
-Tokenization guidelines (in Section 4 of the http://wiki.copticscriptorium.org/doku.php?id=start&do=backlink[[http://copticscriptorium.org/download/tools/SCRIPTORIUMDiplTranscriptionGuidelines.pdf|Transcription Guidelines]])      +[[https://universaldependencies.org/cop/|Treebanking guidelines]]
- +
-[[https://github.com/CopticScriptorium/tagger-part-of-speech/blob/master/scriptorium_tagset_documentation.pdf|Part of speech tagging guidelines]]     +
- +
-[[https://github.com/CopticScriptorium/tagger-part-of-speech/blob/master/Coptic%20SCRIPTORIUM%20lemmatization%20guidelines.pdf|Lemmatization guidelines]]+
  
  ===== Contributor/Annotator Tools and Processes =====   ===== Contributor/Annotator Tools and Processes ===== 
Line 28: Line 29:
  
 [[Basic Annotation Workflow]] for project editors and annotators [[Basic Annotation Workflow]] for project editors and annotators
- 
- ==== Language Processing Tools ==== 
- 
-[[Tokenizer]]      
- 
-[[Import macro]]     
- 
-[[Normalization]]      
- 
-[[Part of Speech Tagging using Tree-tagger]]      
- 
-[[Annotating sub-word morphemes]]      
- 
-[[Language of origin tagging]]   
  
  ==== Versification and Chapter Divisions ====  ==== Versification and Chapter Divisions ====
Line 60: Line 47:
 [[https://calendar.google.com/calendar/embed?src=u.pacific.edu_op37egc45tbarj6d6pejiqgsn0%40group.calendar.google.com&ctz=America/Los_Angeles|Coptic SCRIPTORIUM Calendar]] (permission required) [[https://calendar.google.com/calendar/embed?src=u.pacific.edu_op37egc45tbarj6d6pejiqgsn0%40group.calendar.google.com&ctz=America/Los_Angeles|Coptic SCRIPTORIUM Calendar]] (permission required)
  
-[[Checklist for Publishing Corpora]]       +[[Checklist for Publishing Corpora]] 
  
-[[URN Resolver Database Administration]]     +[[Onboarding New Annotators/Editors]]      
  
-[[URN Resolver Web Application Administration (Archimedes Documentation)]]+[[URN Resolver Database Administration]] (may be outdated)     
 + 
 +[[URN Resolver Web Application Administration (Archimedes Documentation)]] (may be outdated)
  
 Quick link to all open Coptic SCRIPTORIUM [[https://github.com/issues?utf8=%E2%9C%93&q=is%3Aopen+is%3Aissue+user%3ACopticScriptorium+|GitHub Issues]] Quick link to all open Coptic SCRIPTORIUM [[https://github.com/issues?utf8=%E2%9C%93&q=is%3Aopen+is%3Aissue+user%3ACopticScriptorium+|GitHub Issues]]
  
- ===== KELLIA Guidelines in Process =====+ ===== KELLIA Guidelines and White Papers=====
  
-[[KELLIA:unicode:Coptic Unicode standards and guidelines for Coptologists]]+[[https://kellia.uni-goettingen.de/downloads/KELLIA-transcription-white-paper.pdf|KELLIA White Paper on Transcription and Encoding Guidelines for Coptic Literature]] 
 + 
 +[[https://kellia.uni-goettingen.de/downloads/KELLIA-metadata-white-paper.pdf|KELLIA White Paper on Metadata Standards for Digital Coptic]]  
 + 
 +[[https://kellia.uni-goettingen.de/downloads/KELLIA-linked-data-white-paper.pdf|KELLIA White Paper on Linked Data Standards and Practices for Digital Coptic]] 
 + 
 +[[KELLIA:unicode:Coptic Unicode standards and guidelines for Coptologists]] Draft 
 + 
 + 
 + ==== Stand-alone Language Processing Tools ==== 
 + 
 +Note: the NLP pipeline online, described above, provides a more updated and accurate suite of tools. 
 + 
 +[[Tokenizer]]      
 + 
 +[[Import macro]]     
 + 
 +[[Normalization]]      
 + 
 +[[Part of Speech Tagging using Tree-tagger]]      
 + 
 +[[Annotating sub-word morphemes]]      
 + 
 +[[Language of origin tagging]]  
  
-[[KELLIA:LOD:Controlled Vocabularies for Linked Open Data]] 
  
  ===== March 2015 Workshop Resources =====       ===== March 2015 Workshop Resources =====     
  
 [[Processing|Procedure for processing a Coptic text for inclusion in our corpora]]  [[Processing|Procedure for processing a Coptic text for inclusion in our corpora]] 
start.1596499941.txt.gz · Last modified: 2020/08/03 18:12 by admin