Differences

This shows you the differences between two versions of the page.

start [2020/08/03 18:12] – [Natural Language Processing Pipeline] admin
start [2023/05/31 15:33] – updated main page link admin
Line 1: Line 1:
  ====== Coptic SCRIPTORIUM Wiki ======
  
-http://www.copticscriptorium.org
+https://copticscriptorium.org
  
  ===== Current Guidelines =====
Line 7: Line 7:
 [[Annotation layer names]]
  
-[[http://copticscriptorium.org/download/tools/SCRIPTORIUMDiplTranscriptionGuidelines.pdf|Diplomatic transcription guidelines]]
+[[https://github.com/CopticScriptorium/tagger-part-of-speech/raw/master/scriptorium-transcription-guidelines.pdf|Diplomatic transcription guidelines]]
  
-Tokenization guidelines (in Section 4 of the http://wiki.copticscriptorium.org/doku.php?id=start&do=backlink[[http://copticscriptorium.org/download/tools/SCRIPTORIUMDiplTranscriptionGuidelines.pdf|Transcription Guidelines]])
+Tokenization guidelines (in Section 4 of the [[https://github.com/CopticScriptorium/tagger-part-of-speech/raw/master/scriptorium-transcription-guidelines.pdf|Transcription Guidelines]])
  
-[[https://github.com/CopticScriptorium/tagger-part-of-speech/blob/master/scriptorium_tagset_documentation.pdf|Part of speech tagging guidelines]]
+[[https://github.com/CopticScriptorium/tagger-part-of-speech/raw/master/scriptorium_tagset_documentation.pdf|Part of speech tagging guidelines]]
  
 [[https://github.com/CopticScriptorium/tagger-part-of-speech/blob/master/Coptic%20SCRIPTORIUM%20lemmatization%20guidelines.pdf|Lemmatization guidelines]]
 +
 +[[https://universaldependencies.org/cop/|Treebanking guidelines]]
 +
 +[[https://github.com/CopticScriptorium/entity-tagging/raw/master/coptic_scriptorium_entity_guidelines.pdf|Entity annotation guidelines]]
  
  ===== Contributor/Annotator Tools and Processes =====
Line 28: Line 32:
  
 [[Basic Annotation Workflow]] for project editors and annotators
- 
- ==== Language Processing Tools ==== 
- 
-[[Tokenizer]]      
- 
-[[Import macro]]     
- 
-[[Normalization]]      
- 
-[[Part of Speech Tagging using Tree-tagger]]      
- 
-[[Annotating sub-word morphemes]]      
- 
-[[Language of origin tagging]]   
  
  ==== Versification and Chapter Divisions ====
Line 60: Line 50:
 [[https://calendar.google.com/calendar/embed?src=u.pacific.edu_op37egc45tbarj6d6pejiqgsn0%40group.calendar.google.com&ctz=America/Los_Angeles|Coptic SCRIPTORIUM Calendar]] (permission required)
  
-[[Checklist for Publishing Corpora]]
+[[Checklist for Publishing Corpora]]
  
-[[URN Resolver Database Administration]]
+[[Onboarding New Annotators/Editors]]
  
-[[URN Resolver Web Application Administration (Archimedes Documentation)]]
+[[URN Resolver Database Administration]] (may be outdated)
 + 
 +[[URN Resolver Web Application Administration (Archimedes Documentation)]] (may be outdated)
  
 Quick link to all open Coptic SCRIPTORIUM [[https://github.com/issues?utf8=%E2%9C%93&q=is%3Aopen+is%3Aissue+user%3ACopticScriptorium+|GitHub Issues]]
  
- ===== KELLIA Guidelines in Process =====
+ ===== KELLIA Guidelines and White Papers=====
  
-[[KELLIA:unicode:Coptic Unicode standards and guidelines for Coptologists]]
+[[https://kellia.uni-goettingen.de/downloads/KELLIA-transcription-white-paper.pdf|KELLIA White Paper on Transcription and Encoding Guidelines for Coptic Literature]]
 + 
 +[[https://kellia.uni-goettingen.de/downloads/KELLIA-metadata-white-paper.pdf|KELLIA White Paper on Metadata Standards for Digital Coptic]]  
 + 
 +[[https://kellia.uni-goettingen.de/downloads/KELLIA-linked-data-white-paper.pdf|KELLIA White Paper on Linked Data Standards and Practices for Digital Coptic]] 
 + 
 +[[KELLIA:unicode:Coptic Unicode standards and guidelines for Coptologists]] Draft 
 + 
 + 
 + ==== Stand-alone Language Processing Tools ==== 
 + 
 +Note: the NLP pipeline online, described above, provides a more updated and accurate suite of tools. 
 + 
 +[[Tokenizer]]      
 + 
 +[[Import macro]]     
 + 
 +[[Normalization]]      
 + 
 +[[Part of Speech Tagging using Tree-tagger]]      
 + 
 +[[Annotating sub-word morphemes]]      
 + 
 +[[Language of origin tagging]]  
  
-[[KELLIA:LOD:Controlled Vocabularies for Linked Open Data]] 
  
  ===== March 2015 Workshop Resources =====
  
 [[Processing|Procedure for processing a Coptic text for inclusion in our corpora]]
start.txt · Last modified: 2023/05/31 15:36 by admin