User Tools

Site Tools


start

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Last revisionBoth sides next revision
start [2016/08/28 22:30] – Adding NLP page eplattestart [2023/05/31 15:33] – updated main page link admin
Line 1: Line 1:
  ====== Coptic SCRIPTORIUM Wiki ======   ====== Coptic SCRIPTORIUM Wiki ====== 
  
-http://www.copticscriptorium.org+https://copticscriptorium.org
  
- ==== Current Guidelines ====     + ===== Current Guidelines =====     
  
 [[Annotation layer names]]      [[Annotation layer names]]     
  
-[[http://copticscriptorium.org/download/tools/SCRIPTORIUMDiplTranscriptionGuidelines.pdf|Diplomatic transcription guidelines]]     +[[https://github.com/CopticScriptorium/tagger-part-of-speech/raw/master/scriptorium-transcription-guidelines.pdf|Diplomatic transcription guidelines]]     
  
-Tokenization guidelines (in Section 4 of the [[http://copticscriptorium.org/download/tools/SCRIPTORIUMDiplTranscriptionGuidelines.pdf|Transcription Guidelines]]     +Tokenization guidelines (in Section 4 of the [[https://github.com/CopticScriptorium/tagger-part-of-speech/raw/master/scriptorium-transcription-guidelines.pdf|Transcription Guidelines]]    
  
-[[http://copticscriptorium.org/download/tools/scriptorium_tagset_documentation.pdf|Part of speech tagging guidelines]]    +[[https://github.com/CopticScriptorium/tagger-part-of-speech/raw/master/scriptorium_tagset_documentation.pdf|Part of speech tagging guidelines]]    
  
 [[https://github.com/CopticScriptorium/tagger-part-of-speech/blob/master/Coptic%20SCRIPTORIUM%20lemmatization%20guidelines.pdf|Lemmatization guidelines]] [[https://github.com/CopticScriptorium/tagger-part-of-speech/blob/master/Coptic%20SCRIPTORIUM%20lemmatization%20guidelines.pdf|Lemmatization guidelines]]
  
- ==== KELLIA Guidelines in Process ====+[[https://universaldependencies.org/cop/|Treebanking guidelines]]
  
-[[KELLIA:unicode:Coptic Unicode standards and guidelines for Coptologists]]+[[https://github.com/CopticScriptorium/entity-tagging/raw/master/coptic_scriptorium_entity_guidelines.pdf|Entity annotation guidelines]]
  
-[[KELLIA:LOD:Controlled Vocabularies for Linked Open Data]]+ ===== Contributor/Annotator Tools and Processes ===== 
  
- ==== Contributor/Annotator Tools and Processes ==== + ==== Transcription ==== 
  
- ==Transcription== +[[Transcribe a text|Transcribe a text in a text editor]]
  
-[[Transcribe a text]]  +[[gitdox_workflow|Transcribe a text in GitDox]]  
  
- ==Natural Language Processing Pipeline==+ ==== Natural Language Processing Pipeline and Text Annotation ====
  
-[[Natural Language Processing Service Online]]+[[Natural Language Processing Service Online]] (the public web interface for the NLP pipeline, available to the general public)
  
-[[https://corpling.uis.georgetown.edu/coptic-nlp/|Natural Language Processing Service Online]] +[[Basic Annotation Workflow]] for project editors and annotators
  
- ==Language Processing Tools==+ ==== Versification and Chapter Divisions ==== 
 +   
 +[[versification]] standards
  
-[[Tokenizer]]     + ==== Metadata ====
  
-[[Import macro]]    +[[metadata|Metadata]] annotations
  
-[[Normalization]]     +[[corpus_metadata| Corpus-level metadata]]
  
-[[Part of Speech Tagging using Tree-tagger]]     + ==== Pelagios and Pleiades Linked Geographic Data ==== 
 +[[pelagios_rdf|Editing Pelagios RDF (ttl) files]]
  
-[[Annotating sub-word morphemes]]     + ==== Administrative Documentation and Calendar ====     
  
-[[Language of origin tagging]]    +[[https://calendar.google.com/calendar/embed?src=u.pacific.edu_op37egc45tbarj6d6pejiqgsn0%40group.calendar.google.com&ctz=America/Los_Angeles|Coptic SCRIPTORIUM Calendar]] (permission required)
  
- ==Metadata==+[[Checklist for Publishing Corpora]] 
  
-[[metadata|Metadata]]+[[Onboarding New Annotators/Editors]]      
  
- ==== Administrative Workflows ====     +[[URN Resolver Database Administration]] (may be outdated)    
  
-[[Basic Annotation Workflow]]   +[[URN Resolver Web Application Administration (Archimedes Documentation)]] (may be outdated)
  
-[[Checklist for Publishing Corpora]]       +Quick link to all open Coptic SCRIPTORIUM [[https://github.com/issues?utf8=%E2%9C%93&q=is%3Aopen+is%3Aissue+user%3ACopticScriptorium+|GitHub Issues]]
  
-[[URN Resolver Database Administration]]     + ===== KELLIA Guidelines and White Papers===== 
 + 
 +[[https://kellia.uni-goettingen.de/downloads/KELLIA-transcription-white-paper.pdf|KELLIA White Paper on Transcription and Encoding Guidelines for Coptic Literature]] 
 + 
 +[[https://kellia.uni-goettingen.de/downloads/KELLIA-metadata-white-paper.pdf|KELLIA White Paper on Metadata Standards for Digital Coptic]]  
 + 
 +[[https://kellia.uni-goettingen.de/downloads/KELLIA-linked-data-white-paper.pdf|KELLIA White Paper on Linked Data Standards and Practices for Digital Coptic]] 
 + 
 +[[KELLIA:unicode:Coptic Unicode standards and guidelines for Coptologists]] Draft 
 + 
 + 
 + ==== Stand-alone Language Processing Tools ==== 
 + 
 +Note: the NLP pipeline online, described above, provides a more updated and accurate suite of tools. 
 + 
 +[[Tokenizer]]      
 + 
 +[[Import macro]]     
 + 
 +[[Normalization]]      
 + 
 +[[Part of Speech Tagging using Tree-tagger]]      
 + 
 +[[Annotating sub-word morphemes]]     
  
-[[URN Resolver Web Application Administration (Archimedes Documentation)]]     +[[Language of origin tagging]]  
  
  
- ==== March 2015 Workshop Resources ====     + ===== March 2015 Workshop Resources =====     
  
 [[Processing|Procedure for processing a Coptic text for inclusion in our corpora]]  [[Processing|Procedure for processing a Coptic text for inclusion in our corpora]] 
start.txt · Last modified: 2023/05/31 15:36 by admin