Differences

This shows you the differences between two versions of the page.

start [2018/01/09 10:19] – Live tagset link from github amirzeldes
start [2025/01/09 10:30] (current) – added validator admin
Line 1: Line 1:
  ====== Coptic SCRIPTORIUM Wiki ======   ====== Coptic SCRIPTORIUM Wiki ====== 
  
-http://www.copticscriptorium.org+https://copticscriptorium.org
  
  ===== Current Guidelines =====       ===== Current Guidelines =====     
Line 7: Line 7:
 [[Annotation layer names]]      [[Annotation layer names]]     
  
-[[http://copticscriptorium.org/download/tools/SCRIPTORIUMDiplTranscriptionGuidelines.pdf|Diplomatic transcription guidelines]]     +See the [[https://copticscriptorium.org/documentation|Documentation Page]] on our main website for the following guidelines: 
 +  * Quick Start Guide for editors and annotators 
 +  * Transcription 
 +  * Part of Speech Tagging 
 +  * Lemmatization 
 +  * Entity Annotations
  
-Tokenization guidelines (in Section 4 of the [[http://copticscriptorium.org/download/tools/SCRIPTORIUMDiplTranscriptionGuidelines.pdf|Transcription Guidelines]])      +[[https://universaldependencies.org/cop/|Treebanking guidelines]]
- +
-[[https://github.com/CopticScriptorium/tagger-part-of-speech/blob/master/scriptorium_tagset_documentation.pdf|Part of speech tagging guidelines]]     +
- +
-[[https://github.com/CopticScriptorium/tagger-part-of-speech/blob/master/Coptic%20SCRIPTORIUM%20lemmatization%20guidelines.pdf|Lemmatization guidelines]] +
- +
- ===== KELLIA Guidelines in Process ===== +
- +
-[[KELLIA:unicode:Coptic Unicode standards and guidelines for Coptologists]] +
- +
-[[KELLIA:LOD:Controlled Vocabularies for Linked Open Data]]+
  
  ===== Contributor/Annotator Tools and Processes =====   ===== Contributor/Annotator Tools and Processes ===== 
Line 27: Line 22:
 [[Transcribe a text|Transcribe a text in a text editor]] [[Transcribe a text|Transcribe a text in a text editor]]
  
-[[gitdox_workflow|Transcribe a text in GitDox]]  +[[gitdox_workflow|Transcribe a text in GitDox]] 
  
- ==== Natural Language Processing Pipeline ====+[[https://gucorpling.org/gitdox/validate_sgml.py|Validate CS SGML]] (use this if you get commit or NLP errors in XML mode in GitDox) 
  
-[[Natural Language Processing Service Online]]+ ==== Natural Language Processing Pipeline and Text Annotation ====
  
- ==== Language Processing Tools ====+[[Natural Language Processing Service Online]] (the public web interface for the NLP pipeline, available to the general public)
  
-[[Tokenizer]]     +[[Basic Annotation Workflow]] for project editors and annotators
  
-[[Import macro]]     + ==== Versification and Chapter Divisions ==== 
- +   
-[[Normalization]]      +[[versification]] standards
- +
-[[Part of Speech Tagging using Tree-tagger]]      +
- +
-[[Annotating sub-word morphemes]]      +
- +
-[[Language of origin tagging]]    +
  
  ==== Metadata ====  ==== Metadata ====
Line 52: Line 41:
  
 [[corpus_metadata| Corpus-level metadata]] [[corpus_metadata| Corpus-level metadata]]
 +
 + ==== Pelagios and Pleiades Linked Geographic Data ====
 +[[pelagios_rdf|Editing Pelagios RDF (ttl) files]]
  
  ==== Administrative Documentation and Calendar ====       ==== Administrative Documentation and Calendar ====     
Line 57: Line 49:
 [[https://calendar.google.com/calendar/embed?src=u.pacific.edu_op37egc45tbarj6d6pejiqgsn0%40group.calendar.google.com&ctz=America/Los_Angeles|Coptic SCRIPTORIUM Calendar]] (permission required) [[https://calendar.google.com/calendar/embed?src=u.pacific.edu_op37egc45tbarj6d6pejiqgsn0%40group.calendar.google.com&ctz=America/Los_Angeles|Coptic SCRIPTORIUM Calendar]] (permission required)
  
-[[Basic Annotation Workflow]]   +[[Checklist for Publishing Corpora]] 
  
-[[Checklist for Publishing Corpora]]       +[[Onboarding New Annotators/Editors]]      
  
-[[URN Resolver Database Administration]]     +[[URN Resolver Database Administration]] (may be outdated)    
  
-[[URN Resolver Web Application Administration (Archimedes Documentation)]]+[[URN Resolver Web Application Administration (Archimedes Documentation)]] (may be outdated)
  
 Quick link to all open Coptic SCRIPTORIUM [[https://github.com/issues?utf8=%E2%9C%93&q=is%3Aopen+is%3Aissue+user%3ACopticScriptorium+|GitHub Issues]] Quick link to all open Coptic SCRIPTORIUM [[https://github.com/issues?utf8=%E2%9C%93&q=is%3Aopen+is%3Aissue+user%3ACopticScriptorium+|GitHub Issues]]
 +
 + ===== KELLIA Guidelines and White Papers =====
 +
 +[[https://kellia.uni-goettingen.de/downloads/KELLIA-transcription-white-paper.pdf|KELLIA White Paper on Transcription and Encoding Guidelines for Coptic Literature]]
 +
 +[[https://kellia.uni-goettingen.de/downloads/KELLIA-metadata-white-paper.pdf|KELLIA White Paper on Metadata Standards for Digital Coptic]] 
 +
 +[[https://kellia.uni-goettingen.de/downloads/KELLIA-linked-data-white-paper.pdf|KELLIA White Paper on Linked Data Standards and Practices for Digital Coptic]]
 +
 +[[KELLIA:unicode:Coptic Unicode standards and guidelines for Coptologists]] (draft)
 +
 +
 + ==== Stand-alone Language Processing Tools ====
 +
 +Note: the online NLP pipeline, described above, provides a more up-to-date and accurate suite of tools.
 +
 +[[Tokenizer]]     
 +
 +[[Import macro]]    
 +
 +[[Normalization]]     
 +
 +[[Part of Speech Tagging using Tree-tagger]]     
 +
 +[[Annotating sub-word morphemes]]     
 +
 +[[Language of origin tagging]]  
  
  
start.1515518388.txt.gz · Last modified: 2018/01/09 10:19 by amirzeldes