start
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
start [2020/08/03 18:12] – [Natural Language Processing Pipeline] admin | start [2025/01/09 10:30] (current) – added validator admin | ||
---|---|---|---|
Line 1: | Line 1: | ||
| | ||
- | http://www.copticscriptorium.org | + | https:// |
===== Current Guidelines ===== | ===== Current Guidelines ===== | ||
Line 7: | Line 7: | ||
[[Annotation layer names]] | [[Annotation layer names]] | ||
- | [[http:// | + | See the [[https:// |
+ | * Quick Start Guide for editors and annotators | ||
+ | * Transcription | ||
+ | * Part of Speech Tagging | ||
+ | * Lemmatization | ||
+ | * Entity Annotations | ||
- | Tokenization guidelines (in Section 4 of the http:// | + | [[https://universaldependencies.org/cop/|Treebanking |
- | + | ||
- | [[https:// | + | |
- | + | ||
- | [[https:// | + | |
===== Contributor/ | ===== Contributor/ | ||
Line 21: | Line 22: | ||
[[Transcribe a text|Transcribe a text in a text editor]] | [[Transcribe a text|Transcribe a text in a text editor]] | ||
- | [[gitdox_workflow|Transcribe a text in GitDox]] | + | [[gitdox_workflow|Transcribe a text in GitDox]] |
+ | |||
+ | [[https:// | ||
==== Natural Language Processing Pipeline and Text Annotation ==== | ==== Natural Language Processing Pipeline and Text Annotation ==== | ||
Line 28: | Line 31: | ||
[[Basic Annotation Workflow]] for project editors and annotators | [[Basic Annotation Workflow]] for project editors and annotators | ||
- | |||
- | ==== Language Processing Tools ==== | ||
- | |||
- | [[Tokenizer]] | ||
- | |||
- | [[Import macro]] | ||
- | |||
- | [[Normalization]] | ||
- | |||
- | [[Part of Speech Tagging using Tree-tagger]] | ||
- | |||
- | [[Annotating sub-word morphemes]] | ||
- | |||
- | [[Language of origin tagging]] | ||
==== Versification and Chapter Divisions ==== | ==== Versification and Chapter Divisions ==== | ||
Line 60: | Line 49: | ||
[[https:// | [[https:// | ||
- | [[Checklist for Publishing Corpora]] | + | [[Checklist for Publishing Corpora]] |
- | [[URN Resolver Database Administration]] | + | [[Onboarding New Annotators/ |
- | [[URN Resolver Web Application Administration (Archimedes Documentation)]] | + | [[URN Resolver Database Administration]] (may be outdated) |
+ | |||
+ | [[URN Resolver Web Application Administration (Archimedes Documentation)]] | ||
Quick link to all open Coptic SCRIPTORIUM [[https:// | Quick link to all open Coptic SCRIPTORIUM [[https:// | ||
- | ===== KELLIA Guidelines | + | ===== KELLIA Guidelines |
- | [[KELLIA: | + | [[https:// |
+ | |||
+ | [[https:// | ||
+ | |||
+ | [[https:// | ||
+ | |||
+ | [[KELLIA: | ||
+ | |||
+ | |||
+ | ==== Stand-alone Language Processing Tools ==== | ||
+ | |||
+ | Note: the NLP pipeline online, described above, provides a more updated and accurate suite of tools. | ||
+ | |||
+ | [[Tokenizer]] | ||
+ | |||
+ | [[Import macro]] | ||
+ | |||
+ | [[Normalization]] | ||
+ | |||
+ | [[Part of Speech Tagging using Tree-tagger]] | ||
+ | |||
+ | [[Annotating sub-word morphemes]] | ||
+ | |||
+ | [[Language of origin tagging]] | ||
- | [[KELLIA: | ||
===== March 2015 Workshop Resources ===== | ===== March 2015 Workshop Resources ===== | ||
[[Processing|Procedure for processing a Coptic text for inclusion in our corpora]] | [[Processing|Procedure for processing a Coptic text for inclusion in our corpora]] |
start.1596499941.txt.gz · Last modified: 2020/08/03 18:12 by admin