Table of Contents

Checklist for Publishing and Releasing Corpora

1. New and revised docs should be reviewed by a Senior editor.

2. Add/correct metadata on new documents

3. Check the Issues list for each corpus to be released (whether new or revised versions of documents) on GitHub.

Each corpus may have a list of errors noticed by users or team members. (E.g., https://github.com/CopticScriptorium/ap-dev/issues/35). Make corrections, and note on the issues list that the corrections have been made.

4. Add/correct metadata on edited, previously published documents.

We now use the SAME version # and date on revised documents as on corpora. Give the revised documents the same version # and date as the updated version # and date going in the corpus metadata (see step 5 below). Note: an annotator may have made a minor change a while back and changed the version # and version date, even though the revised document has not yet been published. We do not republish a corpus every time we make a minor revision to one document. You may wish to check the document's version # in our development files against the number in ANNIS if you have any questions. A discrepancy means someone has edited the document; please make sure the version # & date are correct.

5. Add/correct the corpus metadata.

Note: the information in this section describes the workflow before we migrated to Gitdox. This section needs to be updated. Corpus metadata appears on the first document in a corpus.

6. Validate the file.

7. Convert to TEI and PAULA and relANNIS and publish on SCRIPTORIUM ANNIS server.

Typically performed by AZ.

8. Check ANNIS visualizations to be certain there are no obvious bugs in the corpora or stylesheet.

9. Convert to TEI XML.

10. Convert to PAULA & relANNIS and publish on ANNIS server

Typically performed by AZ.

11. Post TEI, relANNIS and PAULA files to GitHub public repository in their respective directories

E.g., https://github.com/CopticScriptorium/corpora/tree/master/AP/apophthegmata.patrum_PAULA for the PAULA XML files of the Apophthegmata Patrum (AP) corpus

===12. Create new meta.json file for linked data applications (with PATHS) and post in corpora repository

13. Create a new release of the GitHub corpora repository, posting information about the latest changes in the release.

At https://github.com/CopticScriptorium/corpora/releases, click “Draft New Release.” Give it a new version number. (Should be same number as the new corpus and document version #s) Describe the corpus and changes/ additions in the description.

14. Update the urn mapping file.

https://github.com/CopticScriptorium/cts/blob/master/coptic/gh_ingest/name_mapping.tab

15. New ingest at data.copticscriptorium.org to account for new data.

Create new corpora, visualizations, etc., if necessary; see documentation in wiki for this application)