User Tools

Site Tools


pelagios_rdf

Pelagios and Pleiades Linked Geographic Data

Introductory Information

Coptic Scriptorium's text data is linked to Pelagios and Pleiades via Pelagios' API. Using RDF, Pelagios links to a query for a term in our corpora (in ANNIS) that corresponds to the Pelagios entry.

The RDF file is located in our GitHub repositories: https://github.com/CopticScriptorium/pelagios-dataset-summary

A list of places (currently managed by Elizabeth Platte) is in our GitHub repositories at: https://github.com/CopticScriptorium/entity-tagging

The RDF file (pelagios.ttl) needs to be updated every time a new place name is added to our corpora/list of places and every time a new corpus is published in ANNIS.

The file pelagios-void.ttl should never need to be changed (unless Pelagios asks us to).

Each place is considered an “item” and each item has an item declaration (defining it) and ONE annotation that aligns it to the Pleiades entry for that place.

Adding a New Item (Place)

The following description uses this item as an example:

<https://github.com/CopticScriptorium/pelagios-dataset-summary/pelagios.ttl#items/00l>
a pelagios:AnnotatedThing ;
dcterms:title “Coptic SCRIPTORIUM corpora query ϩⲓⲉⲣⲟⲥⲟⲗⲩⲙⲁ/ϩⲓⲉⲣⲟⲩⲥⲁⲗⲏⲙ” ;
foaf:homepage <https://corpling.uis.georgetown.edu/annis/scriptorium#_q=bGVtbWE9Is-p4rKT4rKJ4rKj4rKf4rKp4rKl4rKB4rKX4rKP4rKZIiB8IGxlbW1hPSLPqeKyk-KyieKyo-Kyn-KypeKyn-Kyl-KyqeKymeKygSI&_c=YmVzYS5sZXR0ZXJzLHNoZW5vdXRlLmEyMixzaGVub3V0ZS5hYnJhaGFtLm91ci5mYXRoZXIsYXBvcGh0aGVnbWF0YS5wYXRydW0sc2FoaWRpY2EubnQsc2hlbm91dGUuZWFnZXJuZXNzLHNhaGlkaWMub3Qsc2FoaWRpY2EuMWNvcmludGhpYW5zLHNoZW5vdXRlLmZveCxzYWhpZGljYS5tYXJrLGRvYy5wYXB5cmk&cl=5&cr=5&s=0&l=10&_seg=bm9ybV9ncm91cA> ;
dcterms:language “cop” ;.

<https://github.com/CopticScriptorium/pelagios-dataset-summary/pelagios.ttl#items/00l/annotations/01>
a oa:Annotation ;
oa:hasTarget <https://github.com/CopticScriptorium/pelagios-dataset-summary/pelagios.ttl#items/00l> ;
oa:hasBody <https://pleiades.stoa.org/places/687928> ;.

The simplest process is to copy and paste an existing entry and edit.

  • Give the item a new item number (see the number after #items/ above)
  • Be sure to update WHEREVER the item number appears in the DECLARATION and the ANNOTATION
  • Update dcterms:title with the form of the term being searched (should be the lemma)
  • Update the URL to the query in ANNIS for that lemma; IMPORTANT: copy the link from the browser (not the short link available under the share button).
  • Update the oa:hasBody link in the annotation to the correct stable link for this place in Pleiades

Adding a New Corpus/Corpora to the RDF

Look at one of the entries and find all the text in the query that runs AFTER “c=” in the query up to the next ampersand. This is the code for the corpora being queried in ANNIS. In the example above, you will find: “c=YmVzYS5sZXR0ZXJzLHNoZW5vdXRlLmEyMixzaGVub3V0ZS5hYnJhaGFtLm91ci5mYXRoZXIsYXBvcGh0aGVnbWF0YS5wYXRydW0sc2FoaWRpY2EubnQsc2hlbm91dGUuZWFnZXJuZXNzLHNhaGlkaWMub3Qsc2FoaWRpY2EuMWNvcmludGhpYW5zLHNoZW5vdXRlLmZveCxzYWhpZGljYS5tYXJrLGRvYy5wYXB5cmk”

Run a query in ANNIS for any term; be sure you query all the corpora you want queried by Pelagios. Look for the text in the link in the browser bar that runs from “c=” to the next ampersand. Copy that text string and paste it somewhere safe.

Run a search and replace on the entire pelagios.ttl file to search for the old “c=” string and replace it with the new “c=” string

Commit All Updates to GitHub

Be sure to commit all updates to the pelagios.ttl file and sync them with our GitHub repository online.

pelagios_rdf.txt · Last modified: 2018/09/25 11:44 by eplatte