Pinned Repositories
3_WikidataEnrichment
align manuscript authors with wikidata entities, create a database on those through sparql, add the wikidata ids to the catalogues
Catalogues
Specifications and example for encoding catalogues with GROBID
DTS
New_OutputData
Encoded TEI-XML catalogues
OCR-cat
OCRcat
Data por OCR
reconciliation
soldMss
Sold manuscripts : scripts and data.
utils
some cool tools
visualisations
visualisations produites à partir du json créé en fin d'étape 4 (4_TaggedData)
katabase's Repositories
katabase/3_WikidataEnrichment
align manuscript authors with wikidata entities, create a database on those through sparql, add the wikidata ids to the catalogues
katabase/Catalogues
Specifications and example for encoding catalogues with GROBID
katabase/Application
Web app and API of the Katabase/MSS project.
katabase/DTS
katabase/New_OutputData
Encoded TEI-XML catalogues
katabase/OCR-cat
katabase/OCRcat
Data por OCR
katabase/reconciliation
katabase/soldMss
Sold manuscripts : scripts and data.
katabase/utils
some cool tools
katabase/visualisations
visualisations produites à partir du json créé en fin d'étape 4 (4_TaggedData)
katabase/1_OutputData
Digitsed catalogues
katabase/2_CleanedData
Cleaned catalogues.
katabase/4_TaggedData
Tagged catalogues.
katabase/CatMan_ExhibCat_dataset
katabase/Data_extraction
This repository contains everything we need for the data extraction.
katabase/GROBID_typo
Training data with the typographical information for GROBID-Dictionaries