kbrbe/beltrans-data-integration
Creating a FAIR Linked Data corpus for the BELTRANS research project about Belgian book translations NL-FR and FR-NL between 1970 and 2020
Jupyter Notebook · MIT
Issues
Identify KBR identifiers of originals by title lookup (or similarity) also from the correlation list
#282 opened by SvenLieber
Missing sourcePublisherIdentifier even though there is a sourceKBRIdentifier
#272 opened by SvenLieber
Correctly display organization region in the sheets all-orgs and org-contributors
#277 opened by SvenLieber
Add column to translation sheet indicating whether a translation is by a person who, in the corpus, is both an author and a translator
#278 opened by SvenLieber
Annotate manifestations that are part of one of the more specific BELTRANS genres
#279 opened by SvenLieber
Parsing XML in a streaming fashion leads to unexpected results when clearing the root to save RAM
#274 opened by SvenLieber
Investigate the integration of person records from Unesco because the wrong person records are merged
#262 opened by SvenLieber
Add all translators (and other contributor roles) from correlation list and data sources
#267 opened by SvenLieber
Only show the name of a contributor in the contributor column of the translation sheet once
#269 opened by SvenLieber
Only show one source title, especially if we have a single sourceKBRIdentifier
#261 opened by SvenLieber
Fix display of a single manually curated date: one date should be shown instead of all date values across data sources including the manually curated date
#260 opened by SvenLieber
Query and use schema:name for persons in the corpus, rdfs:label is less informative
#264 opened by SvenLieber
Incomplete query log for data integration queries
#265 opened by SvenLieber
Adapt RML mapping to correctly link translation URIs with contributor URIs, currently there is a link to a literal
#263 opened by SvenLieber
The column targetYearOfPublication no longer shows conflicting date strings such as "2013 or 2014"
#255 opened by SvenLieber
Perform data integration based on Work-Set clustering algorithm instead of heavy SPARQL queries to drastically reduce runtime
#234 opened by SvenLieber
Improve performance of CSV creation by replacing large monolithic SPARQL query
#257 opened by SvenLieber
Missing `bf:identifiedBy` with local identifier leads to missing integrated data
#256 opened by SvenLieber
Processing of identified publishers is broken
#249 opened by SvenLieber
Enrich organization contributor list with country
#251 opened by SvenLieber
Some contributors from the translations sheet are missing in the sheet of person contributors
#248 opened by SvenLieber
Show also the role "adaptor" and have an additional column "translator/adapter"
#244 opened by SvenLieber
Adapt order of columns for Excel corpus version
#243 opened by SvenLieber
Add a sheet with the oldest manifestation per cluster to ease data cleaning
#242 opened by SvenLieber
Add year of publication to integrated data
#245 opened by SvenLieber
Process multilingual org labels and imprint information from org correlation list
#240 opened by SvenLieber
Add contributor check for automatically detected originals to avoid wrong links
#235 opened by SvenLieber
Add new contributor column that is the union of authors and scenarists to ease data analysis for comics
#226 opened by SvenLieber
Don't display the long Unesco identifier anymore to ease manual curation: use the short identifier instead
#224 opened by SvenLieber
Add missing original's data to the corpus CSV: year of publication and geo-related information
#225 opened by SvenLieber
Use automatically detected translation-source links when creating integrated original data
#230 opened by SvenLieber
Make dataprofile SPARQL query usable again by rewriting it (avoiding OPTIONAL statements)
#223 opened by SvenLieber
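
Issue #274 above concerns streaming XML parsing where clearing the root to save RAM gives unexpected results. The issue title does not state the cause, so the following is only a minimal sketch of one common way to keep memory bounded with Python's standard `xml.etree.ElementTree.iterparse`. It is not the project's code: the function name `iter_records`, the `record` and `title` tags, and the `id` attribute are hypothetical. The idea is to read and clear only when a complete record has ended, because clearing on every `end` event can empty child elements before their enclosing record is processed.

```python
import io
import xml.etree.ElementTree as ET


def iter_records(source, record_tag):
    """Yield (id, title) for each record element while keeping memory bounded.

    Hypothetical helper: tag and attribute names are assumptions for illustration.
    """
    context = ET.iterparse(source, events=("start", "end"))
    # The first event is the start of the document root; keep a handle on it.
    _, root = next(context)
    for event, elem in context:
        # Only act once a whole record is complete, never on its children.
        if event == "end" and elem.tag == record_tag:
            yield elem.get("id"), elem.findtext("title")
            # The record's values were already extracted, so the finished
            # records attached to the root can now be dropped.
            root.clear()


# Usage with a small in-memory document (sample data is made up).
sample = (
    b"<records>"
    b"<record id='1'><title>A</title></record>"
    b"<record id='2'><title>B</title></record>"
    b"</records>"
)
records = list(iter_records(io.BytesIO(sample), "record"))
```

Extracting plain values before clearing means nothing yielded to the caller still refers to an element that is later emptied.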