kbrbe/beltrans-data-integration

Creating a FAIR Linked Data corpus for the BELTRANS research project about Belgian book translations NL-FR and FR-NL between 1970 and 2020

Jupyter NotebookMIT

Issues

Add SPARQL UPDATE query to change contributor roles of hardcoded list
#287 opened a month ago by SvenLieber
0
Show the complete source title in the corpus Excel (title : subtitle)
#286 opened a month ago by SvenLieber
0
Add columns necessary for gender analysis to the translation sheet
#284 opened 4 months ago by SvenLieber
1
Identify KBR identifiers of originals by title lookup (or similarity) also from the correlation list
#282 opened 5 months ago by SvenLieber
2
Use generic XML extraction script for ISNI persons and organizations
#281 opened 5 months ago by SvenLieber
0
Process contributors from the translation correlation sheet
#228 opened 6 months ago by SvenLieber
0
Process additional columns from the translation correlation list
#254 opened 6 months ago by SvenLieber
0
Missing sourcePublisherIdentifier even though there is a sourceKBRIdentifier
#272 opened 6 months ago by SvenLieber
1
Fetching also KBR data if there are less identifiers than the batch size
#275 opened 6 months ago by SvenLieber
2
Correctly display organization region in the sheets all-orgs and org-contributors
#277 opened 6 months ago by SvenLieber
0
Add column to translation sheet indicating if a translation is from a person who - in the corpus- is an author and a translator
#278 opened 6 months ago by SvenLieber
0
Enrich person contributor list with language of person
#250 opened 6 months ago by SvenLieber
0
Process KBR authority records that indicate the type several times
#276 opened 6 months ago by SvenLieber
0
Annotate manifestations that are part of one of the more specific BELTRANS genres
#279 opened 6 months ago by SvenLieber
0
Distinguish person from orgs when extracting KBR authority data
#273 opened 6 months ago by SvenLieber
0
Parsing XML in a streaming fashion leads to unexpected results when clearing the root to save up RAM
#274 opened 6 months ago by SvenLieber
1
Avoid empty row at the end of Excel sheets to fix filter issue
#259 opened 7 months ago by SvenLieber
0
Investigate the integration of person records from Unesco because wrong persons records are merged
#262 opened 7 months ago by SvenLieber
1
Add all translators (and other contributor roles) from correlation list and data sources
#267 opened 7 months ago by SvenLieber
0
Only show the name of a contributor in the contributor column of the translation sheet once
#269 opened 7 months ago by SvenLieber
0
Only show one source title, especially if we have a single sourceKBRIdentifier
#261 opened 7 months ago by SvenLieber
0
Fix display of a single manually curated date: one date should be shown instead of all date values across data sources including the manually curated date
#260 opened 7 months ago by SvenLieber
0
Only considered manually curated place of publication
#266 opened 7 months ago by SvenLieber
0
Query and use schema:name for persons in the corpus, rdfs:label is less informative
#264 opened 7 months ago by SvenLieber
0
Incomplete query log for data integration queries
#265 opened 7 months ago by SvenLieber
0
Adapt RML mapping to correctly link translation URIs with contributor URIs, currently there is a link to a literal
#263 opened 7 months ago by SvenLieber
0
The column targetYearOfPublication does no longer show conflicting dates strings such as "2013 or 2014"
#255 opened 8 months ago by SvenLieber
1
Perform data integration based on Work-Set clustering algorithm instead of heavy SPARQL queries to drastically reduce runtime
#234 opened 8 months ago by SvenLieber
0
Improve performance of CSV creation by replacing large monolithic SPARQL query
#257 opened 8 months ago by SvenLieber
0
Include placename checkup at integration statistics
#253 opened 8 months ago by SvenLieber
0
Missing `bf:identifiedBy` with local identifier leads to missing integrated data
#256 opened 8 months ago by SvenLieber
0
Geonames enrichment removes country if no match is found
#252 opened 8 months ago by SvenLieber
1
Processing of identified publishers is broken
#249 opened 8 months ago by SvenLieber
0
Enrich organization contributor list with country
#251 opened a year ago by SvenLieber
0
Check and if necessary fix KBR pipeline for 264-field publishers.
#238 opened a year ago by SvenLieber
2
Some contributors from the translations sheet are missing in the sheet of person contributors
#248 opened a year ago by SvenLieber
1
Show also the role "adaptor" and have an additional column "translator/adapter"
#244 opened a year ago by SvenLieber
0
Adapt order of columns for Excel corpus version
#243 opened a year ago by SvenLieber
1
Add a sheet with the oldest manifestation per cluster to ease data cleaning
#242 opened a year ago by SvenLieber
0
Add year of publication to integrated data
#245 opened a year ago by SvenLieber
1
Process BB genre classification from the translation correlation list
#239 opened a year ago by SvenLieber
0
Process multilingual org labels and imprint information from org correlation list
#240 opened a year ago by SvenLieber
0
Add country code column for nationalities in the person contributor list
#237 opened a year ago by SvenLieber
0
Add contributor check for automatically detected originals to avoid wrong links
#235 opened a year ago by SvenLieber
0
Add new contributor column that is the union of authors and scenarists to ease data analysis for comics
#226 opened a year ago by SvenLieber
0
Don't display the long Unesco identifier anymore to ease manual curation: use the short identifier instead
#224 opened a year ago by SvenLieber
0
Add missing original's data to the corpus CSV: year of publication and geo-related information
#225 opened a year ago by SvenLieber
0
Use automatically detected translation-source links when creating integrated original data
#230 opened a year ago by SvenLieber
1
Update the corpus statistics for quality control of new corpus versions
#227 opened a year ago by SvenLieber
0
Make dataprofile SPARQL query usable again by rewriting it (avoiding OPTIONAL statements)
#223 opened a year ago by SvenLieber
0