NatLibFi/bib-rdf-pipeline

Create single file MARCXML distribution

osma opened this issue · 0 comments

osma commented

Currently the fennica-marc.zip file is created from the individual .mrcx slices which are the same files that are fed to marc2bibframe2 for conversion to BIBFRAME.

Instead we should create a separate, single file MARCXML distribution (fennica.mrcx) that contains all the MARC records. For now, the same preprocessing and stripping of personal information should be applied as for the mrcx slices. However, in future the processing for the mrcx slices may become a bit different than for the single file case, especially when cleanups from COMHIS are integrated into the pipeline.