/zml2lido

xslt that transforms Zetcom's xml to LIDO plus some Python glue to make things easy

Primary LanguageXSLTGNU General Public License v3.0GPL-3.0

zml2lido - Xslt that rewrites ZML to Lido

"zml" refers to Zetcom's "generic" xml format that comes out of MpRia's API.

Requires

  • lxml
  • Saxon. We're now using saxonche from pypi. Install with

pip install saxonche

The mapping is partly specific to the RIA installation of the SPK, but might inspire others.

Includes the following scripts

  • lido
  • lvalidate
  • saxon

Usage

  • -j picks a job e.g. smb or ohneLit
  • -i provide the xml source (required)
  • -c expect one or multiple zipped cache files as input
  • -f force; overwrite most files created during previous runs (except relWorks cache)

lido -j smb -i path/to/generic_xml

New

In previous versions, the script had to be executed in one particular directory for it to work correctly. Now it can be executed anywhere and the results are always saved in sdata/{label}/{date}.

Common Jobs

job 'smb' does the following steps

  1. convert to LIDO (lvl1)
  2. filter out lido records that are not published
  3. filter out relatedWorks tht are not pubished (except Literature)
  4. validate LIDO
  5. split

job 'dd' does the following (meant for debugging)

  1. convert to LIDO (lvl1)
  2. validate LIDO

Common Errors

todo