Support ALTO 4.0
zuphilip opened this issue ยท 6 comments
The new version ALTO 4.0 was recently released: https://github.com/altoxml/documentation/wiki/v4.0-Changes and any support here would be great. The validation with the XSD http://www.loc.gov/standards/alto/v4/alto-4-0.xsd should be straight forward to do. CC @cneud
Super quick reaction...but maybe too quick ๐
It seems there has been a little bit of confusion with both https://www.loc.gov/standards/alto/v4/alto.xsd and https://github.com/altoxml/schema/blob/master/v4/alto-4-0.xsd still containing the "draft" remark and wrong release date in the header section. Will clarify and rectify with the board members asap!
Adding issues is always quick, but solving them might need more time;-)
I checked a little further and it seems that we may already automatically be up-to-date for the ALTO schemas, because we clone your git repo:
ocr-fileformat/vendor/Makefile
Lines 94 to 95 in 5367a7a
Validation will be up-to-date with the upstream ALTO schema repo if you build from source.
I can prepare a new release with tarball including ALTO 4 xsd once it's ready for distribution.
Would now be a good point to make a new release including ALTO 4.0?
From an ALTO point of view, the release 4.1 fixed a few bugs with v4.0 and can be considered fairly stable.
With v0.3.0 we support now validation of ALTO 4.0 and 4.1 as well as transformations from and to the new ALTO versions. Thus, I would say that we can close this issue now. However, note that AFAIK we are treating the different ALTO versions more or less the same. If any new feature of ALTO 4.0 or 4.1 would be interesting to support in any transformation, then I suggest to open a new specific issue about this.