/dpres-ipt-deprecated-2019-11-22

This repository is deprecated, please use new repository: https://github.com/Digital-Preservation-Finland/dpres-ipt

Primary LanguagePythonGNU Lesser General Public License v3.0LGPL-3.0

Information Package Tools

This repository contains tools for validating Submission/Archival/Dissemination Information Packages (SIP/AIP/DIP) based on Open Archival Information System (OAIS) standard.

The aim is to provide digital preservation services for culture and research to ensure the access and use of materials long in the future. Documentation and specifications for the digital preservation service can be found in: http://digitalpreservation.fi

Installation

The software is tested with Python 2.7 with Centos 7.x / RHEL 7.x releases. For running in a tested and isolated environment, get python-virtuelenv software:

pip install virtualenv

Run the following to activate the virtual environment:

virtualenv .venv source ./.venv/bin/activate

Additional requirements

The following software is required for validation tools, depending on the file formats in the package:

You can install the software listed in requirements_github.txt with the following command:

pip install -r requirements_github.txt

This may require that gcc is installed in your system.

Other software listed above needs to be installed separately.

Usage

To validate a METS document:

python ipt/scripts/check_xml_schema_features.py <METS document>
python ipt/scripts/check_xml_schematron_features.py -s <schematron_file> <METS document>

See the schematron files from: https://github.com/Digital-Preservation-Finland/dpres-xml-schemas

To validate digital objects in an information package:

python ipt/scripts/check_sip_digital_objects.py <package directory> <linking_type> <linking_value>

Parameters <linking_type> and <linking_value> give values to PREMIS <relatedObjectIdentifierType> and <relatedObjectIdentifierValues> elements in the output. If you are not planning to use these, you may give random strings.

To check fixity of digital objects in an information package:

python ipt/scripts/check_sip_file_checksums.py <package directory>

Copyright

Copyright (C) 2018 CSC - IT Center for Science Ltd.

This program is free software: you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.

You should have received a copy of the GNU Lesser General Public License along with this program. If not, see <https://www.gnu.org/licenses/>.