/repository-importer

Script to get all archival records for a repository and build an importable JSON file

Primary LanguageRuby

repository-importer

Script to get all archival records for a repository and build an importable JSON file.

Version

It is recommended that source and target instances of ArchivesSpace use the same version.

Currently tested version: v3.2.0.

Setup

Install Java & Rbenv.

Then pull all of the project dependencies:

./resolve_deps.sh

Reset

Use ./cleanup_deps.sh to remove all downloaded dependencies then rerun setup.

Export usage

./export.sh <source backend URL> <source repo id> <username> <password>

Testing export

A convenient way of testing the export process is to target one of the public ArchivesSpace test servers:

# download the "YNHSC" repository from test.archivesspace.org
./export.sh https://test.archivesspace.org/staff/api 2 admin admin
# download the "ATTracer" repository from test.archivesspace.org
./export.sh https://test.archivesspace.org/staff/api 3 admin admin
# download the "Space" repository from test.archivesspace.org
./export.sh https://test.archivesspace.org/staff/api 4 admin admin
# download the "test" repository from test.archivesspace.org
./export.sh https://test.archivesspace.org/staff/api 5 admin admin

The json output will be saved to the export folder as exported_<source repo id>.json.

Note: test.archivesspace.org is deployed from the ASpace master branch and therefore should only be used with an ASpace src destination for import.

Import usage

./import.sh <target backend URL> <target repo id> <username> <password> <file to import>

Testing import

A convenient way of testing the import process is to run a local instance of ArchivesSpace using Docker:

docker-compose up --detach

Note: the Docker instance uses a release version of ASpace and should only be used to import data from a source using the same version.

Wait for ArchivesSpace to start then create a new repository:

./create_repository.sh <target backend url> <username> <password> <repository code>
./create_repository.sh http://localhost:8089 admin admin "test"

If that command is successful it will create a new repository and output the response. Pay attention to the id:

{"status"=>"Created", "id"=>2, "lock_version"=>0, "stale"=>true, "uri"=>"/repositories/with_agent/2", "warnings"=>[]}

Note: if you're creating one repository in an empty ArchivesSpace the id will be 2.

You can also view the list of available repositories:

./list_repositories.sh http://localhost:8089 admin admin

Next, proceed to import:

# import into local "test" repository
./import.sh http://localhost:8089 $repo_id admin admin exported_2.json
./import.sh http://localhost:8089 2 admin admin exported_2.json

Note: the import file is expected to be in the exports folder. There is no intrinsic relationship between the repository ids for source and destination.

Records should have been created in the local repository.

All-in-one tester

Using ASPace src example. Setup and run ASpace from src, when ready:

./create_repository.sh http://localhost:4567 admin admin "test_2"
./create_repository.sh http://localhost:4567 admin admin "test_3"
./create_repository.sh http://localhost:4567 admin admin "test_4"
./create_repository.sh http://localhost:4567 admin admin "test_5"

./export.sh https://test.archivesspace.org/staff/api 2 admin admin
./export.sh https://test.archivesspace.org/staff/api 3 admin admin
./export.sh https://test.archivesspace.org/staff/api 4 admin admin
./export.sh https://test.archivesspace.org/staff/api 5 admin admin

./import.sh http://localhost:4567 2 admin admin exported_2.json
./import.sh http://localhost:4567 3 admin admin exported_3.json
./import.sh http://localhost:4567 4 admin admin exported_4.json
./import.sh http://localhost:4567 5 admin admin exported_5.json