Each file-level object should have a unique identifier

Question

Closed this issue 6 years ago · 2 comments

In the new_manifest_proposal.json file, it would be useful if each file were assigned a unique identifier, or if a formula were specified for calculating one (such as the MD5 hash of the collection, path, and filename).

This identifier would be useful for indexing.
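
To illustrate the kind of formula meant here, a minimal sketch in Python follows. The function name `file_object_id`, the NUL separator, and the example values are assumptions made for illustration; none of them come from new_manifest_proposal.json.

```python
import hashlib


def file_object_id(collection: str, path: str, filename: str) -> str:
    """Return an MD5 hex digest derived from collection, path, and filename.

    Hypothetical helper: the manifest proposal does not define this formula.
    """
    # Join with a NUL byte so that ("a/b", "c") and ("a", "b/c") cannot
    # collapse into the same input string before hashing.
    key = "\x00".join((collection, path, filename))
    return hashlib.md5(key.encode("utf-8")).hexdigest()


# Example values are made up for illustration.
example_id = file_object_id("example-collection", "data/images", "scan_001.tif")
```

The same inputs always yield the same 32-character hexadecimal string, so an indexer could recompute the identifier instead of storing it.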

Answer 1 · 2018-06-28T18:08:06.000Z

I don't think it would be sensible to create and maintain an ID solely for indexing, since one can be generated in other ways (trivially, from depositor/collection/path/filename, as you point out).

Answer 2 · 2018-08-09T14:16:31.000Z

The current proposal emphasizes packages and unique package identifiers as the key to connecting data (packages) in archival storage and metadata (about packages and their contents) in ArchivesSpace. The identifier for a file-level object will be a combination of the package_id and path/filename for the file.
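
As a rough sketch of that combination (the proposal does not fix a separator or a normalization rule, so the `/` delimiter, the path cleanup, and the name `file_identifier` are assumptions):

```python
from posixpath import join, normpath


def file_identifier(package_id: str, path: str, filename: str) -> str:
    """Combine a package identifier with a file's path and name.

    Hypothetical helper: the "/" separator and the normalization step
    are illustrative choices, not part of the proposal.
    """
    # Normalize so "./data/" and "data" produce the same identifier,
    # and drop any leading slash so the path stays relative to the package.
    relative = normpath(join(path, filename)).lstrip("/")
    return f"{package_id}/{relative}"


# Example values are made up for illustration.
print(file_identifier("pkg-0001", "data/images", "scan_001.tif"))
# prints: pkg-0001/data/images/scan_001.tif
```

Because the package_id is already unique, the combined string is unique as long as no two files in a package share the same path and filename.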