theupdateframework/python-tuf

Investigate metadata scalability

trishankkarthik opened this issue · 5 comments

How does the implementation plan to handle metadata for a software update repository with a large number of targets and target delegations? At present, it looks like the uncompressed metadata will grow quite large once the number of targets and delegations is big enough.

A few solutions:

  1. Compress metadata with standard techniques (e.g. gzip); a rough sketch follows this list.
  2. Investigate metadata difference schemes.
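
As a rough illustration of option 1, the sketch below builds a synthetic targets dictionary (the field layout only loosely imitates TUF targets metadata) and compares its raw JSON size against gzip, using only the Python 3 standard library:

```python
import gzip
import json

# Synthetic targets metadata with many entries; the field layout here only
# loosely imitates TUF targets metadata and is not meant to be exact.
targets = {
    "targets/pkg-%05d.tar.gz" % i: {
        "length": 1024 + i,
        "hashes": {"sha256": "%064x" % i},
    }
    for i in range(10000)
}
metadata = {"_type": "Targets", "version": 1, "targets": targets}

raw = json.dumps(metadata).encode("utf-8")
compressed = gzip.compress(raw)  # gzip.compress() requires Python 3.2+

print("uncompressed: %d bytes" % len(raw))
print("gzip:         %d bytes (%.1f%% of original)"
      % (len(compressed), 100.0 * len(compressed) / len(raw)))
```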

#44 will give us some data about this issue.

Things we need to do efficiently: download only the subset of targets metadata relevant to the target file in question, and download as much as possible in as few HTTP requests as possible.
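
One possible shape for the first point is a client-side delegation walk that only downloads roles whose path patterns can match the requested target. This is only a sketch: `fetch_metadata` is a hypothetical stand-in for the client's real download-and-verify step, and the field names loosely follow TUF targets/delegations metadata.

```python
import fnmatch

def fetch_metadata(rolename):
    """Hypothetical stand-in for downloading and verifying <rolename>.json;
    the real client would return the parsed, signature-checked role here."""
    raise NotImplementedError

def find_target(target_path, rolename="targets"):
    """Walk the delegation tree, downloading only roles whose path patterns
    could contain target_path, and return its file info (or None)."""
    role = fetch_metadata(rolename)

    # The target may be listed directly in this role.
    if target_path in role.get("targets", {}):
        return role["targets"][target_path]

    # Otherwise recurse only into delegations whose patterns match,
    # skipping every other branch (and every other HTTP request).
    for delegated in role.get("delegations", {}).get("roles", []):
        if any(fnmatch.fnmatch(target_path, p) for p in delegated.get("paths", [])):
            info = find_target(target_path, delegated["name"])
            if info is not None:
                return info
    return None
```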

See #57 for a method to reduce metadata size in the common case where a delegated role is trusted with wildcard target paths.
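
Roughly, the saving there comes from the parent role trusting the delegated role for a path pattern instead of enumerating every delegated path. A toy comparison (the field names are only illustrative of a TUF delegation entry):

```python
import json

# Every delegated path enumerated explicitly in the parent role:
explicit = {
    "name": "django",
    "paths": ["targets/django/django-1.%02d.tar.gz" % i for i in range(100)],
}

# The same trust expressed with a single wildcard pattern:
wildcard = {
    "name": "django",
    "paths": ["targets/django/*"],
}

print(len(json.dumps(explicit)), "vs", len(json.dumps(wildcard)), "bytes")
```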

Maybe consider binary data exchange formats, such as Protocol Buffers or Cap'n Proto.
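
As a very rough size intuition (this uses a hand-rolled `struct` encoding as a stand-in, not Protocol Buffers or Cap'n Proto themselves), a single target entry packed as fixed-width binary is much smaller than its JSON text:

```python
import json
import struct

length = 4096
sha256 = bytes.fromhex("ab" * 32)

# The same entry as JSON text...
as_json = json.dumps({"length": length,
                      "hashes": {"sha256": sha256.hex()}}).encode("utf-8")

# ...and as a fixed-width binary record: 8-byte length plus the raw 32-byte digest.
as_binary = struct.pack("<Q32s", length, sha256)

print("JSON:   %d bytes" % len(as_json))
print("binary: %d bytes" % len(as_binary))
```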

The tentatively named "lazy bin walk" scheme to address metadata scalability is discussed in our design document for PyPI+TUF.
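
The core of the hashed-bin idea there is that targets are delegated to one of many bins keyed by a hash prefix of the target path, so a client only fetches the single small bin that could list its target. A minimal sketch, assuming 4096 bins and SHA-256 over the target path (the actual parameters and naming are whatever the design document specifies):

```python
import hashlib

# Assumed here for illustration; the PyPI+TUF design document fixes the
# real bin count and naming convention.
BIN_PREFIX_LEN = 3                 # hex digits of the hash used as the bin name
NUM_BINS = 16 ** BIN_PREFIX_LEN    # 4096 bins

def bin_for_target(target_path):
    """Map a target path to the hashed-bin role that would list it."""
    digest = hashlib.sha256(target_path.encode("utf-8")).hexdigest()
    return "bin-" + digest[:BIN_PREFIX_LEN]

# A client only downloads the one (small) bin that can contain its target,
# instead of targets metadata covering every project on the repository.
print(bin_for_target("targets/packages/Django-1.5.tar.gz"))
```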