webrecorder/py-wacz

Instructions for creating a WACZ from a Browsertrix crawl

despens opened this issue · 0 comments

A Browsertrix crawl usually contains all the information required to create a WACZ; in particular, the extracted text and the pages metadata are already present. Is it possible to use that data to create the WACZ?

(Context: the disk filled up, so Browsertrix exited after completing the crawl and left an incomplete WACZ file. All the data is already available; it just needs to be compiled into a WACZ.)
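
To make the question concrete, here is a rough sketch of what I have in mind. It only builds a `wacz create` command line and does not run it. The directory layout (`archive/` for the WARCs, `pages/pages.jsonl` for the page list), the helper name `build_wacz_command`, and the output filename are assumptions on my part, and I am not sure that passing the existing pages file with `-p` is the intended way to reuse the crawl's text and pages metadata.

```python
from pathlib import Path


def build_wacz_command(collection_dir, output="recovered.wacz"):
    """Build (but do not run) a `wacz create` argument list for a crawl directory.

    Hypothetical helper: the layout below is an assumption about how the
    crawl output is organised, not something confirmed by py-wacz.
    """
    collection = Path(collection_dir)

    # Assumed layout: WARC files live directly under <collection>/archive/.
    warcs = sorted(str(p) for p in (collection / "archive").glob("*.warc*"))
    if not warcs:
        raise FileNotFoundError(f"no WARC files found under {collection / 'archive'}")

    cmd = ["wacz", "create", "-o", str(output)]

    # Assumed layout: the crawler's page list is <collection>/pages/pages.jsonl.
    pages = collection / "pages" / "pages.jsonl"
    if pages.is_file():
        cmd += ["-p", str(pages)]

    # The WARC inputs go last, as positional arguments.
    cmd += warcs
    return cmd
```

If that is roughly right, the returned list could be passed to `subprocess.run(cmd, check=True)` once there is enough free disk space for the output file.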