webrecorder/specs

[Use Case]: Documenting disinformation campaigns

edsu opened this issue · 1 comments

edsu commented

Describe a use case for WACZ format.

A disinformation platform deploys WACZ to help informants produce evidence of coordinated international propaganda campaigns. WACZ is provided as one of many options for “tip submission,” and may be submitted alongside any other form of evidence (screenshot, photograph, text description, blobs shared between apps, etc).

Additional Requirements

  • List of entry pages to start browsing from
  • Full-text search index
  • Technical metadata about the web archive
  • User-defined descriptive metadata
  • Screenshots of key pages
  • Encryption of data
  • Proof of Authenticity (Signing and Verification)
  • Fast access to multiple WACZ files in aggregate
  • Crawl or capture logs

How will web archives be created for this use case?

  • Manually, using a browser to capture exact content as directed by the user.
  • Automatically, using a crawler to crawl desired content, either once or on a specified schedule.

Sensitive private content and access

  • No, this use case focuses on archiving publicly accessible data only, and web archive can be made public.
  • No, this use case focuses on archiving publicly data only, but web archive is not inteded to be public.
  • Yes, this use case involves archiving data that is not public, and the web archive should not be made public.
edsu commented

This has been added to the current use cases document.