/waybacktweets

Archived tweets on Wayback Machine in an easy way

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Important

If the application is down, please check the Streamlit Cloud Status.


🏛️ Wayback Tweets

Streamlit App GitHub release (latest by date including pre-releases) License

Tool that displays multiple archived tweets on Wayback Machine to avoid opening each link manually. Via Wayback CDX Server API.

Thanks Tristan Lee for the idea.

Features

  • 30 tweets per page
  • Filtering by only deleted tweets

Development

Requirement

  • Python 3.8+

Installation

$ git clone git@github.com:claromes/waybacktweets.git

$ cd waybacktweets

$ pip install -r requirements.txt

$ streamlit run app.py

Streamlit will be served at http://localhost:8501

Bugs

  • "web.archive.org took too long to respond."
  • only_deleted checkbox selected for handles without deleted tweets
  • Pagination: set session variable on first click
  • Pagination: scroll to top
  • IndexError
  • Timeout error

Roadmap

  • Pagination
    • Footer
    • Disabled/ Empty
  • Feedbacks
  • Download dataset
  • Review data cache
  • Changelog
  • Prevent duplicate URLs
  • Range size defined by user
  • parse_links exception
  • Add current page to page title
  • Parse MIME type warc/revisit
  • Parse MIME type text/plain
  • Filter by period/datetime
  • Apply filters by API endpoints
  • Add contributing guidelines