
Assorted scripts to process Twitter data for the QC project



  • pandas
  • numpy
  • sqlite3
  • gzip



  • gnip_aggregate.ipynb - aggregates Gnip dumps from a chosen folder into an SQLite file in the same folder. At the moment, it provides only the list of tweets (not users)
  • gnip_aggregate.py - same functionality; can be run from bash with the path to the folder as the only argument
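
The aggregation step described above might look roughly like the sketch below. It is not the code from gnip_aggregate.py. The function name `aggregate_folder`, the `*.json.gz` file pattern, the `tweets.sqlite` file name, the `tweets` table, and the `id`, `body` and `postedTime` fields are all assumptions made for illustration, and the sketch uses only the standard library (gzip, json, sqlite3) rather than pandas.

```python
import gzip
import json
import sqlite3
from pathlib import Path


def aggregate_folder(folder, db_name="tweets.sqlite"):
    """Collect tweets from gzipped dumps in `folder` into one SQLite file.

    Assumes one JSON record per line in files matching *.json.gz (hypothetical
    layout). Returns the database path and the number of stored tweets.
    """
    folder = Path(folder)
    db_path = folder / db_name
    conn = sqlite3.connect(str(db_path))
    try:
        conn.execute(
            "CREATE TABLE IF NOT EXISTS tweets "
            "(id TEXT PRIMARY KEY, body TEXT, postedTime TEXT)"
        )
        for path in sorted(folder.glob("*.json.gz")):
            with gzip.open(path, "rt", encoding="utf-8") as fh:
                for line in fh:
                    line = line.strip()
                    if not line:
                        continue  # skip blank lines
                    record = json.loads(line)
                    if "id" not in record:
                        continue  # skip records that are not tweets
                    # The primary key keeps a tweet seen in two dumps from
                    # being stored twice.
                    conn.execute(
                        "INSERT OR REPLACE INTO tweets VALUES (?, ?, ?)",
                        (
                            str(record["id"]),
                            record.get("body"),
                            record.get("postedTime"),
                        ),
                    )
        conn.commit()
        count = conn.execute("SELECT COUNT(*) FROM tweets").fetchone()[0]
    finally:
        conn.close()
    return db_path, count
```

Calling `aggregate_folder("path/to/dumps")` would write `tweets.sqlite` next to the dumps, matching the "same folder" behaviour described above. Only tweets are stored; user records are left out, as in the current scripts.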