A one stop Python script that lets you go from Google Takeout's terrible dataset to one populated with video metadata, keywords, and channel insights
-
Go to (Google Takeout)[https://takeout.google.com/settings/takeout]
-
Scroll to the bottom and check YouTube and YouTube Music
-
Click All data included and only select history
-
Next Step -> Create Export
-
Download the .zip from Gmail
-
Run the following:
git clone https://github.com/tyler-keller/sadge-pub.git`
-
Unpack the .zip and move the watch-history.html file to the ./sadge-pub/data/ directory
-
Run the following:
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
python3 sadge_scraper.py
Note: length of time to completion subject to watchtimes and Google's rate limiter.