YouTube archiving made simple.
Installation ยท Managing your Archive ยท Viewing your Archive
Yark lets you continuously archive all videos and metadata for YouTube channels. You can also view your archive as a seamless offline website โจ
To install Yark, simply download Python 3.9+ and run the following:
$ pip3 install yark
Once you've installed Yark, think of a name for your archive and copy the target's url:
$ yark new owez https://www.youtube.com/channel/UCSMdm6bUYIBN0KfS2CVuEPA
Now that you've created the archive, you can tell Yark to download all videos and metadata using the refresh command:
$ yark refresh owez
Once everything has been downloaded, Yark will automatically give you a status report of what's changed since the last refresh:
Viewing you archive is easy, just type view
with your archives name:
$ yark view owez
This will pop up an offline website in your browser letting you watch all videos ๐
Under each video is a rich history report filled with timelines and graphs, as well as a noting feature which lets you add timestamped and permalinked comments ๐
Light and dark modes are both available and automatically apply based on the system's theme.
Here are some things to keep in mind when using Yark; the good and the bad:
- Don't create a new archive again if you just want to update it, Yark accumulates all new metadata for you via timestamps
- Feel free to suggest new features via the issues tab on this repository
- Scheduling isn't a feature just yet, please use
cron
or something similar!
The archive format itself is simple and consists of a directory-based structure with a core metadata file and all thumbnail/video data in their own directories as typical files:
[name]/
โ Your self-contained archiveyark.json
โ Archive file with all metadatayark.bak
โ Backup archive file to protect against data damagevideos/
โ Directory containing all known videos[id].mp4
โ Files containing video data for YouTube videos
thumbnails/
โ Directory containing all known thumbnails[hash].png
โ Files containing thumbnails with it's BLAKE2 hash
It's best to take a few minutes to familiarize yourself with your archive by looking at files which look interesting to you in it, everything is quite readable.