crawl_subtitle

Code for crawling the highest-voted subtitles from https://subscene.com/

Installation

pip install -r requirements.txt

Crawling Steps

1. Crawl the genres and popular film names from Rotten Tomatoes

python src/01_crawl_style_and_film_names.py

2. Crawl the links to well-rated Vietnamese subtitles (vietsub) for each film from Subscene

python src/02_crawl_best_vietsub_url.py

3. Choose the best subtitle (by score) and download it with wget

python src/03_download_best_vietsub.py
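
The selection in step 3 might look roughly like the sketch below. It is not taken from src/03_download_best_vietsub.py: the record layout (`url`, `score`), the function names `pick_best`, `build_wget_command` and `download`, and the example.com URLs are all assumptions made for illustration.

```python
import subprocess
from typing import Dict, List, Optional


def pick_best(candidates: List[Dict]) -> Optional[Dict]:
    """Return the candidate with the highest score, or None if there are none.

    Each candidate is assumed (hypothetically) to be a dict with
    "url" and "score" keys.
    """
    if not candidates:
        return None
    return max(candidates, key=lambda c: c["score"])


def build_wget_command(url: str, out_path: str) -> List[str]:
    """Build the wget argument list; -O sets the output file path."""
    return ["wget", "-O", out_path, url]


def download(url: str, out_path: str) -> None:
    """Run wget and raise CalledProcessError if it exits with an error."""
    subprocess.run(build_wget_command(url, out_path), check=True)


if __name__ == "__main__":
    # Hypothetical crawl results for a single film.
    candidates = [
        {"url": "https://example.com/sub_a.zip", "score": 3},
        {"url": "https://example.com/sub_b.zip", "score": 7},
    ]
    best = pick_best(candidates)
    if best is not None:
        # Print the command instead of running it, so the sketch needs no network.
        print(" ".join(build_wget_command(best["url"], "best_vietsub.zip")))
```

Passing the command to `subprocess.run` as a list avoids shell quoting problems when a URL contains special characters.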

Author