wiki_series_spider is a crawler adapted from the WikiEpisodeTableSpider class in karoly-hars/gpt2_episode_summary_generator. The goal with the modified class is to obtain all relevant data of each episode.
See the original README, specifically this section on how to use the Wikipedia spider.
Example:
python3 run_spider.py --start_url https://en.wikipedia.org/wiki/Friends --title_keywords friends --url_substring Friends -o friends_wiki.json
The data will be stored in a json under the data/scraped directory.