A Python script that builds a dialogue dataset from the transcripts at https://bigbangtrans.wordpress.com/. It requires beautifulsoup4 and requests. The resulting structure is {episode_name: [{character: utterance}, {character: utterance}, ...], ...}. Install the dependencies with:
pip install beautifulsoup4
pip install requests
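
As a rough illustration of the structure described above, here is a minimal sketch of how transcript lines could be turned into {episode_name: [{character: utterance}, ...]}. It is not the script itself: it uses only the standard library and skips the download step that requests and beautifulsoup4 would handle. The names parse_lines and build_dataset, the sample episode, and the "Character: utterance" line format are assumptions made for the example.

```python
import json
import re

# Assumption: a spoken line looks like "Character: utterance".
# Lines without a speaker prefix (e.g. stage directions) are skipped.
LINE_PATTERN = re.compile(r"^\s*([^:]{1,40}?)\s*:\s*(.+?)\s*$")


def parse_lines(lines):
    """Turn raw transcript lines into a list of {character: utterance} dicts."""
    dialogue = []
    for line in lines:
        match = LINE_PATTERN.match(line)
        if match is None:
            continue  # no "Character:" prefix, so not treated as dialogue
        character, utterance = match.groups()
        dialogue.append({character: utterance})
    return dialogue


def build_dataset(episodes):
    """Map each episode name to its parsed dialogue list."""
    return {name: parse_lines(lines) for name, lines in episodes.items()}


# Hypothetical input: in the real script these lines would come from the
# paragraph text of each downloaded episode page.
sample = {
    "Example Episode": [
        "Alice: Hello.",
        "(They walk off.)",
        "Bob: Hi there.",
    ]
}

dataset = build_dataset(sample)
print(json.dumps(dataset, indent=2))
```

Each utterance is stored as its own single-key dict, so the order of the conversation is preserved and the same character can speak many times in one episode.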