- Fetched the novel Moby Dick from Project Gutenberg (a website hosting a large corpus of books) using the Python package requests.
- Extracted the words from the downloaded web page using BeautifulSoup.
- Analyzed the distribution of those words using the Natural Language Toolkit (nltk).
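
The three steps above can be sketched roughly as follows. This is a minimal illustration, not the original project code: the URL constant, the function names (`fetch_html`, `extract_text`, `word_frequencies`, `top_words`), and the choice of a simple `\w+` regular-expression tokenizer are all assumptions made for the example.

```python
import requests
from bs4 import BeautifulSoup
from nltk import FreqDist
from nltk.tokenize import RegexpTokenizer

# Assumed location of the HTML edition of Moby Dick on Project Gutenberg.
MOBY_DICK_URL = "https://www.gutenberg.org/files/2701/2701-h/2701-h.htm"


def fetch_html(url):
    """Download a page with requests and return its HTML as a string."""
    response = requests.get(url, timeout=30)
    response.raise_for_status()  # fail loudly on HTTP errors
    return response.text


def extract_text(html):
    """Parse the HTML with BeautifulSoup and return its text content."""
    return BeautifulSoup(html, "html.parser").get_text()


def word_frequencies(text):
    """Split lowercased text into alphanumeric tokens and count them with nltk."""
    tokens = RegexpTokenizer(r"\w+").tokenize(text.lower())
    return FreqDist(tokens)


def top_words(url, n=10):
    """Chain the three steps and return the n most frequent tokens."""
    return word_frequencies(extract_text(fetch_html(url))).most_common(n)
```

Calling `top_words(MOBY_DICK_URL)` would download the book and return its ten most frequent tokens as `(word, count)` pairs. Common words such as articles and pronouns are not filtered out in this sketch, so they will likely dominate the result.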