Repository containing multiple projects regarding NBC's mockumentary 'The Office'.
-- Project status: [Active]
I'm a fan of The Office and I wanted to use related data to dive deeper into the data science field and to experiment with some tools and technologies.
I started by scraping all the quotes from this website, building in this way a dataframe and a csv file.
Next, I've computed some basic exploratory analysis with pandas and seaborn as well as a sentiment analysis with VADER and Power BI.
Contents:
- Web Scraping
Web scraping of all quotes from 'The Office'.
Libraries: requests, BeautifulSoup, Pandas
- Notebooks
Exploratory data analysis of the dataset.
Sentiment analysis of the lines.
├── README.md
├── data
├── web-scraping
└── notebooks
- officequotes.net