# news-categorizer-post
News classification and insights combining MonkeyLearn and Kimono
This post shows you how to combine Kimono and MonkeyLearn to scrape news sites and analyze articles, in order to gain insights into the different news outlets as well as into what's happening around us.
This repository includes:
- `stitch.py` - a script that takes the raw data downloaded from Kimono and outputs a JSON file with cleaner data to work with.
- `*_raw.json` - the raw data from Kimono that serves as input to `stitch.py` and that was used to write the post. The processed data is also included, so you don't need to process the raw data yourself.
- `notebook/`
  - `*.json` - the data to analyze.
  - `News.ipynb` - an IPython notebook that classifies, processes and graphs the data.
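
To give a rough idea of what turning raw Kimono output into "nicer" JSON can involve, here is a minimal sketch. It is not the actual script from this repository. The function name `flatten_kimono_results` is hypothetical, and the input layout (a `results` object holding named collections of rows, with scraped links stored as objects carrying `text` and `href`) is an assumption about the raw files, not something this README specifies.

```python
import json


def flatten_kimono_results(raw):
    """Flatten raw scraper output into a flat list of article records.

    Assumed input layout (not confirmed by this repository):
        {"results": {"<collection name>": [row, row, ...]}}
    """
    articles = []
    for collection, rows in raw.get("results", {}).items():
        for row in rows:
            record = {"collection": collection}
            for key, value in row.items():
                if isinstance(value, dict):
                    # Assumption: link fields look like {"text": ..., "href": ...}.
                    # Keep the text under the original key, the link under "<key>_url".
                    record[key] = value.get("text", "")
                    if "href" in value:
                        record[key + "_url"] = value["href"]
                else:
                    record[key] = value
            articles.append(record)
    return articles


if __name__ == "__main__":
    # Made-up sample data, only for illustration.
    sample = {
        "results": {
            "collection1": [
                {
                    "title": {"text": "Example headline", "href": "http://example.com/a"},
                    "body": "Example article text.",
                }
            ]
        }
    }
    print(json.dumps(flatten_kimono_results(sample), indent=2))
```

Flat records like these are easier to pass to a classifier and to load into the notebook than nested scraper output, which is presumably why a processing step sits between the raw and analyzed data.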
To run the notebook:
- Clone this repository.
- Navigate to the root directory of the repository.
- Install the dependencies with `pip install -r requirements.txt`.
- Navigate to the `notebook` directory.
- Run `ipython notebook`.