/nlp_readability

Analysis of a large sample (n > 8000) of articles across five topic tags (Design, Technology, Education, Politics, and Life) published between August 2015 and April 2016. Data acquisition, cleaning, tokenization with NLTK, New Dale-Chall readability scoring, and visualizations were performed with Spark, Python, and R on Databricks.
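
For readers unfamiliar with the scoring step, the following is a minimal sketch of the New Dale-Chall formula, not the code used in this repository. It assumes a tiny hypothetical `FAMILIAR_WORDS` set in place of the full Dale-Chall list of roughly 3,000 familiar words, and it uses simple regex splitting as a stand-in for NLTK tokenization so that it runs without downloading any corpora. The function names are illustrative only.

```python
import re

# Hypothetical stand-in for the Dale-Chall familiar word list (about 3,000 words).
FAMILIAR_WORDS = {
    "the", "a", "and", "it", "was", "on", "to",
    "cat", "dog", "sat", "ran", "mat",
}


def split_sentences(text):
    """Naive sentence splitter (assumption: a stand-in for NLTK's sent_tokenize)."""
    return [s for s in re.split(r"[.!?]+", text) if s.strip()]


def tokenize_words(text):
    """Naive lowercase word tokenizer (assumption: a stand-in for NLTK's word_tokenize)."""
    return re.findall(r"[a-z']+", text.lower())


def new_dale_chall(text, familiar=FAMILIAR_WORDS):
    """Return the New Dale-Chall readability score for `text`."""
    sentences = split_sentences(text)
    words = tokenize_words(text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")

    # Percentage of words that are not on the familiar list.
    difficult = [w for w in words if w not in familiar]
    pct_difficult = 100.0 * len(difficult) / len(words)

    # Average sentence length in words.
    avg_sentence_len = len(words) / len(sentences)

    score = 0.1579 * pct_difficult + 0.0496 * avg_sentence_len
    # The formula adds a constant when more than 5% of words are difficult.
    if pct_difficult > 5.0:
        score += 3.6365
    return score
```

Calling `new_dale_chall("The cat sat on the mat. The dog ran to the cat.")` scores a text with no difficult words, so only the sentence-length term contributes. On a real corpus, the familiar word list and the tokenizer would need to be swapped for the full list and NLTK respectively.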
