I used BeautifulSoup and the Newspaper library to extract and parse news articles from "https://www.newindianexpress.com", with the goal of predicting their popularity using NLP and a random forest regressor. The dataset used to train the model is described as follows:
Abstract: This dataset summarizes a heterogeneous set of features about articles published by Mashable in a period of two years. The goal is to predict the number of shares in social networks (popularity).
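
As a rough illustration of the workflow described above, the following minimal sketch parses an article page with BeautifulSoup, derives a few simple numeric features, and fits a scikit-learn `RandomForestRegressor`. Everything specific here is an assumption made for illustration: the inline `SAMPLE_HTML`, the helper name `extract_features`, the choice of four features, and the synthetic training data, which only stands in for the Mashable dataset and is not taken from it. The sketch works on an inline HTML string so that it runs offline; in the real project the pages would first be downloaded from the news site.

```python
from typing import List

import numpy as np
from bs4 import BeautifulSoup
from sklearn.ensemble import RandomForestRegressor

# Hypothetical stand-in for a downloaded article page.
SAMPLE_HTML = """<html><head><title>Sample headline for testing</title></head>
<body>
<p>First paragraph of the story.</p>
<p>Second paragraph with a <a href="https://example.com">link</a></p>
<img src="a.jpg"/>
</body></html>"""


def extract_features(html: str) -> List[float]:
    """Return [title word count, body word count, link count, image count]."""
    soup = BeautifulSoup(html, "html.parser")
    title = soup.title.get_text(strip=True) if soup.title else ""
    body = " ".join(p.get_text(" ", strip=True) for p in soup.find_all("p"))
    return [
        float(len(title.split())),
        float(len(body.split())),
        float(len(soup.find_all("a", href=True))),
        float(len(soup.find_all("img"))),
    ]


# Synthetic placeholder data (assumption): random feature rows and an
# arbitrary target, used only so the example runs without the real dataset.
rng = np.random.default_rng(0)
X_train = rng.random((200, 4)) * np.array([20, 2000, 50, 20])
y_train = 10 * X_train[:, 2] + 5 * X_train[:, 3] + rng.normal(0, 1, 200)

model = RandomForestRegressor(n_estimators=50, random_state=0)
model.fit(X_train, y_train)

# Predict a popularity score (number of shares in the real setting)
# for the parsed sample article.
features = extract_features(SAMPLE_HTML)
predicted = model.predict(np.array([features]))
print(features, predicted)
```

With the real data, `X_train` and `y_train` would instead come from the dataset's feature columns and its share counts, and the features extracted from each scraped article would need to match the columns the model was trained on.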