/The-Android-App-Market-on-Google-Play

Loaded, cleaned, and visualized the scraped Google Play Store data to gain insights into the Android app market.

Primary LanguageJupyter Notebook

project language topics

The-Android-App-Market-on-Google-Play

Loaded, cleaned, and visualized the scraped Google Play Store data to gain insights into the Android app market.

Project Description


Mobile apps are everywhere. They are easy to create and can be lucrative. Because of these two factors, more and more apps are being developed. In this project, I have done a comprehensive analysis of the Android app market by comparing over ten thousand apps in Google Play across different categories. I have looked for insights in the data to devise strategies to drive growth and retention. The data for this project was scraped from the Google Play website. While there are many popular datasets for Apple App Store, there aren't many for Google Play apps, which is partially due to the increased difficulty in scraping the latter as compared to the former. The data files are as follows:

  • apps.csv: contains all the details of the apps on Google Play. These are the features that describe an app.
  • user_reviews.csv: contains 100 reviews for each app, most helpful first. The text in each review has been pre-processed, passed through a sentiment analyzer engine and tagged with its sentiment score.

Project Tasks

  1. Google Play Store apps and reviews
  2. Data cleaning
  3. Correcting data types
  4. Exploring app categories
  5. Distribution of app ratings
  6. Size and price of an app
  7. Relation between app category and app price
  8. Filter out "junk" apps
  9. Popularity of paid apps vs free apps
  10. Sentiment analysis of user reviews

The Jupyter notebook contains the answer (code) for all of the above tasks:

Click here to check out