/Store_Sales_Prediction

A prediction pipeline using pyspark and kafka

Primary LanguageJupyter Notebook

Store Sales Prediction

Nowadays, shopping malls and Big Marts keep track of individual item sales data in order to forecast future client demand and adjust inventory management. In a data warehouse, these data stores hold a significant amount of consumer information and particular item details. By mining the data store from the data warehouse, more anomalies and common patterns can be discovered.

Objective

We have to build a solution that should able to predict the sales of the different stores of Big Mart according to the provided dataset.

Project Demo

https://www.youtube.com/watch?v=v_3vFV5tdg4&t=16s

Application link

https://store--sales.herokuapp.com/

Documents

HLD/LLD/Architecture/DPR : https://drive.google.com/drive/folders/1jOIL4jgiebj_3euKR6wRm6DNHX2EzQbA?usp=sharing

Database

MongoDB database has been used to store prediction dataset and for logging

🛠️ Requirements

  • python 3.x
  • Flask
  • pandas
  • pymongo
  • kneed
  • scikit-learn
  • xgboost

Contributor

  • Sayan Saha