/databricks-recommendation-system

This is a learning repository about Databricks and Recommendation Systems

Primary LanguagePythonMIT LicenseMIT

Databricks Recommendation System

GitHub MIT license GitHub Workflow Status

This project study aims to implement a recommendation system model using Databricks Feature Store, MLflow, and Surprise modules. The goal of the project is to build an accurate recommendation system using all tracking functionalities from Databricks environment.

To achieve this goal, the project will use Databricks Feature Store to manage the data and features used in the model. The Feature Store provides a centralized platform for managing the data, making it easier to track changes in the data and ensure that the recommendation system is using the most up-to-date data.

Next, the project will use the Surprise module to build the recommendation system model. Surprise is a Python library that provides a range of algorithms for building recommendation systems. The library is easy to use and provides a range of evaluation metrics for assessing the performance of the model.

Finally, the project will use MLflow to track the experimentation process and compare the performance of different models. MLflow provides a centralized platform for managing the machine learning lifecycle, allowing data scientists to track the experiments, compare the performance of different models, and deploy the best model to production.

Note: This project only runs on Databricks


Code

Download or clone this repository on Databricks.

How to run?

  1. Fill the kaggle.json file with your Kaggle API credential
  2. Run the notebooks in order

Also look ~