RePlay is a library providing tools for all stages of creating a recommendation system, from data preprocessing to model evaluation and comparison.
RePlay uses PySpark to handle big data.
You can
- Filter and split data
- Train models
- Optimize hyper parameters
- Evaluate predictions with metrics
- Combine predictions from different models
- Create a two-level model
Use Linux machine with Python 3.7+, Java 8+ and C++ compiler.
pip install replay-rec
It is preferable to use a virtual environment for your installation.
If you encounter an error during RePlay installation, check the troubleshooting guide.