- Data loading
- Cleanup and text embedding strategy
- Create Sklearn Pipeline to train and deploy model with preprocessing steps.
- Model Evaluation
- Save model
- FastAPI with Google App Engine - Completed - but stopped due to Cloud Costs
- Deployed with Container Registry & Cloud Run.
- Move model accuracy:
- Using better text embedding techniques
- Better model than RandomForest or try hyperparameter tuning