Side project to apply what I learned.
.
├── api
├── data
├── front
├── indexer
├── libraries
└── mlapi
- Natural Language Processing (NLP)
- TF-IDF
- Latent Dirichlet Allocation (LDA)
- Users Preferences
- Features to consider: genres, actors, directors, etc. (for the moment I only use genres)
- A user profile is built from the user's ratings of several movies, each described by a movie profile
- Cosine similarity measures how close a user profile is to a movie profile
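The profile-and-similarity idea above can be sketched as follows. This is only one possible reading: it assumes one-hot genre vectors as movie profiles and a user profile computed as the sum of rated movie profiles weighted by `rating - neutral`. The genre list, movie ids, ratings, and the `neutral` midpoint are all hypothetical.

```python
import math

GENRES = ["Action", "Comedy", "Drama", "Sci-Fi"]  # hypothetical genre vocabulary


def movie_profile(genres):
    """One-hot encode a movie's genres over the GENRES vocabulary."""
    return [1.0 if g in genres else 0.0 for g in GENRES]


def user_profile(ratings, profiles, neutral=3.0):
    """Sum the rated movie profiles, weighted by (rating - neutral).

    The weighting scheme is an assumption: liked movies pull the profile
    towards their genres, disliked movies push it away.
    """
    profile = [0.0] * len(GENRES)
    for movie_id, rating in ratings.items():
        weight = rating - neutral
        for i, value in enumerate(profiles[movie_id]):
            profile[i] += weight * value
    return profile


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical catalogue and ratings (1 to 5 scale).
profiles = {
    "m1": movie_profile({"Action", "Sci-Fi"}),
    "m2": movie_profile({"Comedy"}),
    "m3": movie_profile({"Sci-Fi", "Drama"}),
    "m4": movie_profile({"Comedy"}),
}
user = user_profile({"m1": 5, "m2": 1}, profiles)

sim_m3 = cosine_similarity(user, profiles["m3"])
sim_m4 = cosine_similarity(user, profiles["m4"])
```

With these made-up ratings, the unseen Sci-Fi/Drama movie `m3` scores higher than the unseen comedy `m4`, so it would be recommended first.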
- ... Not enough data yet ...
- Akka actor system: the external API allows at most 40 queries per second
- A supervisor sends batches of movies to workers, which enrich each movie and index it
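The batching idea can be illustrated without Akka. The sketch below is not the project's actor code. It is a plain planning function that assumes one external API call per movie and shows how a supervisor could cut the work into batches that respect the 40 queries per second limit. The `plan_batches` name and the one-batch-per-second schedule are assumptions.

```python
def plan_batches(movie_ids, max_per_second=40):
    """Split movie_ids into batches of at most max_per_second items.

    Each batch is paired with the second offset at which a supervisor
    could hand it to the workers, so no second exceeds the rate limit.
    """
    return [
        (offset, movie_ids[start:start + max_per_second])
        for offset, start in enumerate(range(0, len(movie_ids), max_per_second))
    ]


# 100 hypothetical movie ids.
schedule = plan_batches(list(range(100)))
```

For 100 movies this yields three batches (40, 40 and 20 movies) dispatched at seconds 0, 1 and 2.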
- Spark: builds the autocomplete index from movie titles, reading from Elasticsearch and writing back to Elasticsearch (ES to ES)
- Backpressure with .coalesce(20), which caps the job at 20 partitions
- Backpressure with
- GraphQL: used to query the machine learning API, with Sangria
React/Redux