-> Documentation under construction
This project extracts non-regular entities such as movies, actors, genres and keywords, together with the sentiment expressed toward each of them, from Reddit submissions in a subreddit for movie recommendations.
Source subreddit (shows the text format of the submissions): https://www.reddit.com/r/MovieSuggestions/
-
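To make the goal concrete, here is a minimal sketch of what one extracted entity with its sentiment could look like. Everything in it is an assumption for illustration: the `EntityMention` class, the `find_mention` helper, the label sets and the example sentence are hypothetical and are not taken from 'pipeline.py' or the notebooks.

```python
from dataclasses import dataclass

# Hypothetical label sets; the project's actual tag scheme may differ.
ENTITY_TYPES = {"movie", "actor", "genre", "keyword"}
SENTIMENTS = {"positive", "negative"}


@dataclass(frozen=True)
class EntityMention:
    """One entity found in a submission, with the sentiment toward it."""
    text: str          # surface form as written in the submission
    entity_type: str   # one of ENTITY_TYPES
    sentiment: str     # one of SENTIMENTS
    start: int         # character offset where the mention begins
    end: int           # character offset just past the mention

    def __post_init__(self):
        # Reject labels outside the (assumed) label sets.
        if self.entity_type not in ENTITY_TYPES:
            raise ValueError(f"unknown entity type: {self.entity_type}")
        if self.sentiment not in SENTIMENTS:
            raise ValueError(f"unknown sentiment: {self.sentiment}")


def find_mention(submission, surface, entity_type, sentiment):
    """Build a mention from the first occurrence of `surface` in the text."""
    start = submission.index(surface)  # raises ValueError if absent
    return EntityMention(surface, entity_type, sentiment,
                         start, start + len(surface))


# Made-up submission text in the style of a movie request.
submission = "Looking for something like Inception, but please no horror."
mentions = [
    find_mention(submission, "Inception", "movie", "positive"),
    find_mention(submission, "horror", "genre", "negative"),
]

for m in mentions:
    print(m.entity_type, m.sentiment, repr(submission[m.start:m.end]))
```

Here the labels are assigned by hand; in the project they would come from the fine-tuned model. Storing character offsets alongside the surface form keeps each mention traceable to its position in the original submission.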
For data preprocessing and model training, head to 'notebooks'.
-
For inference, run 'pipeline.py'.
-
Google BERT repository: https://github.com/google-research/bert
-
BERT multilingual models (Google repo): https://github.com/google-research/bert/blob/master/multilingual.md
-
Hugging Face pre-trained models, including BERT (TensorFlow filter): https://huggingface.co/models?filter=tf
-
BERT fine-tuning (TensorFlow Core): https://www.tensorflow.org/official_models/fine_tuning_bert
-
BERT fine-tuning (Google Colab): https://colab.research.google.com/drive/1ywsvwO6thOVOrfagjjfuxEf6xVRxbUNO