Learning important NLP concepts such as vocabulary, corpus, and tokenization.
I used the https://www.kaggle.com/luisfredgs/imdb-ptbr dataset, which contains 50k movie reviews in Portuguese.
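
As a rough illustration of how these concepts fit together, here is a minimal sketch using only the Python standard library. The two reviews are made-up stand-ins, not rows from the dataset, and the names `corpus`, `tokenize`, `counts`, and `vocab` are hypothetical; the notebook may well use a different tokenizer or library.

```python
import re
from collections import Counter

# Hypothetical stand-in corpus: two made-up reviews, NOT taken from the dataset.
corpus = [
    "O filme é ótimo, ótimo mesmo!",
    "O filme é ruim.",
]

def tokenize(text):
    """Lowercase the text and split it into word tokens (runs of word characters)."""
    # In Python 3, \w matches Unicode letters, so accented characters stay in the token.
    return re.findall(r"\w+", text.lower())

# Tokenization: each document becomes a list of tokens.
tokenized = [tokenize(doc) for doc in corpus]

# Vocabulary: the set of distinct tokens across the whole corpus, with their counts.
counts = Counter(token for doc in tokenized for token in doc)
vocab = sorted(counts)

print(tokenized[1])      # tokens of the second review
print(len(vocab))        # number of distinct tokens
print(counts["ótimo"])   # how often one token appears in the corpus
```

Here the corpus is the full list of documents, tokenization turns each document into a list of tokens, and the vocabulary is the set of distinct tokens. A simple regex split like this ignores punctuation and does no stemming or stop-word removal, which is usually enough to explore the ideas on a small sample.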
If you want to see more of the work, please open the Python notebook.