I love to write beautiful code. So, I implemented every piece for code myself even when I followed some tutorial. Let there be beautiful ML code. However, the code is not written for performance and tries to use python data structures for readability. The sole purpose of the codes is learning and not being used in production.
Each notebook sets the current working directory to the root of the repository. set projectFolder in init_notebook.py to the root of the repository before running a notebook. The first cell of the notebooks needs to be run only once per session (you will get an error if the working directory is already changed once). This setting helps us to organize libraries into different folders and access them from anywhere inside the repository.
- Stemming, Lemmatization, POS Preprocessing
- Vocabulary, word frequence, TD-IDF common_preprocess.py
- Using Pretrained models
- Debiasing