/nlp-practice

Primary LanguageJupyter NotebookGNU General Public License v3.0GPL-3.0

I love to write beautiful code. So, I implemented every piece for code myself even when I followed some tutorial. Let there be beautiful ML code. However, the code is not written for performance and tries to use python data structures for readability. The sole purpose of the codes is learning and not being used in production.

Getting Started:

Each notebook sets the current working directory to the root of the repository. set projectFolder in init_notebook.py to the root of the repository before running a notebook. The first cell of the notebooks needs to be run only once per session (you will get an error if the working directory is already changed once). This setting helps us to organize libraries into different folders and access them from anywhere inside the repository.

1. Building Vocabulary

2. Tasks

2.1 Word Embedding

  1. Using Pretrained models
  2. Debiasing

2.2 Classical Methods:

  1. Document Classification with Bag-of-Words
  2. Movie recommender with TD-iDF
  3. Text Classification with Naive Bayes
  4. Text Generation with Naive Bayes

2.3 NN Methods

Fine-tuning

  1. Fine Tuning Llama2 with LoRA