This repository owns a collection of assignments and notes for 'KUBIG' natural language processing session held from June to August 2021. After 3 months of session, free-themed nlp competition was held and our team took the 1st place. The lectures of the session are based on content from 'Deep learning from scratch 2'.
Weekly plan of the session and assignments are listed below.
Weekly plan
week
about
assignment
file
1
Session O.T.
-
-
2
Word Representation
pmi matrix, data crawling, wordcloud
Amazon review_n-gram
3
Word2vec
t-SNE, pre-trained models
Everytime_word2vec, t-SNE
4
Word2vec improvement
word2vec (skipgram)
News data_word2vec
5
RNN
simpleRNN
LSTM chatbot
6
Gated RNN
nlp contest prep
-
7
Text generation, Attention
nlp contest prep
-
NLP contest
Team name: MPTI (Make self-Pr Through aI)
Project: CV generation based on GPT-3 and one-line summary with LexRank
Result: 1st place (1/6 teams)
Weekly plan
date
about
to-do
21.08.16
Idea Brainstorming
Data crawling (Linkcareer, Jobkorea)
21.08.19
Data crawling/pre-processing
Data pre-processing
21.08.23
Data pre-processing
GPT-3 paper, code study
21.08.26
GPT-3 fine-tuning
Share GPT-3 fine-tuning results
21.08.30
GPT-3 fine-tuning
LexRank study, GPT-3 hyperparameter tuning
21.08.31
LexRank summarization/Final prep
Final prep
For more details about the project, visit 'gpt3-cv-generator' repository.