[3-1] Data Science Society NLP Session

Primary language: Jupyter Notebook

kubig-nlp-session

   

This repository contains assignments and notes from the 'KUBIG' natural language processing (NLP) session held from June to August 2021. After the three-month session, a free-topic NLP competition was held, and our team took 1st place. The session lectures are based on the book 'Deep Learning from Scratch 2'.
The weekly plan and assignments are listed below.

Weekly plan

| week | about | assignment | file |
|------|-------|------------|------|
| 1 | Session O.T. | - | - |
| 2 | Word Representation | pmi matrix, data crawling, wordcloud | Amazon review_n-gram |
| 3 | Word2vec | t-SNE, pre-trained models | Everytime_word2vec, t-SNE |
| 4 | Word2vec improvement | word2vec (skipgram) | News data_word2vec |
| 5 | RNN | simpleRNN | LSTM chatbot |
| 6 | Gated RNN | nlp contest prep | - |
| 7 | Text generation, Attention | nlp contest prep | - |
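
For readers unfamiliar with the week 2 topic, the following is a minimal sketch of a positive PMI (PPMI) computation over a toy corpus. It is not taken from the assignment notebooks; the function names `cooccurrence_counts` and `ppmi`, the window size, and the example sentence are all assumptions made for illustration.

```python
import math
from collections import Counter


def cooccurrence_counts(tokens, window=1):
    """Count symmetric co-occurrences within `window` tokens on each side."""
    counts = Counter()
    for i, word in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                counts[(word, tokens[j])] += 1
    return counts


def ppmi(counts):
    """Positive PMI: max(0, log2(P(x, y) / (P(x) * P(y))))."""
    total = sum(counts.values())
    # Counts are symmetric, so row sums equal column sums.
    row_sums = Counter()
    for (x, _), c in counts.items():
        row_sums[x] += c
    scores = {}
    for (x, y), c in counts.items():
        pmi = math.log2(c * total / (row_sums[x] * row_sums[y]))
        scores[(x, y)] = max(0.0, pmi)
    return scores


# Toy corpus (hypothetical example, not session data).
tokens = "you say goodbye and i say hello".split()
scores = ppmi(cooccurrence_counts(tokens, window=1))
```

Pairs that co-occur less often than chance would predict get a score of 0, which is why the positive variant is often preferred for sparse count matrices.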

NLP contest

  • Team name: MPTI (Make self-Pr Through aI)
  • Project: CV generation based on GPT-3 and one-line summary with LexRank
  • Result: 1st place (1/6 teams)
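
As a rough illustration of the LexRank idea used for the one-line summary, the sketch below ranks sentences by running a PageRank-style iteration over a thresholded sentence-similarity graph. It is a simplified stand-in, not the team's pipeline: it uses raw term-frequency cosine similarity instead of the IDF-weighted cosine of the original method, and the names `cosine` and `lexrank`, the threshold, the damping factor, and the example sentences are all assumptions.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def lexrank(sentences, threshold=0.1, damping=0.85, iterations=50):
    """Score sentences by centrality in a thresholded similarity graph."""
    bags = [Counter(s.lower().split()) for s in sentences]
    n = len(bags)
    # Unweighted edge between two different sentences if they are similar enough.
    adj = [
        [1.0 if i != j and cosine(bags[i], bags[j]) > threshold else 0.0
         for j in range(n)]
        for i in range(n)
    ]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        new = [(1.0 - damping) / n] * n
        for i in range(n):
            degree = sum(adj[i])
            for j in range(n):
                # A sentence with no neighbours spreads its score evenly.
                share = adj[i][j] / degree if degree else 1.0 / n
                new[j] += damping * scores[i] * share
        scores = new
    return scores


# Hypothetical input; the last sentence shares no words with the others.
sentences = [
    "the cat sat on the mat",
    "the cat lay on the rug",
    "a cat sat on a mat",
    "stocks fell sharply today",
]
scores = lexrank(sentences)
summary = sentences[scores.index(max(scores))]
```

In this toy input the unrelated fourth sentence receives the lowest score, while the one-line summary is picked from the three mutually similar sentences.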

Weekly plan

| date | about | to-do |
|------|-------|-------|
| 21.08.16 | Idea Brainstorming | Data crawling (Linkcareer, Jobkorea) |
| 21.08.19 | Data crawling/pre-processing | Data pre-processing |
| 21.08.23 | Data pre-processing | GPT-3 paper, code study |
| 21.08.26 | GPT-3 fine-tuning | Share GPT-3 fine-tuning results |
| 21.08.30 | GPT-3 fine-tuning | LexRank study, GPT-3 hyperparameter tuning |
| 21.08.31 | LexRank summarization/Final prep | Final prep |

For more details about the project, visit the 'gpt3-cv-generator' repository.