Issues
- 1
Classical-Japanese-only dataset
#24 opened - 0
Use os.path.join
#23 opened - 0
Clean up the config file and file_path handling
#22 opened - 0
add decoding algo
#21 opened - 0
efficient generate
#20 opened - 0
Check whether softmax is applied in the loss computation
#19 opened - 0
Make the dataloader JAX-like
#18 opened - 0
Try setting dropout to 0
#17 opened - 0
Check how quickly the loss decreases on Shakespeare
#16 opened - 1
Try removing mha
#15 opened - 1
wikitext-ja
#14 opened - 0
Publish models on HuggingFace
#13 opened - 1
Training on Aozora Bunko (青空文庫)
#12 opened - 0
training on my blog posts
#11 opened - 1
get batches with dynamic slice
#10 opened - 0
Dataclass GPTConfig
#9 opened - 1
- 0
Code reading about Tiktoken
#7 opened - 0
RLHF
#6 opened - 0
fine tuning
#5 opened - 0
vocab size from 50257 to 50304
#4 opened - 2
distributed training
#3 opened - 0
evaluate my llm
#2 opened - 1