Issues
- 1
Training on TPU
#89 opened by Dhanachandra - 0
GPT-2 doesn't have Encode.py or Train.py.
#88 opened by thegoldenboy542 - 1
how to use trained data to generate text?
#87 opened by bag7dad - 3
- 0
avg stays in 2.6-2.9 range
#86 opened by freedmann2 - 7
- 2
How can I use 1558m data or 775 data instead of using 177m data to train my own model?
#57 opened by drizzt00s - 1
Encoder doesn't work
#83 opened by ruskyvisky - 3
I can't train my dataset
#80 opened by shreesha345 - 0
- 4
How to generate interactive conditional samples after retraining on custom dataset?
#51 opened by nikilp - 7
- 2
Dockerfile no longer accurate?
#78 opened by teledemic - 0
- 1
File "encode.py", line 23
#76 opened by Jucamhc - 4
Cannot download the models
#75 opened by giorgiogiudice - 4
OOM on 345M with GPU
#69 opened by babaraza - 1
gpt2 translation task
#65 opened by jzl0166 - 1
Where to enter model name as parameter
#74 opened by kalyons1 - 0
- 2
- 0
Early Stopping
#68 opened by bala1802 - 1
- 1
refine-tuning by GPU generating repeated words
#60 opened by drizzt00s - 1
How to train in multiple gpu
#66 opened by teja835 - 1
Error when calculating the validation loss - indices[0,1200] = 1200 is not in [0, 1024)
#63 opened by yijunzhouzoey - 1
Process gets killed when training
#61 opened by 50417 - 1
- 0
Why the label of training is like this
#58 opened by SchenbergZY - 0
Image GPT Training
#56 opened by cellininicholas - 8
- 0
- 0
What is the minimum size of GPU I need to set batch_size more than 1 to train 345M model using train.py?
#52 opened by shamiul94 - 2
Encoding on GPU
#43 opened by Zer0-dev115 - 1
Train.py issues windows 10
#35 opened by veldermon - 0
Sample Length
#50 opened by cachilders - 0
How to get the model embeddings?
#47 opened by Leggerla - 2
Is possible to use this train script for another language dataset? In order to train it from start in a new language.
#49 opened by nikkon3 - 0
Manipulating how a sample should start
#48 opened by fabpapi - 0
NameError: name 'How' is not defined
#45 opened by mikkokotila - 1
dataset
#44 opened by AlgoRhythm-Technologies - 4
- 10
Finetuning on the Full Model - OOM 1558M
#37 opened by vince-lynch - 1
Batch size in training GPT_2
#41 opened by ngocpham97 - 4
Apologies, but HELP
#32 opened by OdincoGaming - 0
About Perplexity
#36 opened by curly0613 - 2
Unicode characters each considered as a token
#34 opened by Masum06 - 0
Consued about vocab and encoder
#31 opened by weiguowilliam - 0
- 0
Train loss
#26 opened by alecalma