nshepperd/gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

PythonNOASSERTION

Issues

GPT-2: Q/A Training Question
#4 opened 5 years ago by josiahls
2
OOM With Gradient Checkpointing on 1080 Ti
#5 opened 5 years ago by 9of9
2
generate_samples() code from gpt2 train.py -- InvalidArgumentError
#14 opened 5 years ago by radiodee1
1
ResourceExhaustedError (see above for traceback): OOM when allocating tensor with shape[1,12,1024,1024] and type float on /job:localhost/replica:0/task:0/device:GPU
#8 opened 5 years ago by josai
12
774M Model running out of memory
#24 opened 5 years ago by sdan
13
Intermediate Layer Output
#13 opened 5 years ago by bakszero
4
how to train on multi gpu
#21 opened 5 years ago by brianjcj
1
Question about training with small dataset entries
#25 opened 5 years ago by ProtoxiDe22
2
[SOLUTION] UnicodeEncodeError: 'charmap' codec can't encode character '\ufffd' in position 29: character maps to <undefined>
#10 opened 5 years ago
1
module 'tensorflow' has no attribute 'sort'
#20 opened 5 years ago by shoegazerstella
0
Failed to interpret file %s as a pickle
#19 opened 5 years ago by Ceebox
0
Training from scratch?
#11 opened 5 years ago by bkj
2
Zero Division Error
#18 opened 5 years ago by Chris-Rigas
2
Restriction on only training transformer layers?
#12 opened 5 years ago by bakszero
6
Splitting the model across multiple graphics cards
#17 opened 5 years ago by thoughtsofacrow
0
"past" is not used in training
#16 opened 5 years ago by cookielee77
2
Freezing layers while finetuning
#15 opened 5 years ago by bakszero
0
Windows doesn't automatically use UTF-8 encoding
#9 opened 5 years ago by MrKrzYch00
2