Issues
- 1
tips and scripts related to data collection
#74 opened by alvinchangw - 0
control code
#85 opened by dedededefo - 0
- 5
- 1
repeats the last word on AWS
#71 opened by dataSci-rigo - 0
Cuda out of memory issue.
#83 opened by jamalabdul1 - 0
- 2
- 0
- 2
Sampling settings used in the paper
#52 opened by bilal2vec - 1
- 2
control code not recognised
#78 opened by cainesap - 4
Fine-tuning on Colab
#63 opened by manueltonneau - 9
Out of memory when fine-tuning
#43 opened by orenmelamud - 0
Will BERT+transformer-decoder better than tensor2tensor for text-generation?
#77 opened by guotong1988 - 2
Using ctrl for summarization
#64 opened by Hellisotherpeople - 0
Altering the tone of the output
#76 opened by qdx45 - 0
Are control codes required for finetuning?
#75 opened by umhau - 0
Sampling method used for translation
#73 opened by MichaelZhouwang - 0
License for pre-trained model
#72 opened by MobiusLooper - 0
training curriculum used
#70 opened by v1nc3nt27 - 0
Source attribution - Cannot replicate results
#67 opened by sipity19 - 5
How to finetune on TPU v3-8 nodes? It runs without error but does not seem to progress.
#38 opened by eurka - 1
Is that a way to do "general" generation?
#66 opened by cloudygoose - 0
TPU configuration - fine tuning
#65 opened by pgrandinetti - 2
- 1
- 1
- 0
Is it possible to run CTRL (full) on Gradient?
#59 opened by ckoshka - 0
❓ Question about testing procedure
#58 opened by astariul - 1
Custom model conversion to Hugging Face
#57 opened by orenmelamud - 1
Line breaks in prompts?
#56 opened by vince-lynch - 12
Running full model on V100 outputs last word
#49 opened by dimitri320 - 1
training_utils on new vocab and codes
#53 opened by ctoffo - 2
Running the model on TPUs?
#55 opened by vessenes - 1
- 2
OutOfMemory in fine-tuning.
#46 opened by LiuYixian - 1
more than 600 labels
#45 opened by leejason - 4
How to add new control code into vocabulary?
#37 opened by leejason - 2
benchmarking with GPT-2
#47 opened by leejason - 5
Finetuning Errors
#32 opened by nickwalton - 1
- 2
- 1
Just for fun: How long would training this model take on a Nvidia 1080Ti GPU (12gb)
#44 opened by timpal0l - 2
using pytorch_generation.py: setting --temperature argument to any value causes a failure
#41 opened by tanselmi - 0
- 1
multiple tags as control code
#35 opened by leejason - 1
smaller model
#36 opened by leejason - 1
Question about vocabulary file
#42 opened by htw2012 - 1