huggingface/open-muse

Test out hyperparameters in paper

isamu-isozaki opened this issue · 0 comments

I noticed that our current training hyper parameters for the base model is similar to the hyper parameters for the super-resolution model. Instead of the base one(hidden size 1024 vs 2048) so might be interesting trying out with the default parameters too