Train BigConv:

python train.py

Train BigConvMoE:

python train.py --model-type BigConvMoE

Or start from a pretrained BigConv model:

python train.py --model-type BigConvMoE --p pretrained/path --lr-max 0.01
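
For reference, here is a minimal sketch of how train.py might parse the flags used above. It is not taken from the actual script: the function name build_parser, the defaults, and the help strings are all assumptions, and only the flag names (--model-type, --p, --lr-max) come from the commands in this section.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the flags shown in the commands above."""
    parser = argparse.ArgumentParser(description="Train BigConv or BigConvMoE")
    # Assumption: with no --model-type the script trains BigConv,
    # since the plain `python train.py` command is described as training BigConv.
    parser.add_argument(
        "--model-type",
        choices=["BigConv", "BigConvMoE"],
        default="BigConv",
        help="which model to train",
    )
    # Assumption: --p is the path to a pretrained BigConv model;
    # None here stands for training from scratch.
    parser.add_argument("--p", default=None, help="path to a pretrained model")
    # Assumption: --lr-max is the peak learning rate; the real default is unknown,
    # so None is used as a placeholder.
    parser.add_argument("--lr-max", type=float, default=None, help="peak learning rate")
    return parser


# Parse the same arguments as the last command above, passed explicitly
# so the sketch does not depend on how it is invoked.
args = build_parser().parse_args(
    ["--model-type", "BigConvMoE", "--p", "pretrained/path", "--lr-max", "0.01"]
)
print(args.model_type, args.p, args.lr_max)
```

argparse converts dashes in flag names to underscores, so --model-type and --lr-max are read back as args.model_type and args.lr_max.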