Error in README
Closed this issue · 1 comments
odellus commented
In the README it states that you need to run
python3 -m coh.coh_train \
--mp_mesh_dim=16 \
--load_opt_config='huggingface::EleutherAI/gpt-j-6B' \
--model='opt' # Shouldn't this be gptj? \
--pretrain_dataset.split='train' \
--pretrain_dataset.path='c4' \
--pretrain_dataset.seq_length=1024 \
--pretrain_dataset.batch_size=512 \
--feedback_dataset.tokenizer='EleutherAI/gpt-j-6B' \
--feedback_dataset.split='train' \
--feedback_dataset.seq_length=1024 \
--feedback_dataset.batch_size=512 \
--log_all_worker=False \
--logger.online=False \
--logger.project_id="" \
--logger.experiment_id="" \
--logger.experiment_note="" \
--logger.gcs_output_dir="" \
--logger.output_dir="$HOME/coh_output"
to do COH with GPTJ but it pretty clearly says to use the OPT model in the given arguments.
haoliuhl commented
Nice catch. thank you.