BERT for RACE dataset

Run multi-worker, multi-GPU (blind cpu):

    bash run_multiworker.sh 0 <addr> 0 1 <model name> <dataset name> 320 <batch size on single GPU>

Evaluation:

    bash eval.sh
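
To check the placeholders before launching, the run command can be assembled and printed first. This is only a sketch: the address, model name, dataset name, and batch size below are hypothetical stand-ins, not values confirmed by this repository, and the fixed arguments (0, 0 1, 320) are copied unchanged from the run command.

```shell
# Hypothetical placeholder values; replace them with your own.
ADDR="127.0.0.1"            # assumed value for <addr>
MODEL="bert-base-uncased"   # assumed value for <model name>
DATASET="RACE"              # assumed value for <dataset name>
BATCH=4                     # assumed value for <batch size on single GPU>

# Fixed arguments are kept exactly as they appear in the run command.
CMD="bash run_multiworker.sh 0 $ADDR 0 1 $MODEL $DATASET 320 $BATCH"

# Dry run: print the command for inspection instead of executing it.
echo "$CMD"
```

Once the printed line looks right, run it as shown from the directory that contains run_multiworker.sh.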