CLUEbenchmark/CLUE

Model support in PyTorch

JiaojiaoYe1994 opened this issue · 2 comments

I am trying to train the models implemented in PyTorch under ./models_pytorch/classifier_pytorch. Training and prediction with BERT already work. However, when I try to run ALBERT, an OSError reports that the model was not found.

Are ALBERT, RoBERTa, etc. implemented in PyTorch?

  • execution command:
    python run_classifier.py --model_type=albert --model_name_or_path=albert_xlarge_zh --task_name=iflytek --do_train --do_eval --do_lower_case --data_dir=./CLUEdatasets/iflytek --max_seq_length=128 --per_gpu_train_batch_size=16 --per_gpu_eval_batch_size=16 --learning_rate=2e-5 --num_train_epochs=3.0 --logging_steps=759 --save_steps=759 --output_dir=./iflytek_output/ --seed=42 --overwrite_output_dir

  • return:
    OSError: Model name 'albert_xlarge_zh' was not found in model name list (bert-base-uncased, bert-large-uncased, bert-base-cased, bert-large-cased, bert-base-multilingual-uncased, bert-base-multilingual-cased, bert-base-chinese, bert-base-german-cased, bert-large-uncased-whole-word-masking, bert-large-cased-whole-word-masking, bert-large-uncased-whole-word-masking-finetuned-squad, bert-large-cased-whole-word-masking-finetuned-squad, bert-base-cased-finetuned-mrpc, bert-base-german-dbmdz-cased, bert-base-german-dbmdz-uncased). We assumed 'albert_xlarge_zh' was a path or url to a configuration file named config.json or a directory containing such a file but couldn't find any such file at this path or url.

I have the same problem. Did you solve it?

If you want to use albert_xlarge_zh in PyTorch, you can check here:
https://github.com/brightmart/albert_zh
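
According to the OSError text in the original post, a value of --model_name_or_path that is not in the built-in name list is treated as a path to config.json or to a directory containing it. A minimal sketch of a pre-flight check for a local path, under that assumption only (has_local_config is a hypothetical helper, not part of CLUE, and the real loader may also require weight and vocabulary files):

```python
from pathlib import Path


def has_local_config(model_name_or_path: str) -> bool:
    """Return True if the argument is a config.json file or a directory holding one.

    Hypothetical helper: it mirrors only the condition described in the
    error message and does not check for weights or vocabulary files.
    """
    path = Path(model_name_or_path)
    if path.is_file():
        # The argument points directly at a file; accept it only if it is config.json.
        return path.name == "config.json"
    # Otherwise treat the argument as a directory and look for config.json inside it.
    return (path / "config.json").is_file()


if __name__ == "__main__":
    # 'albert_xlarge_zh' is the value passed in the command from the original post.
    print(has_local_config("albert_xlarge_zh"))
```

If this prints False, the directory is either missing or does not yet contain a converted checkpoint with a config.json, which matches the failure reported above.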