pre-training
wjczf123 opened this issue · 10 comments
File "/deepo_data/pretrain/code/model.py", line 152, in forward
l_h_state = l_outputs[0][indice, l_ph] # (batch, hidden_size)
result = self.forward(*input, **kwargs)
File "/deepo_data/CZF/Counterfactual-RE/origin/pretrain/code/model.py", line 152, in forward
IndexError: too many indices for tensor of dimension 0
I encountered this bug. Also, can you provide your pretrained MTB and CP models (as in step 2 of the pretrain instructions; it seems that MTB and CP in step 2 still need pre-training)?
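For context, a minimal sketch of how this IndexError can arise (the tensor names mirror the traceback, but the shapes and the failure scenario are assumptions, not the repo's actual code): advanced indexing with two index tensors only works on a tensor with at least two dimensions, so if l_outputs[0] somehow ends up 0-dimensional, for instance because a mismatched transformers version returns a different output structure, exactly this message is raised.

    import torch

    # Assumed shapes for illustration; the real model's shapes may differ.
    batch, seq_len, hidden_size = 4, 16, 8
    l_outputs = (torch.randn(batch, seq_len, hidden_size),)  # e.g. last_hidden_state
    indice = torch.arange(batch)                 # one row index per example
    l_ph = torch.randint(0, seq_len, (batch,))   # one token position per example

    # Normal case: advanced indexing picks one hidden state per example.
    l_h_state = l_outputs[0][indice, l_ph]       # (batch, hidden_size)
    print(l_h_state.shape)                       # torch.Size([4, 8])

    # Failure case: the same indexing on a 0-dim tensor reproduces the error.
    scalar = torch.tensor(0.0)
    try:
        scalar[indice, l_ph]
    except IndexError as e:
        print(e)  # too many indices for tensor of dimension 0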
For the first problem: did you install the version of transformers specified in our repo?
For the second problem: the ckpts provided in step 2 of pretrain are the final checkpoints we used.
For the second problem, we used the MTB model you provided, but the results seem wrong.
The run.sh is:
ckpt="MTB"
for seed in 42 43 44 45 46
do
bash train.sh 1 $seed $ckpt 0.01 20
done
And I put the MTB into pretrain/ckpt/.
I ran this code three times and the results are as follows:
MTB wiki80 Thu Aug 5 06:59:15 2021
@Result: Best Dev score is 0.481, Test score is 0.510
MTB wiki80 Thu Aug 5 07:03:47 2021
@Result: Best Dev score is 0.502, Test score is 0.537
MTB wiki80 Thu Aug 5 07:08:15 2021
@Result: Best Dev score is 0.498, Test score is 0.532
Is there a problem with how I am running it?
You can look into train.sh. The 0.01 means the proportion of the training set; if you want to use the normal supervised setting, set it to 1.
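For illustration only, here is a minimal sketch of what a "proportion of training set" argument like this typically does; the function name and sampling details are assumptions, not the repo's actual implementation:

    import random

    def subsample(examples, proportion, seed):
        # Hypothetical helper: deterministically keep `proportion` of the
        # training examples, so each seed selects its own small subset.
        rng = random.Random(seed)
        k = max(1, int(len(examples) * proportion))
        return rng.sample(examples, k)

    # e.g. the 1% setting with seed 42 (proportion=0.01, as in run.sh)
    # train_subset = subsample(train_examples, 0.01, 42)

Because a 1% subset is so small, which examples end up in it depends heavily on the seed, which is why scores in this setting can vary noticeably across runs.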
I know this. But MTB achieves 0.585 F1 with 1% training data in the C+M setting, and I use exactly this setting, yet the results seem wrong.
I can reproduce the BERT version, but MTB seems wrong.
Is the mode C+M? Also, you can try setting max_epoch to 50, because 20 epochs may not be enough to converge.
OK. I will try again. Thank you very much for your help!
One more thing: if the final result differs a little from the paper, that's normal, because the 1% set is very sensitive. But the relative improvement is consistent.
OK. Thank you very much!
OK. The results are normal now. This method even achieves better results than those reported in the paper. Thank you very much.