microsoft/rat-sql

The process is killed every 280 steps

Opened this issue · 2 comments

How can I change the total number of steps

For me, the training process was killed just after 1 epoch. @1999zhouwei Were you able to fix it?

Same here!@
'To use data.metrics please install scikit-learn. See https://scikit-learn.org/stable/index.html
[2023-01-11T06:48:14] Logging to logdir/bert_run/bs=6,lr=7.4e-04,bert_lr=3.0e-06,end_lr=0e0,att=1
[2023-01-11T06:49:57] Step 0 stats, train: loss = 161.15000915527344
[2023-01-11T06:51:04] Step 0 stats, val: loss = 195.58768463134766
Killed
'

Anyone knows how to fix it please? Thanks!