Potential bug in checkpointing
drcege opened this issue · 3 comments
As the title suggests, the issue lies in the following code.
data-juicer/data_juicer/utils/ckpt_utils.py
Lines 86 to 101 in c85e024
When the new process_list is shorter than op_record, Python slicing does not raise an error for an out-of-range index but silently truncates to the maximum available length, so len(prefix_process) < len(self.op_record). Similarly, the zip function stops at the end of the shorter iterable, so only the first len(prefix_process) pairs are compared. As a result, the check_ops_to_skip function incorrectly assumes that the recorded operators match a prefix of the current operator list.
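A minimal sketch of the failure mode, independent of the actual data-juicer code: the two helper functions, the placeholder operator names (op_a, op_b, op_c), and the use of plain equality in place of the project's real comparison logic are all assumptions made for illustration. It shows how slicing plus zip can report a match when the record is longer than the current list, and one possible guard based on an explicit length check.

```python
def is_recorded_prefix_naive(op_record, process_list):
    """Hypothetical reproduction of the pattern described in this issue."""
    # Slicing past the end does not raise; it just returns a shorter list.
    prefix_process = process_list[:len(op_record)]
    # zip stops at the shorter iterable, so extra recorded ops are never compared.
    return all(rec == cur for rec, cur in zip(op_record, prefix_process))


def is_recorded_prefix_strict(op_record, process_list):
    """Hypothetical guard: a longer record can never be a prefix."""
    if len(op_record) > len(process_list):
        return False
    return all(rec == cur for rec, cur in zip(op_record, process_list))


# Three ops were recorded in the checkpoint, but the new config lists only one.
op_record = [{"op_a": {}}, {"op_b": {}}, {"op_c": {}}]
process_list = [{"op_a": {}}]

naive_result = is_recorded_prefix_naive(op_record, process_list)
strict_result = is_recorded_prefix_strict(op_record, process_list)
print(naive_result, strict_result)  # True False
```

The naive check reports a match because only the first pair is ever compared. On Python 3.10 and later, zip(..., strict=True) is another option, since it raises ValueError when the iterables differ in length; whether an exception or an explicit length check fits better depends on how the caller wants to handle a mismatch.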
Yes, that's a problem in this situation. 👍🏻
This issue is marked as stale because there has been no activity for 21 days. Remove the stale label or add a new comment, or this issue will be closed in 3 days.
Close this stale issue.