Bug: wrong vocab index for WFST decoding?

Question

iou2much opened this issue 4 years ago · 3 comments

iou2much commented 4 years ago

https://github.com/athena-team/athena/blob/master/athena/models/mtl_seq2seq.py#L153

predictions = [self.vocab[prediction] - 1 for prediction in words_prediction]

This seems to be a bug; it isn't supposed to subtract 1 here.

iou2much commented 4 years ago

#326

Answer 1 · 2020-11-09T02:45:48.000Z

Yeah, it probably is a bug. The minus 1 was there to compensate for the fact that our previous vocab file didn't start with 0. Can you make sure that removing the minus 1 here gives correct results on AISHELL, and make a PR for us?
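
To illustrate the off-by-one described above, here is a minimal sketch. It assumes the vocab behaves like a plain dict from token to index; the toy vocabularies and the `lookup` helper are hypothetical and are not taken from the Athena code.

```python
# Hypothetical toy data; not the actual Athena vocab files.
words_prediction = ["a", "b", "c"]

# Old-style vocab: indices start at 1, so subtracting 1 gives 0-based ids.
vocab_from_one = {"a": 1, "b": 2, "c": 3}
# New-style vocab: indices already start at 0.
vocab_from_zero = {"a": 0, "b": 1, "c": 2}


def lookup(vocab, words, offset):
    """Map words to ids, subtracting `offset` from every index."""
    return [vocab[word] - offset for word in words]


# With the old vocab, the "- 1" produces the intended 0-based ids.
old_ok = lookup(vocab_from_one, words_prediction, 1)
# With a 0-based vocab, the same "- 1" shifts every id down by one.
new_bad = lookup(vocab_from_zero, words_prediction, 1)
# Dropping the offset restores the intended ids.
new_ok = lookup(vocab_from_zero, words_prediction, 0)

print(old_ok, new_bad, new_ok)
```

Note that the shifted result contains -1, which Python would silently treat as "last element" if it were later used as a list index, so this kind of error may not raise an exception.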

Answer 2 · 2020-11-09T05:45:35.000Z

Yes. I've already tried removing the -1 here and got correct results. I'll make a PR. Thanks.