question about the data processing?
puppyapple opened this issue · 0 comments
puppyapple commented
Thanks for the work.
When I dig deep into the code, I found that in the result of the function 'read_noisy_corpus', all the start token is marked as 'O' but not for the end token; is there any intention specific for this point?