Why does the author treat 'where value' as a classification problem?
shaomai00 opened this issue · 0 comments
shaomai00 commented
The start and end token are treated as a classification problem and use cross-entropy loss. Have you tried nll loss and treated it as a tagging problem? Thank you for the great work. ^^