Base code and data (csv file) pre-processed using code from:
https://github.com/joosthub/PyTorchNLPBook/tree/master/chapters/chapter_5/5_3_doc_classification
Wordpiece and sentencepiece code adapted from:
https://github.com/rsennrich/subword-nmt/tree/master/subword_nmt