Pinned Repositories
bert
TensorFlow code and pre-trained models for BERT
bread
deep-subjecthood
Code for running the experiments in Deep Subjecthood: Higher-Order Grammatical Features in Multilingual BERT
except-when-it-matters
injecting-structural-hints
izzy-vim
parse-wiki-dump
Simple script to get the first n tokens out of the latest Wikipedia dump.
tilt-transfer
Code to run the TILT transfer learning experiments
toizzy.github.io
wiki-corpus-creator
A collection of scripts to download, clean, and tokenize a Wikipedia dump and split the corpus into train/val/test sets.
toizzy's Repositories
toizzy/tilt-transfer
Code to run the TILT transfer learning experiments
toizzy/deep-subjecthood
Code for running the experiments in Deep Subjecthood: Higher-Order Grammatical Features in Multilingual BERT
toizzy/wiki-corpus-creator
A collection of scripts to download, clean, and tokenize a Wikipedia dump and split the corpus into train/val/test sets.
toizzy/injecting-structural-hints
toizzy/except-when-it-matters
toizzy/bread
toizzy/parse-wiki-dump
Simple script to get the first n tokens out of the latest Wikipedia dump.
toizzy/toizzy.github.io
toizzy/bert
TensorFlow code and pre-trained models for BERT
toizzy/izzy-vim
toizzy/latex-pset-template
A very simple LaTeX template for university problem sets, with a few iz shortcuts in the preamble.
toizzy/Moro-database
toizzy/wikiextractor
A tool for extracting plain text from Wikipedia dumps
toizzy/nlp-in-ling
Natural Language Processing Research in North American Linguistics Departments