Thesis Outline Background Learning Theory Zipf's Law Unzipfing Data Typical Sets Filtering Language Modeling Methods Results Conclusions TODO research write down implement