Information theory on sequences (probably mostly language modeling and transformers)
Primary LanguagePython