/Normalized-Information-Payload

codes for paper: What Dense Graph do You Need for Self-attention?

Primary LanguagePython

Watchers