/non-parametric-transformers

Code for "Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning"

Primary LanguagePythonApache License 2.0Apache-2.0

Watchers