/mru-lm

An LM forked from my transformer-train-script repo that replaces attention with a novel idea called "matrix recurrent units."

Primary LanguagePythonApache License 2.0Apache-2.0

Watchers