mikayahlevi/mru-lm
A language model forked from my transformer-train-script repo that replaces attention with a novel mechanism called "matrix recurrent units."
Python · Apache-2.0