The RWKV Language Model with Token-shift. Better and Faster than usual transformer / GPT.
Primary LanguagePythonBSD 2-Clause "Simplified" LicenseBSD-2-Clause
No issues in this repository yet.