radarFudan/attention_with_linear_biases
Code for the ALiBi method for transformer language models (ICLR 2022)
PythonMIT
No issues in this repository yet.
Code for the ALiBi method for transformer language models (ICLR 2022)
PythonMIT
No issues in this repository yet.