Location:China
A series of large language models developed by Baichuan Intelligent Technology
A high-throughput and memory-efficient inference and serving engine for LLMs
CUDA Templates for Linear Algebra Subroutines