/FasterTransformer4CodeFuse

High-performance LLM inference based on our optimized version of FastTransfomer

Primary LanguageC++OtherNOASSERTION