Pinned Repositories
FunASR
A Fundamental End-to-End Speech Recognition Toolkit
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
inferflow
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
CTranslate2
Fast inference engine for Transformer models
faster-whisper
Faster Whisper transcription with CTranslate2
tensorrtllm_backend
The Triton TensorRT-LLM Backend
insanely-fast-whisper
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
dyyzhmm's Repositories
dyyzhmm/FunASR
A Fundamental End-to-End Speech Recognition Toolkit
dyyzhmm/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs