Pinned Repositories
distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
AudioClassification
The repository provides a quick demo for training, testing, and inferencing audio classification. It uses two basic models, Random Forest and Gradient Boosting Trees, and takes audio-based gender classification as an example. The code is simple and clear.
awesome-denoiser
This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very friendly for beginners.
xinliu9451's Repositories
xinliu9451/awesome-denoiser
This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very friendly for beginners.
xinliu9451/AudioClassification
The repository provides a quick demo for training, testing, and inferencing audio classification. It uses two basic models, Random Forest and Gradient Boosting Trees, and takes audio-based gender classification as an example. The code is simple and clear.