luxiaolululu's Stars
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
BayesWitnesses/m2cgen
Transform ML models into a native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies
dmlc/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
tky823/DNN-based_source_separation
A PyTorch implementation of DNN-based source separation.
axinc-ai/ailia-models
The collection of pre-trained, state-of-the-art AI models for ailia SDK
aask1357/hilcodec
High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec
babysor/MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
csteinmetz1/pyloudnorm
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
maggie0830/DCCRN
implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch
tyiannak/pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
fazledyn/gender-classification-from-audio-clips
In this project, we built a machine learning model that can identify the gender of a person from their voice recording.
primaryobjects/voice-gender
Gender recognition by voice and speech analysis
ina-foss/inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
huckiyang/awesome-neural-reprogramming-prompting
A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
nii-yamagishilab/Extended_VQVAE
Rikorose/DeepFilterNet
Noise supression using deep filtering
maum-ai/univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
audiolabs/torch-pesq
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
egrinstein/roomfuser
Acoustic impulse response generation using diffusion models
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
FengQuanLi/WZCQ
用基于策略梯度得强化学习方法训练AI玩王者荣耀
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
warmsound/crystal-face
Garmin Connect IQ watch face
RoyJames/room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
locuslab/TCN
Sequence modeling benchmarks and temporal convolutional networks
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
DataTalksClub/data-engineering-zoomcamp
Free Data Engineering course!