luxiaolululu

luxiaolululu's Stars

AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Language: Python · 10k · 856
BayesWitnesses/m2cgen
Transform ML models into native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies
Language: Python · 2.8k · 241
dmlc/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Language: C++ · 26.1k · 8.7k
tky823/DNN-based_source_separation
A PyTorch implementation of DNN-based source separation.
Language:Python28448
axinc-ai/ailia-models
The collection of pre-trained, state-of-the-art AI models for ailia SDK
Language: Python · 2k · 318
aask1357/hilcodec
High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec
Language:Jupyter Notebook636
babysor/MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time
Language: Python · 34.9k · 5.2k
csteinmetz1/pyloudnorm
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
Language:Python62155
maggie0830/DCCRN
PyTorch implementation of "DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement"
18031
tyiannak/pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
Language: Python · 5.8k · 1.2k
fazledyn/gender-classification-from-audio-clips
In this project, we built a machine learning model that can identify the gender of a person from their voice recording.
Language:Jupyter Notebook41
primaryobjects/voice-gender
Gender recognition by voice and speech analysis
Language:R338102
ina-foss/inaSpeechSegmenter
CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
Language:Python733126
huckiyang/awesome-neural-reprogramming-prompting
A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022
Language:Python35
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
57926
nii-yamagishilab/Extended_VQVAE
Language:Python6318
Rikorose/DeepFilterNet
Noise suppression using deep filtering
Language: Python · 2.4k · 218
maum-ai/univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
Language:Python26146
audiolabs/torch-pesq
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
Language:Python13613
egrinstein/roomfuser
Acoustic impulse response generation using diffusion models
Language:Jupyter Notebook58
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language: Python · 5.9k · 644
FengQuanLi/WZCQ
Train an AI to play Honor of Kings using policy-gradient-based reinforcement learning
Language: Python · 1.5k · 387
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language: Python · 11.5k · 2.4k
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Language: Python · 3.4k · 305
warmsound/crystal-face
Garmin Connect IQ watch face
Language:Monkey C380124
RoyJames/room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
Language:Shell38628
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Language: Shell · 14.1k · 5.3k
locuslab/TCN
Sequence modeling benchmarks and temporal convolutional networks
Language: Python · 4.1k · 874
RVC-Boss/GPT-SoVITS
Just 1 minute of voice data is enough to train a good TTS model (few-shot voice cloning)
Language: Python · 32.6k · 3.8k
DataTalksClub/data-engineering-zoomcamp
Free Data Engineering course!
Language: Jupyter Notebook · 24.5k · 5.3k