xiaozhuo12138

Pinned Repositories

3D-sound-panorama
Panoraming a sound object in 3D using HRTF
Language:MATLAB0 0 00
3dti_AudioToolkit
3D Tune-In Toolkit is a custom open-source C++ library developed within the EU-funded project 3D Tune-In. The Toolkit provides a high level of realism and immersiveness within binaural 3D audio simulations, while allowing for the emulation of hearing aid devices and of different typologies of hearing loss.
Language:C++0 0 00
A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement
Implement A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement by PyTorch.
Language:Python0 0 00
acoustic-model
Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
Language:Python0 0 00
AEC-Challenge
AEC Challenge
0 0 00
agc
Provides automatic gain control to normalize power levels for real or complex signals
Language:C++0 1 01
microphoneArray
Language:C1 0 00
open-unmix-tensorflow
open unmix - music source separation for tensorflow
1 0 01
pianotrans
Simple GUI for ByteDance's Piano Transcription with Pedals
Language:PowerShell1 0 00
PitchNet
An unofficial implementation of the paper titled "PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network".
Language:Python27 1 03

xiaozhuo12138's Repositories

xiaozhuo12138/Applio
VITS-based Voice Conversion focused on simplicity, quality and performance.
Language:Python0 0
xiaozhuo12138/AudioLDM2
Text-to-Audio/Music Generation
Language:Python0 0
xiaozhuo12138/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language:Python0 0
xiaozhuo12138/BABE2
Language:Python0 0
xiaozhuo12138/beat_this
Accurate and general beat tracker
Language:Python0 0
xiaozhuo12138/beatrice-trainer-colab
Language:Jupyter Notebook0 0
xiaozhuo12138/ccmusic-database.github.io
This platform is a multi-functional music data sharing platform for academic research. It contains many music datas such as the sound information of Chinese traditional musical instruments and the labeling information of Chinese pop music, which is available for free use by MIR researchers.
Language:HTML0 0
xiaozhuo12138/ChordSync
Code for ChordSync, a conformer-based audio-to-chord synchroniser
xiaozhuo12138/CoMoSVC
CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone
Language:Python0 0
xiaozhuo12138/DelayCat
DelayCat Feature Based Delay Line Audio Plugin
Language:Jupyter Notebook0 0
xiaozhuo12138/dry_sing_multi_eval
Five-Dimensional Acapella Singing Evaluation System based on funASR, include pronunciation, pitch accuracy, rhythm, fluency, and emotion.
Language:Python0 0
xiaozhuo12138/FCPE
Language:Python0 0
xiaozhuo12138/FreeV
[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
Language:Python0 0
xiaozhuo12138/FSPEN
Language:Python0 0
xiaozhuo12138/FxNorm-automix
FxNorm-Automix - Implementation of automatic music mixing systems. We show how we can use wet music data and repurpose it to train a fully automatic mixing system
Language:Python0 0
xiaozhuo12138/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python0 0
xiaozhuo12138/grok-1
Grok open release
Language:Python0 0
xiaozhuo12138/gtcrn
The official implementation of GTCRN, an ultra-lite speech enhancement model.
Language:Python0 0
xiaozhuo12138/hilcodec
xiaozhuo12138/icefall
Language:Python0 0
xiaozhuo12138/mustango
Mustango: Toward Controllable Text-to-Music Generation
Language:Python0 0
xiaozhuo12138/muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
Language:Python0 0
xiaozhuo12138/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python0 0
xiaozhuo12138/openWakeWord
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.
Language:Jupyter Notebook0 0
xiaozhuo12138/POPDG
Data and PopDanceSet are coming soon.
Language:Python0 0
xiaozhuo12138/sherpa-onnx
Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
Language:C++0 0
xiaozhuo12138/stream-vc
An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)
Language:Python0 0
xiaozhuo12138/StreamVC
An unofficial pytorch implementation of "STREAMVC: REAL-TIME LOW-LATENCY VOICE CONVERSION".
xiaozhuo12138/tinyvc
a lightweight voice conversion
Language:Python0 0
xiaozhuo12138/XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web
Language:C0 0