Pinned Repositories
ai-audio-datasets-list
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
BEGANSing
Korean Singing Voice Synthesis based on Auto-regressive Boundary Equilibrium GAN
CVPR2021-Papers-with-Code
Collection of CVPR 2021 papers and open-source projects
emotional-vits
Emotion-controllable speech synthesis model that requires no emotion labels, based on VITS
guided-diffusion
jukebox-diffusion
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
MP-SENet
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
NSF-HIFIGAN
NSF by Wang Xin (王鑫)
Liujingxiu23's Repositories
Liujingxiu23/ai-audio-datasets-list
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
Liujingxiu23/awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Liujingxiu23/jukebox-diffusion
Liujingxiu23/MP-SENet
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
Liujingxiu23/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Liujingxiu23/audio-pipeline
Liujingxiu23/AudioSep
Official implementation of "Separate Anything You Describe"
Liujingxiu23/awesome_LLMs_interview_notes
LLMs interview notes and answers: this repository mainly records interview questions and reference answers for large language model (LLM) algorithm engineers
Liujingxiu23/basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Liujingxiu23/Bert-VITS2
vits2 backbone with multilingual-bert
Liujingxiu23/DeepMIR
Teaching material for the course "Deep Learning for Music Analysis and Generation" I taught at National Taiwan University (2023 Fall)
Liujingxiu23/Diff-BGM
official code for CVPR'24 paper Diff-BGM
Liujingxiu23/diffiner
Liujingxiu23/ggml
Tensor library for machine learning
Liujingxiu23/HeyGenClone
A simple and open-source analogue of the HeyGen system
Liujingxiu23/INTERSPEECH-2023-Papers
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
Liujingxiu23/lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
Liujingxiu23/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Liujingxiu23/open-tts-tracker
Liujingxiu23/parler-tts
Inference and training library for high-quality TTS models.
Liujingxiu23/Qwen-7B
The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud.
Liujingxiu23/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Liujingxiu23/speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
Liujingxiu23/supervoice-gpt
GPT-style network for phonemization with durations of text
Liujingxiu23/tts-generation-webui
TTS Generation Web UI (Bark, MusicGen, Tortoise)
Liujingxiu23/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io
Liujingxiu23/vampnet
music generation with masked transformers!
Liujingxiu23/vits2_pytorch
unofficial vits2-TTS implementation in pytorch
Liujingxiu23/voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
Liujingxiu23/WavJourney
WavJourney: Compositional Audio Creation with LLMs