Liujingxiu23

Pinned Repositories

ai-audio-datasets-list
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
1 0 00
awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
1 0 00
BEGANSing
Korean Singing Voice Synthesis based on Auto-regressive Boundary Equilibrium GAN
Language:Python1 0 00
CVPR2021-Papers-with-Code
CVPR 2021 论文和开源项目合集
1 0 01
emotional-vits
无需情感标注的情感可控语音合成模型，基于VITS
Language:Jupyter Notebook1 0 00
guided-diffusion
Language:Python1 0 00
jukebox-diffusion
Language:Python1 0 00
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
Language:Python2 0 00
MP-SENet
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
Language:Python1 0 00
NSF-HIFIGAN
NSF by 王鑫
Language:Python3 0 00

Liujingxiu23's Repositories

Liujingxiu23/ai-audio-datasets-list
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
1 0 00
Liujingxiu23/awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
1 0 00
Liujingxiu23/jukebox-diffusion
Language:Python1 0 00
Liujingxiu23/MP-SENet
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
Language:Python1 0 00
Liujingxiu23/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Language:Python0 0
Liujingxiu23/audio-pipeline
Language:Python0 0
Liujingxiu23/AudioSep
Official implementation of "Separate Anything You Describe"
Language:Python0 0
Liujingxiu23/awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型（LLMs）算法工程师相关的面试题和参考答案
0 0
Liujingxiu23/basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Language:Python0 0
Liujingxiu23/Bert-VITS2
vits2 backbone with multilingual-bert
Language:Python0 0
Liujingxiu23/DeepMIR
Teaching material for the course "Deep Learning for Music Analysis and Generation" I taught at National Taiwan University (2023 Fall)
0 0
Liujingxiu23/Diff-BGM
official code for CVPR'24 paper Diff-BGM
Liujingxiu23/diffiner
Language:Python0 0
Liujingxiu23/ggml
Tensor library for machine learning
Language:C0 0
Liujingxiu23/HeyGenClone
A simple and open-source analogue of the HeyGen system
Language:Python0 0
Liujingxiu23/INTERSPEECH-2023-Papers
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
0 0
Liujingxiu23/lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
Language:Python0 0
Liujingxiu23/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Language:Python0 0
Liujingxiu23/open-tts-tracker
0 0
Liujingxiu23/parler-tts
Inference and training library for high-quality TTS models.
Language:Python0 0
Liujingxiu23/Qwen-7B
The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python0 0
Liujingxiu23/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language:Python0 0
Liujingxiu23/speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
Language:Python0 0
Liujingxiu23/supervoice-gpt
GPT-style network for phonemization with durations of text
Language:Python0 0
Liujingxiu23/tts-generation-webui
TTS Generation Web UI (Bark, MusicGen, Tortoise)
Language:Python0 0
Liujingxiu23/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
Language:Python1 0
Liujingxiu23/vampnet
music generation with masked transformers!
Language:Python0 0
Liujingxiu23/vits2_pytorch
unofficial vits2-TTS implementation in pytorch
Language:Jupyter Notebook0 0
Liujingxiu23/voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
Liujingxiu23/WavJourney
WavJourney: Compositional Audio Creation with LLMs
0 0