Pinned Repositories
AByteOfNLP
Some code for an NLP tour
AlignmentServer
API for aligning singing voice to lyrics, as used on www.voicemagix.com. The core machine learning algorithms are MLP neural networks and hidden Markov models. Built on Django REST Framework.
Automatic_Speech_Recognition
End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow
awesome-music-informatics
A curated list of awesome articles, tutorials, libraries, webpages, etc.
Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
DL-AFx
Deep Learning for Black-Box Modeling of Audio Effects - website:
FastImageProcessing
Fast Image Processing with Fully-Convolutional Networks
GPUImage
An open source iOS framework for GPU-based image and video processing
marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure Java
merlin
This is now the official location of the Merlin project.
xzm2004260's Repositories
xzm2004260/awesome-music-informatics
A curated list of awesome articles, tutorials, libraries, webpages, etc.
xzm2004260/ai-audio-startups
Community list of startups working with AI in audio and music technology
xzm2004260/ai-research-code
xzm2004260/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models and Score-based Models, a dark horse in the field of Generative Models
xzm2004260/book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
xzm2004260/course
High-Performance Parallel Programming and Optimization - course slides
xzm2004260/ddsp-singing-vocoders
Official implementation of SawSing (ISMIR'22)
xzm2004260/DeepAFx-ST
DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/
xzm2004260/deepaudio-tts
xzm2004260/diffwave-sr
xzm2004260/DualCycleGAN
Official implementation of DualCycleGAN for nonparallel audio super resolution
xzm2004260/genmusic_demo_list
A list of demo websites for automatic music generation research
xzm2004260/LINNE
(Beta) LInear-predictive Neural Net Encoder -- A lossless audio codec
xzm2004260/midi-ddsp
Synthesis of MIDI with DDSP (https://midi-ddsp.github.io/)
xzm2004260/Mixed_Emotions
This is the code for "Speech Synthesis with Mixed Emotions".
xzm2004260/Muskits
An open-source music processing toolkit
xzm2004260/muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
xzm2004260/onoma-to-wave_transformer
Unofficial implementation of an environmental sound synthesis system with Transformer
xzm2004260/opentts
Open Text to Speech Server
xzm2004260/pop2piano
Official Repo of the paper "Pop2Piano : Pop Audio-based Piano Cover Generation"
xzm2004260/Speech-Editing-Toolkit
It's a repository for implementations of neural speech editing algorithms.
xzm2004260/StarGANv2-VC
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
xzm2004260/study-music
A survey of books, resources and courses to study everything about music and sound in the broadest sense
xzm2004260/survey
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
xzm2004260/Text-to-sound-Synthesis
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
xzm2004260/Unet-TTS
One-shot TTS with Improved Unseen Speaker and Style Transfer
xzm2004260/VI-SVS
Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.
xzm2004260/voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
xzm2004260/Waveformer
An efficient architecture for real-time target sound extraction.
xzm2004260/wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit