Pinned Repositories
AdaIN-VC
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
articulatory_inversion
ARU-Net
ASR_mixed
ASR system for mixed Chinese and English based on Kaldi multi_cn
asr_preprocessing
Python implementation of pre-processing for End-to-End speech recognition
attention-lvcsr
End-to-End Attention-Based Large Vocabulary Speech Recognition
attention-ocr
A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.
audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
DigitalAudioEffects
MATLAB DSP Scripts for algorithmic reverb, flanger and compression.
jupinter's Repositories
jupinter/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
jupinter/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
jupinter/audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
jupinter/chinese-poetry
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
jupinter/CosyVoice
LLM based TTS model, providing inference/training/deployment full-stack ability.
jupinter/EduCDM
The Model Zoo of Cognitive Diagnosis Models, including classic Item Response Ranking (IRT), Multidimensional Item Response Ranking (MIRT), Deterministic Input, Noisy "And" model(DINA), and advanced Fuzzy Cognitive Diagnosis Framework (FuzzyCDF), Neural Cognitive Diagnosis Model (NCDM) and Item Response Ranking framework (IRR).
jupinter/face-vid2vid
Unofficial implementation of One-Shot Free-View Neural Talking Head Synthesis
jupinter/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
jupinter/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
jupinter/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
jupinter/HierSpeechpp
The official implementation of HierSpeech++
jupinter/http_server_cpp
C++的http服务器,简单好用
jupinter/hubert
HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
jupinter/jtubespeech
jupinter/manim
Animation engine for explanatory math videos
jupinter/mfa_conformer
jupinter/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
jupinter/NaturalSpeech2
jupinter/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
jupinter/Neural_Cognitive_Diagnosis-NeuralCD
jupinter/One-Shot_Free-View_Neural_Talking_Head_Synthesis
Unofficial pytorch implementation of paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing"
jupinter/prml
Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop
jupinter/retinaface
RetinaFace: Deep Face Detection Library for Python
jupinter/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
jupinter/so-vits-svc
SoftVC VITS Singing Voice Conversion
jupinter/Sonic
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
jupinter/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
jupinter/StarGANv2-VC
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
jupinter/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
jupinter/wav2vec2mdd-Text