Pinned Repositories
AIF-PyTorch
(NOT Official) Implementation of Auto-regressive Integrate-and-Fire (AIF)
cmas-sample
A simple sample illustrating a circular microphone array separator based on beamforming
ContextNet
TensorFlow 2 implementation of ContextNet, an improved convolutional RNN-transducer-based architecture for end-to-end speech recognition using global context
kaldi
This is now the official location of the Kaldi project.
knn-vc
Voice conversion with just k-nearest neighbors
lwnn
Lightweight Neural Network
mlas
Project_sp_ehance_matlab
VALL-E-X-Trainer
VALL-E-X-Trainer
vc-lm
Convert any person's voice timbre into thousands of different timbres
ishine's Repositories
ishine/kaldi
This is now the official location of the Kaldi project.
ishine/stac-speech-translation
ishine/auorange
Audio LPC (linear predictive coding) using mel spectrogram, compatible with LPCNet
ishine/bark.cpp
Port of Suno AI's Bark in C/C++ for fast inference
ishine/Bert-VITS2
VITS2 backbone with BERT
ishine/ChatTTS
TTS
ishine/CoMoSpeech
ishine/DiSeg
Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"
ishine/dynamic-window-speechformer
ishine/espnet
End-to-End Speech Processing Toolkit
ishine/FunASR
A Fundamental End-to-End Speech Recognition Toolkit
ishine/GenTranslate
Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"
ishine/Lightvoc
LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform
ishine/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.
ishine/noisereduce
Noise reduction in Python using spectral gating (speech, bioacoustics, audio, time-domain signals)
ishine/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
ishine/rwkv.cpp
INT4 and FP16 inference on CPU for RWKV language model
ishine/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
ishine/STAR-Adapt
Code for paper "Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models"
ishine/stream-vc
An unofficial PyTorch implementation of StreamVC (Real-Time Low-Latency Voice Conversion)
ishine/StreamingSpeakerDiarization
Official open source implementation of the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"
ishine/StreamVC
An unofficial PyTorch implementation of "StreamVC: Real-Time Low-Latency Voice Conversion".
ishine/TeleSpeech-ASR
ishine/tinyvc
Lightweight voice conversion
ishine/tortoise.cpp
ishine/UMOE-Scaling-Unified-Multimodal-LLMs
Code for "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"
ishine/valle
Zero-Shot Text-To-Speech
ishine/vallex-webui
An open source implementation of Microsoft's VALL-E X zero-shot TTS model
ishine/Whispering-LLaMA
EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
ishine/X-E-Speech-code
X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion