wsstriving

Shanghai Jiao Tong University

wsstriving's Stars

2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python26.9k 164 3462.9k
kenjihiranabe/The-Art-of-Linear-Algebra
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
Language:PostScript15.7k 136 142k
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Language:Python11k 162 217748
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language:Jupyter Notebook10.5k 139 3331k
chenzomi12/AISystem
AISystem 主要是指AI系统，包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Language:Jupyter Notebook9.4k 134 311.3k
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Language:C++7.6k 74 148404
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python5.6k 46 73501
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook5.5k 68 972724
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Language:Python4.2k 60 167525
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:Python3.9k 90 9871k
Camb-ai/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
Language:Python2.1k 23 33163
bshall/knn-vc
Voice Conversion With Just Nearest Neighbors
Language:Python427 14 3564
karaokenerds/python-audio-separator
Easy to use vocal separation from CLI or as a python package, using a variety of amazing models (primarily trained by @Anjok07 as part of UVR)
Language:Python297 8 6350
KdaiP/StableTTS
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
Language:Python285 26 1230
huangwb8/ChineseResearchLaTeX
**科研常用LaTeX模板集
Language:TeX233 7 729
quickvc/QuickVC-VoiceConversion
QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion
Language:Python208 22 2024
wavmark/wavmark
AI-based Audio Watermarking Tool
Language:Python188 8 1226
zhenye234/CoMoSpeech
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
Language:Python164 11 1017
Grace9994/CoMoSVC
CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone
Language:Python114 3 1217
Vincent-ZHQ/CA-MSER
Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information
Language:Python114 2 1914
thuhcsi/SECap
Language:Python104 2 69
line/LibriTTS-P
LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning
93 8 11
yukara-ikemiya/friendly-stable-audio-tools
Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.
Language:Python86 3 16
flinkerlab/neural_speech_decoding
Language:Jupyter Notebook8310
liyunlongaaa/NSD-MS2S
CHIME-7 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture
Language:Shell55 3 73
DigitalPhonetics/speaker-anonymization
Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.
Language:Python46 5 44
RicherMans/Dasheng
Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"
Language:Python312
ARDiT-TTS/ardit-tts.github.io
Language:HTML171
xjchenGit/SingGraph
Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).
Language:Python7 1 21
npuichigo/snake
Data loading with combined async Rust stream and Python
Language:Rust5 2 0

wsstriving

wsstriving's Stars

2noise/ChatTTS

kenjihiranabe/The-Art-of-Linear-Algebra

openai/tiktoken

facebookresearch/seamless_communication

chenzomi12/AISystem

SJTU-IPADS/PowerInfer

facebookresearch/DiT

pyannote/pyannote-audio

Zejun-Yang/AniPortrait

wenet-e2e/wenet

Camb-ai/MARS5-TTS

bshall/knn-vc

karaokenerds/python-audio-separator

KdaiP/StableTTS

huangwb8/ChineseResearchLaTeX

quickvc/QuickVC-VoiceConversion

wavmark/wavmark

zhenye234/CoMoSpeech

Grace9994/CoMoSVC

Vincent-ZHQ/CA-MSER

thuhcsi/SECap

line/LibriTTS-P

yukara-ikemiya/friendly-stable-audio-tools

flinkerlab/neural_speech_decoding

liyunlongaaa/NSD-MS2S

DigitalPhonetics/speaker-anonymization

RicherMans/Dasheng

ARDiT-TTS/ardit-tts.github.io

xjchenGit/SingGraph

npuichigo/snake