cyanbx

MS student@ZJU | Looking for jobs on speech/audio generation

Zhejiang University

cyanbx's Stars

SuperKogito/SER-datasets
A collection of datasets for the purpose of emotion recognition/detection in speech.
Language:HTML28340
ys1305/ML-hand
各种机器学习算法的手写实现
Language:Python185
huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Language:Python3.1k328
ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language:Python58742
zju-vipa/Odyssey
Odyssey: Empowering Agents with Open-World Skills
Language:JavaScript25512
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Language:Python11.1k1.1k
Akshat4112/SpeakerDiff
SpeakerDiff: Denoising Diffusion Probalistic Models on Speaker Embeddings
Language:Python2
RickyL-2000/ROSVOT
Robust Singing Voice Transcription and MIDI Extraction
Language:Python482
cyanbx/FastLTS
Implementation of FastLTS: Non-Autoregressive End-to-End Unconstrained Lip-to-Speech Synthesis (ACM MM'22)
Language:Python81
joannahong/Lip2Wav-pytorch
a PyTorch implementation of Lip2Wav
Language:Python4810
zehanwang01/FreeBind
Language:Python12
bytedance/Make-An-Audio-2
a text-conditional diffusion probabilistic model capable of generating high fidelity audio.
Language:Python11914
cyanbx/Prompt-Singer
Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).
Language:Python649
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python4.5k384
roger-tseng/av-superb
A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)
Language:Python444
yangdongchao/UniAudio
The Open Source Code of UniAudio
Language:Python50931
ccfddl/ccf-deadlines
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
Language:Vue6k420
LucidaLu/QAOA-with-fewer-qubits
Data and code repository for "QAOA with fewer qubits: a coupling framework to solve larger-scale Max-Cut problem".
Language:Python1
RickyL-2000/AlignSTS
Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment
Language:Python626
pengsida/learning_research
本人的科研经验
5.6k335
revsic/torch-nansy
Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513
Language:Jupyter Notebook639
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Language:Python10k858
Yirui-Wang/ZJU-CSE-Latex
浙江大学控制学院本科生毕业论文Latex模板。
Language:TeX14
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
1.9k67
collabora/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
Language:Jupyter Notebook3.8k207
lyc8503/EasierConnect
NJU EasyConnect 第三方开源 Golang 客户端 / NJU EasyConnect protocol reimplementation in Go
Language:Go534118
Mythologyli/zju-connect
ZJU RVPN 客户端的 Go 语言实现
Language:Go33523
Mythologyli/ZJU-Connect-for-Windows
基于 Qt 编写的 ZJU 网络客户端
Language:C++28117
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
Language:HTML10.8k934
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
Language:Python1.9k167

cyanbx

cyanbx's Stars

SuperKogito/SER-datasets

ys1305/ML-hand

huggingface/speech-to-speech

ddlBoJack/emotion2vec

zju-vipa/Odyssey

lucidrains/DALLE2-pytorch

Akshat4112/SpeakerDiff

RickyL-2000/ROSVOT

cyanbx/FastLTS

joannahong/Lip2Wav-pytorch

zehanwang01/FreeBind

bytedance/Make-An-Audio-2

cyanbx/Prompt-Singer

open-mmlab/Amphion

roger-tseng/av-superb

yangdongchao/UniAudio

ccfddl/ccf-deadlines

LucidaLu/QAOA-with-fewer-qubits

RickyL-2000/AlignSTS

pengsida/learning_research

revsic/torch-nansy

AIGC-Audio/AudioGPT

Yirui-Wang/ZJU-CSE-Latex

archinetai/audio-ai-timeline

collabora/WhisperSpeech

lyc8503/EasierConnect

Mythologyli/zju-connect

Mythologyli/ZJU-Connect-for-Windows

diff-usion/Awesome-Diffusion-Models

archinetai/audio-diffusion-pytorch