yzyouzhang

PhD Candidate at Audio Information Research Lab @ UR

University of Rochester, NY, US

Pinned Repositories

samo
SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING
Language: Python | 37 2 29
CtrSVDD2024_Baseline
Baseline system for SVDD 2024 Challenge CtrSVDD track
Language: Python | 25 2 42
SingFake
Official Repository for "SingFake: Singing Voice Deepfake Detection"
Language: JavaScript | 53 2 37
AIR-ASVspoof
Official implementation of the SPL paper "One-class Learning Towards Synthetic Voice Spoofing Detection"
Language: Jupyter Notebook | 115 4 3132
ASVspoof2021_AIR
Official implementation of our ASVspoof 2021 paper, "UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021"
Language: Python | 53 4 135
Audio_Research_in_US
For students who would like to apply for RA, PhD, postdoc in audio research.
24 4 00
Awesome-Multimedia-Deepfake-Detection
Materials for "Multimedia Deepfake Detection" Tutorial @ ICME 2024
15 1 00
Empirical-Channel-CM
Official Implementation of our Interspeech 2021 paper "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems"
Language: Python | 15 3 31
hrtf_field
Official implementation of the ICASSP 2023 paper "HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields"
Language: Python | 22 2 12
SASV_PR
Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"
Language: Python | 17 2 05

yzyouzhang's Repositories

yzyouzhang/AIR-ASVspoof
Official implementation of the SPL paper "One-class Learning Towards Synthetic Voice Spoofing Detection"
Language: Jupyter Notebook | 115 4 3132
yzyouzhang/ASVspoof2021_AIR
Official implementation of our ASVspoof 2021 paper, "UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021"
Language: Python | 53 4 135
yzyouzhang/Audio_Research_in_US
For students who would like to apply for RA, PhD, postdoc in audio research.
24 4 00
yzyouzhang/hrtf_field
Official implementation of the ICASSP 2023 paper "HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields"
Language: Python | 22 2 12
yzyouzhang/SASV_PR
Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"
Language: Python | 17 2 05
yzyouzhang/Awesome-Multimedia-Deepfake-Detection
Materials for "Multimedia Deepfake Detection" Tutorial @ ICME 2024
15 1 00
yzyouzhang/Empirical-Channel-CM
Official Implementation of our Interspeech 2021 paper "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems"
Language: Python | 15 3 31
yzyouzhang/CS61Bsp18-proj2-byog
Project BYoG for UCB course CS61B Data Structures Spring 2018
Language: Java | 7 1 03
yzyouzhang/HBAS_chapter_voice3
Official implementation of the handbook chapter "Generalizing Voice Presentation Attack Detection to Unseen Synthetic Attacks and Channel Variation"
Language: Python | 4 2 21
yzyouzhang/HRTF_field_norm
Official Implementation of our WASPAA 2023 paper "Mitigating Cross-Database Differences for Learning Unified HRTF Representation"
Language: Python | 1 0 0
yzyouzhang/awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
0 1 00
yzyouzhang/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
0 1 00
yzyouzhang/INFO159-LHW4-Chatbot
A pytorch Chatbot for INFO159 Natural Language Processing
Language: Python | 0 1 00
yzyouzhang/PhaseAntispoofing_INTERSPEECH
Official repository of the Interspeech 2023 paper "Phase perturbation improves channel robustness for speech spoofing countermeasures"
Language: Python | 0 0 01
yzyouzhang/chcochleagram
cochleagram generation code in pytorch
Language: Jupyter Notebook | 0 0
yzyouzhang/CtrSVDD2024_Baseline
Baseline system for SVDD 2024 Challenge CtrSVDD track
Language: Python
yzyouzhang/DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Language: Python | 0 0
yzyouzhang/espnet
End-to-End Speech Processing Toolkit
Language: Python | 1 0
yzyouzhang/FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
Language: Python | 0 0
yzyouzhang/flowtron
Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
Language: Jupyter Notebook | 0 0
yzyouzhang/mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
Language: Jupyter Notebook | 0 0
yzyouzhang/samo
SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING
Language: Python | 0 0
yzyouzhang/serve
Serve, optimize and scale PyTorch models in production
Language: Java | 1 0
yzyouzhang/SingFake
Official Repository for "SingFake: Singing Voice Deepfake Detection"
Language: JavaScript | 0 0
yzyouzhang/SpeechEmotionAVLearning
Language: HTML | 0 0
yzyouzhang/SpeechTasks
This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.
0 0
yzyouzhang/versa
Versatile Evaluation of Speech and Audio
Language: Python
yzyouzhang/words_spoken_daily
Language: Python | 1 0
yzyouzhang/yzyouzhang
introduce You (Neil) Zhang
1 0
yzyouzhang/yzyouzhang.github.io
Language: JavaScript | 2 01