Pinned Repositories
Audio_Network_Dissection
acl2016-convincing-arguments
Code and data for ACL2016 article "Which argument is more convincing? Analyzing and predicting convincingness of Web arguments using bidirectional LSTM" by Ivan Habernal and Iryna Gurevych
AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
audioset-processing
Toolkit for downloading and processing Google's AudioSet dataset.
Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
Computer-Network-Programming-Secure
Socket Programming with OpenSSL
Crawler
Some useful downloaders and other tools
descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
dynamic-superb
The official repository of Dynamic-SUPERB.
Whisper-and-ChatGPT
YuXiangLin1234's Repositories
YuXiangLin1234/Whisper-and-ChatGPT
YuXiangLin1234/acl2016-convincing-arguments
Code and data for ACL2016 article "Which argument is more convincing? Analyzing and predicting convincingness of Web arguments using bidirectional LSTM" by Ivan Habernal and Iryna Gurevych
YuXiangLin1234/AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
YuXiangLin1234/audioset-processing
Toolkit for downloading and processing Google's AudioSet dataset.
YuXiangLin1234/Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
YuXiangLin1234/Computer-Network-Programming-Secure
Socket Programming with OpenSSL
YuXiangLin1234/Crawler
Some useful downloaders and other tools
YuXiangLin1234/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
YuXiangLin1234/dynamic-superb
The official repository of Dynamic-SUPERB.
YuXiangLin1234/FAcodec
Training code for FAcodec presented in NaturalSpeech3
YuXiangLin1234/ICML-rebuttal
YuXiangLin1234/majong-offline
YuXiangLin1234/Note-Website
A website for sharing notes
YuXiangLin1234/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
YuXiangLin1234/Textless-NER
YuXiangLin1234/Languagecodec
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models
YuXiangLin1234/metavoice-src
Foundational model for human-like, expressive TTS
YuXiangLin1234/personal-website
YuXiangLin1234/PL-BERT
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
YuXiangLin1234/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
YuXiangLin1234/TextRL
YuXiangLin1234/transformers-whisper
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
YuXiangLin1234/trl
Train transformer language models with reinforcement learning.
YuXiangLin1234/twcc-hpc
YuXiangLin1234/Unit2Mel
YuXiangLin1234/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
YuXiangLin1234/yt-dl-for-avsr