HuangChengChou's Stars
bitcoin/bitcoin
Bitcoin Core integration/staging tree
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
state-spaces/mamba
Mamba SSM architecture
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
adap/flower
Flower: A Friendly Federated AI Framework
microsoft/promptbench
A unified evaluation framework for large language models
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
microsoft/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
acl-org/acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
pliang279/MultiBench
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
goatpig/BitcoinArmory
Python-Based Bitcoin Software
ZhangXInFD/SpeechTokenizer
This is the code for the SpeechTokenizer presented in "SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models". Samples are presented on
audeering/w2v2-how-to
How to use our public wav2vec2 dimensional emotion model
kyegomez/Gemini
The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google
CheyneyComputerScience/CREMA-D
Crowd Sourced Emotional Multimodal Actors Dataset (CREMA-D)
facebookresearch/SONAR
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
apple/pfl-research
Simulation framework for accelerating research in Private Federated Learning
loshchil/AdamW-and-SGDW
Decoupled Weight Decay Regularization (ICLR 2019)
ControlNet/MARLIN
[CVPR] MARLIN: Masked Autoencoder for facial video Representation LearnINg
voidful/Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
dynamic-superb/dynamic-superb
The official repository of Dynamic-SUPERB.
ihp-lab/LibreFace
[WACV 2024] LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis
Jonathan-Pearce/calibration_library
PyTorch library for model calibration metrics and visualizations, as well as recalibration methods. In progress!
ntucllab/imbalanced-DL
A Python Package for Deep Imbalanced Learning
lucadellalib/bdl-rul-svgd
Bayesian deep learning for remaining useful life estimation via Stein variational gradient descent
prabhat1081/Anxiety-Detection-from-free-form-audio-journals
Repository for CS224S project: Detecting anxiety from short clips of free-form speech