HuangChengChou

HuangChengChou's Stars

bitcoin/bitcoin
Bitcoin Core integration/staging tree
Language:C++79.2k 4k 8.2k36.3k
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook35.9k 331 4414.2k
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python20.1k 309 1.4k2.5k
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Language:Shell14.3k 693 1.6k5.3k
state-spaces/mamba
Mamba SSM architecture
Language:Python13.1k 98 5401.1k
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language:Jupyter Notebook10.9k 140 3571.1k
adap/flower
Flower: A Friendly Federated AI Framework
Language:Python5.1k 42 584867
microsoft/promptbench
A unified evaluation framework for large language models
Language:Python2.4k 20 54182
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language:Python2.3k 46 398485
microsoft/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
Language:Python1.1k 49 150411
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Language:Python955 9 37175
acl-org/acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
Language:TeX725 8 26179
ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language:Python619 15 4343
pliang279/MultiBench
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
Language:HTML487 16 3569
goatpig/BitcoinArmory
Python-Based Bitcoin Software
Language:C++470 59 335174
ZhangXInFD/SpeechTokenizer
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
Language:Python466 16 1940
audeering/w2v2-how-to
How to use our public wav2vec2 dimensional emotion model
Language:Jupyter Notebook454 9 1647
kyegomez/Gemini
The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google
Language:Python424 12 856
CheyneyComputerScience/CREMA-D
Crowd Sourced Emotional Multimodal Actors Dataset (CREMA-D)
Language:R362 10 7120
facebookresearch/SONAR
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
Language:Python338 14 1934
apple/pfl-research
Simulation framework for accelerating research in Private Federated Learning
Language:Jupyter Notebook291 22 1328
loshchil/AdamW-and-SGDW
Decoupled Weight Decay Regularization (ICLR 2019)
Language:Lua264 7 329
ControlNet/MARLIN
[CVPR] MARLIN: Masked Autoencoder for facial video Representation LearnINg
Language:Python227 9 2420
voidful/Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
Language:Python209 12 1722
dynamic-superb/dynamic-superb
The official repository of Dynamic-SUPERB.
Language:Python155 6 16789
ihp-lab/LibreFace
[WACV 2024] LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis
Language:Python94 3 416
Jonathan-Pearce/calibration_library
Pytorch library for model calibration metrics and visualizations as well as recalibration methods. In progress!
Language:Python68 4 112
ntucllab/imbalanced-DL
A Python Package for Deep Imbalanced Learning
Language:Python52 6 05
lucadellalib/bdl-rul-svgd
Bayesian deep learning for remaining useful life estimation via Stein variational gradient descent
Language:Python18 2 00
prabhat1081/Anxiety-Detection-from-free-form-audio-journals
Repository for CS224S project: Detecting anxiety from short clips of free-form speech
Language:Jupyter Notebook41