JuneAndJuly's Stars
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
yeyupiaoling/Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
FunAudioLLM/CosyVoice
Multilingual large voice generation model with full-stack inference, training, and deployment capabilities.
mapull/chinese-dictionary
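Chinese Hanyu Pinyin dictionary: character pinyin dictionary, word dictionary, idiom dictionary, and a database of common characters and polyphonic characters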
520hacker/awesome-ai
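A study of the ecosystem of open-source AI relay and wrapper applications: collects open-source AI relay and wrapper applications and compares them. ChatGPT, OPENAI, AZURE, BAIDU, XUNFEI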
OpenNMT/CTranslate2
Fast inference engine for Transformer models
a16z-infra/ai-town
An MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.
chenzomi12/AISystem
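AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks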
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
NAOSI-DLUT/Campus2024
A roundup of campus recruitment information from internet companies for the class of 2024
davidmrau/mixture-of-experts
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
fxsjy/jieba
Jieba Chinese text segmentation
Mleader2/bert_music_correct
Intent recognition, slot filling, and slot-value correction model for music-domain corpora
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
strawberrypie/bert_adapter
Implementation of the paper Parameter-Efficient Transfer Learning for NLP, Houlsby [Google], 2019. Published in ICML 2019.
dongzelian/SSF
[NeurIPS'22] This is an official implementation for "Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning".
OI-wiki/OI-wiki
🌟 Wiki of OI / ICPC for everyone. (An online strategy guide for a certain large-scale game, featuring dazzling arithmetic magic)
k2-fsa/k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
zyzisyz/mfa_conformer
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
huyanxin/phasen
An unofficial PyTorch implementation of Microsoft's PHASEN
HobbitLong/SupContrast
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
vocaliodmiku/wav2vec2mdd-Text
ReneeYe/ConST
Code for the paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)
zzpDapeng/speech_data_augment
A summary of speech data augmentation algorithms
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
adlered/CSDNGreener
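"Professional Team" 🕺🏿 🕺🏿 🕺🏿 🕺🏿 ⚰️🕺🏿 🕺🏿 🕺🏿 🕺🏿 | Cures CSDN ads and all kinds of soul-crushing annoyances | 🐵 Userscript | TamperMonkey | Chrome | FireFox | Completely filters and cleans up floating ads on CSDN pages | The strongest CSDN cleanup script on the Chinese server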
aismlv/zindi-ai4d-wolof
4th place solution to Zindi's low-resource automatic speech recognition competition