marcoyang1998's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
meta-llama/llama
Inference code for Llama models
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
janishar/mit-deep-learning-book-pdf
MIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
espnet/espnet
End-to-End Speech Processing Toolkit
imfunniee/gitfolio
:octocat: personal website + blog for every github user
nianticlabs/monodepth2
[ICCV 2019] Monocular depth estimation from a single image
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
adambielski/siamese-triplet
Siamese and triplet networks with online pair/triplet mining in PyTorch
dragen1860/Deep-Learning-with-PyTorch-Tutorials
深度学习与PyTorch入门实战视频教程 配套源代码和PPT
k2-fsa/k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
k2-fsa/sherpa-ncnn
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.
bytedance/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
graykode/gpt-2-Pytorch
Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation
k2-fsa/icefall
lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
k2-fsa/sherpa
Speech-to-text server framework with next-gen Kaldi
k2-fsa/libriheavy
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
tjmoon0104/pytorch-tiny-imagenet
pytorch-tiny-imagenet
SpeechColab/GigaSpeech2
An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement
k2-fsa/text_search
Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup
cambridge-mlg/mphil-intro-module
Jupyter notebooks on inference, regression and classification for MPhil students
k2-fsa/multi_quantization
dlchao/FluTE
Agent-based influenza epidemic model
HilbertXu/MAML-Tensorflow
Tensorflow r2.1 reimplementation of Model-Agnostic Meta-Learning
k2-fsa/divide_lm
marcoyang1998/icefall