jinmingteo's Stars
eugeneyan/applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
qdrant/qdrant
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
khangich/machine-learning-interview
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
kubernetes/autoscaler
Autoscaling components for Kubernetes
google/diff-match-patch
Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
spotify/pedalboard
🎛 🔊 A Python library for audio.
arcee-ai/mergekit
Tools for merging pretrained large language models.
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
kpu/kenlm
KenLM: Faster and Smaller Language Model Queries
wiseman/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
microsoft/Olive
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
jquesnelle/yarn
YaRN: Efficient Context Window Extension of Large Language Models
Lightning-AI/lightning-thunder
Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
asteroid-team/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
peci1/nvidia-htop
A tool for enriching the output of nvidia-smi.
huggingface/dataspeech
LudwigStumpp/llm-leaderboard
A joint community effort to create one central leaderboard for LLMs.
aisingapore/sealion
South-East Asia Large Language Models
zhenghuatan/rVADfast
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
kurianbenoy/whisper_normalizer
A python package for whisper normalizer
simonw/webvtt-to-json
Convert WebVTT to JSON, optionally removing duplicate lines