jinmingteo

Singapore

jinmingteo's Stars

eugeneyan/applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
27.3k 948 243.7k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python22.3k 187 5042.2k
qdrant/qdrant
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Language:Rust20.6k 126 1.3k1.4k
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
Language:Python19.2k 174 1.4k1.6k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python14.2k 119 1.1k1.3k
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python12.1k 206 2.3k2.5k
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language:Python11.2k 183 1.9k1.9k
khangich/machine-learning-interview
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.
9.8k 223 41.6k
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python8.9k 135 1.1k1.4k
kubernetes/autoscaler
Autoscaling components for Kubernetes
Language:Go8.1k 139 2.3k4k
google/diff-match-patch
Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.
Language:Python7.5k 116 1101.1k
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
Language:Python7k 77 610721
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python7k 65 1.2k740
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6.3k 71 993781
spotify/pedalboard
🎛 🔊 A Python library for audio.
Language:C++5.2k 58 191262
arcee-ai/mergekit
Tools for merging pretrained large language models.
Language:Python4.8k 52 317439
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language:Python3.6k 65 104294
kpu/kenlm
KenLM: Faster and Smaller Language Model Queries
Language:C++2.5k 70 371511
wiseman/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
Language:C2.1k 50 82409
microsoft/Olive
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
Language:Python1.6k 31 200168
jquesnelle/yarn
YaRN: Efficient Context Window Extension of Large Language Models
Language:Python1.4k 14 56118
Lightning-AI/lightning-thunder
Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
Language:Python1.2k 34 54380
asteroid-team/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
Language:Python960 12 10688
peci1/nvidia-htop
A tool for enriching the output of nvidia-smi.
Language:Python540 10 1459
huggingface/dataspeech
Language:Python303 13 1546
LudwigStumpp/llm-leaderboard
A joint community effort to create one central leaderboard for LLMs.
Language:Python285 9 1126
aisingapore/sealion
South-East Asia Large Language Models
Language:Shell268 22 517
zhenghuatan/rVADfast
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
Language:Python126 8 223
kurianbenoy/whisper_normalizer
A python package for whisper normalizer
Language:Jupyter Notebook44 5 117
simonw/webvtt-to-json
Convert WebVTT to JSON, optionally removing duplicate lines
Language:Python10 2 22

jinmingteo

jinmingteo's Stars

eugeneyan/applied-ml

hpcaitech/Open-Sora

qdrant/qdrant

mlc-ai/mlc-llm

Dao-AILab/flash-attention

NVIDIA/NeMo

PaddlePaddle/PaddleSpeech

khangich/machine-learning-interview

speechbrain/speechbrain

kubernetes/autoscaler

google/diff-match-patch

modelscope/modelscope

modelscope/FunASR

pyannote/pyannote-audio

spotify/pedalboard

arcee-ai/mergekit

huggingface/distil-whisper

kpu/kenlm

wiseman/py-webrtcvad

microsoft/Olive

jquesnelle/yarn

Lightning-AI/lightning-thunder

asteroid-team/torch-audiomentations

peci1/nvidia-htop

huggingface/dataspeech

LudwigStumpp/llm-leaderboard

aisingapore/sealion

zhenghuatan/rVADfast

kurianbenoy/whisper_normalizer

simonw/webvtt-to-json