c9o

c9o's Stars

mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Language:Python2.2k162
exo-explore/exo
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Language:Python5.1k235
ggerganov/ggml
Tensor library for machine learning
Language:C++10.5k972
google/gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
Language:C++5.8k496
OpenBMB/llama.cpp
Port of Facebook's LLaMA model in C/C++
Language:C++388
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language:Python3.4k251
apple/corenet
CoreNet: A library for training deep neural networks
Language:Python6.9k527
netease-youdao/QAnything
Question and Answer based on Anything.
Language:Python10.9k1k
ollama/ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
Language:Go82.3k6.3k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python21k2k
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter
Language:C++2.6k302
OpenTalker/SadTalker
[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Language:Python11.3k2.1k
nyadla-sys/whisper.tflite
Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices
Language:C++13027
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Language:Python24.3k5k
usefulsensors/openai-whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:C5823
yhirose/cpp-httplib
A C++ header-only HTTP/HTTPS server and client library
Language:C++12.4k2.2k
Codium-ai/pr-agent
🚀CodiumAI PR-Agent: An AI-Powered 🤖 Tool for Automated Pull Request Analysis, Feedback, Suggestions and More! 💻🔍
Language:Python5.1k460
lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
Language:Python911206
k2-fsa/icefall
Language:Python849276
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
7.3k878
rhasspy/piper
A fast, local neural text to speech system
Language:C++5.3k373
lencx/Noi
🚀 Power Your World with AI - Explore, Extend, Empower.
Language:JavaScript5.7k408
facebookresearch/fairseq2
FAIR Sequence Modeling Toolkit 2
Language:Python64065
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language:Jupyter Notebook10.6k1k
PlayVoice/whisper-vits-svc
Core Engine of Singing Voice Conversion & Singing Voice Clone
Language:Python2.6k914
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
Language:TypeScript73.5k58.3k
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Language:Python6.2k918
MooreThreads/Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
Language:Python3k231
janhq/jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
Language:TypeScript21.2k1.2k
ufal/whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
Language:Python1.5k187