c9o's Stars
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
exo-explore/exo
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
ggerganov/ggml
Tensor library for machine learning
google/gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
OpenBMB/llama.cpp
Port of Facebook's LLaMA model in C/C++
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
apple/corenet
CoreNet: A library for training deep neural networks
netease-youdao/QAnything
Question and Answer based on Anything.
ollama/ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter
OpenTalker/SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
nyadla-sys/whisper.tflite
Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
usefulsensors/openai-whisper
Robust Speech Recognition via Large-Scale Weak Supervision
yhirose/cpp-httplib
A C++ header-only HTTP/HTTPS server and client library
Codium-ai/pr-agent
🚀CodiumAI PR-Agent: An AI-Powered 🤖 Tool for Automated Pull Request Analysis, Feedback, Suggestions and More! 💻🔍
lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
k2-fsa/icefall
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
rhasspy/piper
A fast, local neural text to speech system
lencx/Noi
🚀 Power Your World with AI - Explore, Extend, Empower.
facebookresearch/fairseq2
FAIR Sequence Modeling Toolkit 2
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
PlayVoice/whisper-vits-svc
Core Engine of Singing Voice Conversion & Singing Voice Clone
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
MooreThreads/Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
janhq/jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
ufal/whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation