MichaelDays's Stars
ggerganov/llama.cpp
LLM inference in C/C++
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
karpathy/llama2.c
Inference Llama 2 in one file of pure C
joonspk-research/generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
smol-ai/developer
the first library to let you embed a developer agent in your own app!
rhasspy/piper
A fast, local neural text to speech system
smol-ai/GodMode
AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.
KoboldAI/KoboldAI-Client
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
freedmand/semantra
Multi-tool for semantic search
AI-Engineer-Foundation/agent-protocol
Common interface for interacting with AI agents. The protocol is tech stack agnostic - you can use it with any framework for building agents.
nickm980/smallville
Generative Agents for video games. Based on Generative Agents: Interactive Simulacra of Human Behavior
XiongjieDai/GPU-Benchmarks-on-LLM-Inference
Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?
akashmjn/tinydiarize
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
philipturner/metal-flash-attention
Faster alternative to Metal Performance Shaders
arcee-ai/DALM
Domain Adapted Language Modeling Toolkit - E2E RAG
neonbjb/ocotillo
Performant and accurate speech recognition built on PyTorch
neph1/LlamaTale
Giving the power of LLMs to a MUD lib.
fkodom/dilated-attention-pytorch
(Unofficial) Implementation of dilated attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens" (https://arxiv.org/abs/2307.02486)
Maximilian-Winter/AIRoleplay
Little AI roleplay program
alexisrozhkov/dilated-self-attention
Implementation of the dilated self-attention described in "LongNet: Scaling Transformers to 1,000,000,000 Tokens"