howkhang's Stars
open-webui/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
continuedev/continue
⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other building blocks
DS4SD/docling
Get your documents ready for gen AI
VikParuchuri/marker
Convert PDF to markdown + JSON quickly with high accuracy
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
jingyaogong/minimind
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
cvat-ai/cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
microsoft/BitNet
Official inference framework for 1-bit LLMs
neuml/txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
vikhyat/moondream
tiny vision language model
lancedb/lancedb
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
MadcowD/ell
A language model programming library.
facebookresearch/lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
pytorch/torchtitan
A PyTorch native library for large model training
allenai/open-instruct
AllenAI's post-training codebase
getzep/graphiti
Build and query dynamic, temporally-aware Knowledge Graphs
huggingface/chat-macOS
Making the community's best AI chat models available to everyone.
PufferAI/PufferLib
Simplifying reinforcement learning for complex game environments
AnswerDotAI/ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
mlc-ai/xgrammar
Fast, Flexible and Portable Structured Generation
PrithivirajDamodaran/FlashRank
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.
google/langfun
OO for LLMs
ServiceNow/BrowserGym
🌎💪 BrowserGym, a Gym environment for web task automation
LMCache/LMCache
10x Faster Long-Context LLM By Smart KV Cache Optimizations
texttron/tevatron
Tevatron - A flexible toolkit for neural retrieval research and development.
microsoft/Trace
End-to-end Generative Optimization for AI Agents
aiverify-foundation/moonshot
Moonshot - A simple and modular tool to evaluate and red-team any LLM application.
thunlp/LLMxMapReduce
lechmazur/confabulations
Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.
webis-de/lightning-ir
One-stop shop for running and fine-tuning transformer-based language models for retrieval