fishead's Stars
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models with support for multiple inference backends.
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by the Qwen team, Alibaba Cloud.
juliansteenbakker/mobile_scanner
A universal scanner for Flutter based on MLKit. Uses CameraX on Android and AVFoundation on iOS.
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
SylphAI-Inc/AdalFlow
AdalFlow: The library to build & auto-optimize LLM applications.
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
zou-group/textgrad
TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.
rrousselGit/riverpod
A reactive caching and data-binding framework. Riverpod makes working with asynchronous code a breeze.
gusye1234/nano-graphrag
A simple, easy-to-hack GraphRAG implementation
HKUDS/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Whisky-App/Whisky
A modern Wine wrapper for macOS built with SwiftUI
utmapp/UTM
Virtual machines for iOS and macOS
iamkun/dayjs
⏰ Day.js 2kB immutable date-time library alternative to Moment.js with the same modern API
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
dottxt-ai/outlines
Structured Text Generation
Mirascope/mirascope
LLM abstractions that aren't obstructions
fastapi-users/fastapi-users
Ready-to-use and customizable user management for FastAPI
ggerganov/llama.cpp
LLM inference in C/C++
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
DAGWorks-Inc/burr
Build applications that make decisions (chatbots, agents, simulations, etc.). Monitor, trace, persist, and execute on your own infrastructure.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
open-webui/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
immortalwrt/homeproxy
The modern ImmortalWrt proxy platform for ARM64/AMD64 (powered by sing-box)
fengyuan-liang/deploy-certificate-to-aliyun
Automatically deploys a wildcard certificate to Alibaba Cloud CDN every two months
tj/git-extras
GIT utilities -- repo summary, repl, changelog population, author commit percentages and more
vikejs/vike-react
🔨 React integration for Vike
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
k3s-io/k3s
Lightweight Kubernetes