lynnliu030's Stars
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Trinity-data-store/Trinity
EuroSys '24: "Trinity: A Fast Compressed Multi-attribute Data Store"
mda590/cloudping.co
AWS Inter-Region Latency Monitoring
google-research/deduplicate-text-datasets
lerocha/chinook-database
Sample database for SQL Server, Oracle, MySQL, PostgreSQL, SQLite, DB2
SqueezeAILab/LLMCompiler
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
jwkirchenbauer/lm-watermarking
huggingface/text-embeddings-inference
A blazing fast inference solution for text embeddings models
sgl-project/sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
rclone/rclone
"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files
supabase/supabase
The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
run-llama/rags
Build ChatGPT over your data, all with natural language
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
stanford-futuredata/ARES
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
outlines-dev/outlines
Structured Text Generation
huggingface/text-generation-inference
Large Language Model Text Generation Inference
cpacker/MemGPT
Create LLM agents with long-term memory and custom tools 📚🦙
andialbrecht/sqlparse
A non-validating SQL parser module for Python
rustformers/llm
[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
localstack/localstack
💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
whitead/paper-qa
LLM Chain for answering questions from documents with citations
nmslib/hnswlib
Header-only C++/python library for fast approximate nearest neighbors
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
ArianaGu/3D-RSD
Fast Non-line-of-sight Imaging with Non-planar Relay Surfaces
chanwutk/ucbhciprelim