alapini's Stars
localsend/localsend
An open-source cross-platform alternative to AirDrop
imartinez/privateGPT
Interact with your documents using the power of GPT, 100% privately, no data leaks
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
jmorganca/ollama
Get up and running with Llama 2, Mistral, and other large language models locally.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
schollz/croc
Easily and securely send things from one computer to another :crocodile: :package:
spf13/viper
Go configuration with fangs
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
Infisical/infisical
♾ Infisical is the open-source secret management platform: Sync secrets across your team/infrastructure, prevent secret leaks, and manage internal PKI
bentoml/OpenLLM
Run any open-source LLMs, such as Llama, Mistral, as OpenAI compatible API endpoint in the cloud.
mistralai/mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
willianjusten/awesome-svg
A curated list of SVG.
turboderp/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
neuralmagic/deepsparse
Sparsity-aware deep learning inference runtime for CPUs
huggingface/text-embeddings-inference
A blazing fast inference solution for text embeddings models
turboderp/exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
signavio/react-mentions
@mention people in a textarea
griptape-ai/griptape
Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Planetary-Computers/autotab-starter
Build browser agents for real world tasks
ollama-ui/ollama-ui
Simple HTML UI for Ollama
lmstudio-ai/model-catalog
A collection of standardized JSON descriptors for Large Language Model (LLM) files.
crashappsec/chalk
Chalk allows you to follow code from development, through builds and into production.
runpod/runpodctl
🧰 | RunPod CLI for pod management
triton-inference-server/vllm_backend
zevv/nmqtt
Native Nim MQTT client library
tensorchord/modelz-template-vllm
Dockerfile and templates for vLLM