mahaddad
Founder at konko.ai // Electrical and Computer Engineer with an interest in ML, gaming mods & scripts
New York, New York
mahaddad's Stars
mui/material-ui
Material UI: Comprehensive React component library that implements Google's Material Design. Free forever.
shadcn-ui/ui
Beautifully designed components that you can copy and paste into your apps. Accessible. Customizable. Open Source.
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
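The pitch here is "call 100+ LLM APIs in OpenAI format". As a stdlib-only sketch (no litellm install, no network), this is roughly the OpenAI-style chat payload that such gateways accept and translate to each provider's native API; the model name is illustrative only:

```python
# Minimal sketch of an OpenAI-format chat completion request body, the
# common interface that gateways like litellm normalize providers to.
# Pure stdlib; the model name here is an illustrative placeholder.
import json

def build_chat_request(model: str, user_message: str, temperature: float = 0.7) -> str:
    """Serialize an OpenAI-format chat completion request body as JSON."""
    payload = {
        "model": model,  # gateways often accept provider-prefixed names
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": user_message},
        ],
        "temperature": temperature,
    }
    return json.dumps(payload)

body = build_chat_request("gpt-4o-mini", "Hello!")
print(json.loads(body)["messages"][1]["content"])  # Hello!
```

A real call would POST this body to a `/chat/completions` endpoint; the gateway's job is keeping this shape stable across backends.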
ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
openai/triton
Development repository for the Triton language and compiler
vercel/ai
Build AI-powered applications with React, Svelte, Vue, and Solid
Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
huggingface/text-generation-inference
Large Language Model Text Generation Inference
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
kedacore/keda
KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes
alibaba/ali-dbhub
已迁移新仓库,此版本将不再维护
(Translation: migrated to a new repository; this version is no longer maintained.)
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
TimDettmers/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
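The core trick behind k-bit quantization — store weights as small integers plus a scale factor — can be sketched in a few lines of pure Python. This toy absmax 4-bit round-trip is illustrative only, not bitsandbytes' actual kernels (which use per-block scales and fused GPU code):

```python
# Toy absmax 4-bit quantization round-trip, stdlib only. Illustrates the
# idea behind k-bit schemes: keep int4 codes plus one float scale, and
# dequantize on the fly. Real libraries quantize per block, not per tensor.

def quantize_4bit(weights: list[float]) -> tuple[list[int], float]:
    """Map floats to signed 4-bit integers in [-7, 7] with an absmax scale."""
    scale = max(abs(w) for w in weights) / 7 or 1.0  # avoid div-by-zero for all-zero input
    codes = [round(w / scale) for w in weights]
    return codes, scale

def dequantize_4bit(codes: list[int], scale: float) -> list[float]:
    """Recover approximate float weights from codes and scale."""
    return [c * scale for c in codes]

w = [0.12, -0.7, 0.35, 0.0]
codes, scale = quantize_4bit(w)
restored = dequantize_4bit(codes, scale)
# Reconstruction error is bounded by half a quantization step (scale / 2).
assert all(abs(a - b) <= scale / 2 for a, b in zip(w, restored))
```

The memory win is the point: 4 bits per weight plus one shared scale, versus 16 or 32 bits per weight.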
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
kserve/kserve
Standardized Serverless ML Inference Platform on Kubernetes
juncongmoo/pyllama
LLaMA: Open and Efficient Foundation Language Models
FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
openai/openai-openapi
OpenAPI specification for the OpenAI API
ray-project/ray-llm
RayLLM - LLMs on Ray
OpenLMLab/LOMO
LOMO: LOw-Memory Optimization
Muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
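Once sentences are embedded, semantic search reduces to similarity ranking over vectors. A toy cosine-similarity sketch of that retrieval step, with made-up 3-d vectors (real SGPT embeddings come from a GPT model, not from hand-written lists):

```python
# Toy cosine-similarity ranking: the retrieval step behind embedding-based
# semantic search (SGPT and similar). Vectors are invented for illustration;
# a real system would embed query and documents with a trained model.
import math

def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def rank(query_vec: list[float], doc_vecs: list[list[float]]) -> list[int]:
    """Return document indices sorted by similarity to the query, best first."""
    scores = [cosine(query_vec, d) for d in doc_vecs]
    return sorted(range(len(doc_vecs)), key=lambda i: scores[i], reverse=True)

docs = [[0.9, 0.1, 0.0], [0.1, 0.9, 0.0], [0.0, 0.0, 1.0]]
print(rank([1.0, 0.0, 0.0], docs))  # [0, 1, 2]
```

At scale this brute-force loop is replaced by an approximate nearest-neighbor index, but the similarity metric is the same.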
paradigmxyz/flux
Graph-based LLM power tool for exploring many completions in parallel.
ray-project/llmperf
LLMPerf is a library for validating and benchmarking LLMs
cli99/llm-analysis
Latency and Memory Analysis of Transformer Models for Training and Inference
stanford-crfm/ecosystem-graphs