Pinned Repositories
captainCache
prompt caching to save dollars on generative AI API usage.
frugal
⚡️ Transform AI/ML operations: Transparency, Control and Cost Optimization. ⚡️
Environmental_Intelligence
Data for Environmental Intelligence: A mega list of Earth System Datasets covering earth observations, climate, water, forests, biodiversity, ecology, protected areas, natural hazards, marine and the tracking of UN's Sustainable Development Goals
falcosidekick
Connect Falco to your ecosystem
fetch-github-followers
HalfUnet-ImageSegmentation
size-converter
A size converter package which converts bytes into KB, MB, GB.
worker-sglang
SGLang is fast serving framework for large language models and vision language models.
worker-tensortllm
Tensort-LLM worker [BETA]
worker-vllm
The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
pandyamarut's Repositories
pandyamarut/pandyamarut
pandyamarut/SDXL-TensorRT
pandyamarut/a40s_benchs
pandyamarut/awesome-compound-ai-systems
Papers about infrastructure (deployment & serving) and systems for compound AI
pandyamarut/axolotl
Go ahead and axolotl questions
pandyamarut/bandwidthTest
gpu-bandwidthTest
pandyamarut/containers
🐳 | Dockerfiles for the RunPod container images used for our official templates.
pandyamarut/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
pandyamarut/fake-gpu-operator
pandyamarut/flashinfer
FlashInfer: Kernel Library for LLM Serving
pandyamarut/gorilla
Gorilla: An API store for LLMs
pandyamarut/llama-stack
Model components of the Llama Stack APIs
pandyamarut/llm-finetuning
[WIP]
pandyamarut/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
pandyamarut/maybe
The OS for your personal finances
pandyamarut/MemGPT
Create LLM agents with long-term memory and custom tools 📚🦙
pandyamarut/multi-node
Basic multi-node training.
pandyamarut/optimized-sdxl
pandyamarut/pandyamarut.github.io
✨ Build a beautiful and simple website in literally minutes. Demo at https://beautifuljekyll.com
pandyamarut/phyraui
pandyamarut/rocmprofile
pandyamarut/runpod-python
🐍 | Python library for RunPod API and serverless worker SDK.
pandyamarut/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
pandyamarut/SWE-agent
SWE-agent: Agent Computer Interfaces Enable Software Engineering Language Models
pandyamarut/tetra
A Light-weight distributed computing framework for AI workloads.
pandyamarut/trt-oai
pandyamarut/TurboRT
TensorRT LLM Engine Builder on Runpod
pandyamarut/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
pandyamarut/vllm-load-balancer
pandyamarut/worker-vllm-tests