asaiacai
Building tools for deep learning training and serving. CTO @Trainy-ai (YC S23). Physics PhD UC Berkeley '22.
asaiacai's Stars
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
grafana/grafana
The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
juanfont/headscale
An open source, self-hosted implementation of the Tailscale control server
tailscale/tailscale
The easiest, most secure way to use WireGuard and 2FA.
prometheus/node_exporter
Exporter for machine metrics
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
google-deepmind/mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
axboe/fio
Flexible I/O Tester
gpu-mode/lectures
Material for gpu-mode lectures
glasskube/glasskube
🧊 The next generation Package Manager for Kubernetes 📦 Featuring a GUI and a CLI. Glasskube packages are dependency aware, GitOps ready and can get automatic updates via a central public package repository.
srcbookdev/srcbook
TypeScript-centric app development platform: notebook and AI app builder
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
tegonhq/tegon
Tegon is an open-source, dev-first alternative to Jira, Linear
squaredtechnologies/thread
AI-powered Jupyter Notebook — use local AI to generate and edit code cells, automatically fix errors, and chat with your data
vllm-project/llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
pytorch-labs/attention-gym
Helpful tools and examples for working with flex-attention
imbue-ai/cluster-health
leptonai/gpud
empower-ai/empower-functions
GPT-4 level function calling models for real-world tool using use cases
foundation-model-stack/fms-fsdp
🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.
glasskube/gitops-template
ArgoCD based GitOps template with preconfigured Glasskube Package Manager and an example application.
CambioML/any-parser
Accurate, private and configurable document retrieval LLM
Trainy-ai/konduktor
cluster/scheduler health monitoring for GPU jobs on k8s
metoro-io/statusphere
Batteries included open-source api-first status page aggregator
vllm-project/dashboard
vLLM performance dashboard
lianakoleva/no-libtorch-compile
romilbhardwaj/kube-tutorial
Kubernetes Tutorial for the PS2 group meetings at UC Berkeley
ocf/transpire
the OCF Kubernetes helper library (not for public consumption)