kannon92's Stars
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
ggerganov/llama.cpp
LLM inference in C/C++
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Mozilla-Ocho/llamafile
Distribute and run LLMs with a single file.
ggerganov/ggml
Tensor library for machine learning
marimo-team/marimo
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
aws/containers-roadmap
This is the public roadmap for AWS container services (ECS, ECR, Fargate, and EKS).
EricLBuehler/mistral.rs
Blazingly fast LLM inference.
karmada-io/karmada
Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration
kubernetes-sigs/kwok
Kubernetes WithOut Kubelet - Simulates thousands of Nodes and Clusters.
kubernetes-sigs/cri-tools
CLI and validation tools for Kubelet Container Runtime Interface (CRI) .
premAI-io/state-of-open-source-ai
:closed_book: Clarity in the current fast-paced mess of Open Source innovation
kuasar-io/kuasar
A multi-sandbox container runtime that provides cloud-native, all-scenario multiple sandbox container solutions.
Project-HAMi/HAMi
Heterogeneous AI Computing Virtualization Middleware
vllm-project/llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
open-cluster-management-io/ocm
Core components in the OCM project. Report here if you found any issues in OCM.
sosreport/sos
A unified tool for collecting system logs and other debug information
armadaproject/armada
A multi-cluster batch queuing system for high-throughput workloads on Kubernetes.
containers/conmon
An OCI container runtime monitor.
cncf-tags/container-device-interface
kubernetes-sigs/kubectl-validate
kubernetes-sigs/jobset
JobSet: a k8s native API for distributed ML training and HPC workloads
kubernetes-sigs/lws
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
G-Research/fasttrackml
Experiment tracking server focused on speed and scalability
google/xpk
xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.
kubernetes-sigs/dra-example-driver
Example DRA driver that developers can fork and modify to get them started writing their own.
openshift-eng/ocp-build-data
Configuration data used to build OCP images
IBM/autopilot
A tool to detect infrastructure issues on cloud native AI systems
dejanzele/batch-simulator
batch-simulator is a Golang CLI tool that simulates the lifecycle of Kubernetes API resources, such as Nodes, Pods, etc. using KWOK
openshift-virtualization/wasp-agent