simon-mo's Stars
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
cloudflare/pingora
A library for building fast, reliable and evolvable network services.
aristocratos/btop
A monitor of resources
ml-explore/mlx
MLX: An array framework for Apple silicon
fastapi/sqlmodel
SQL databases in Python, designed for simplicity, compatibility, and robustness.
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
atelier-anchor/smiley-sans
Smiley Sans (得意黑): a Chinese sans-serif typeface that seeks a balance between humanist feel and geometric features
adityatelange/hugo-PaperMod
A fast, clean, responsive Hugo theme.
huggingface/text-generation-inference
Large Language Model Text Generation Inference
pydantic/FastUI
Build better UIs faster.
leptonai/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
containers/youki
A container runtime written in Rust
observablehq/framework
A static site generator for data apps, dashboards, reports, and more. Observable Framework combines JavaScript on the front-end for interactive graphics with any language on the back-end for data analysis.
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
pages-cms/pages-cms
The No-Hassle CMS for Static Site Generators
sustainable-computing-io/kepler
Kepler (Kubernetes-based Efficient Power Level Exporter) uses eBPF to probe performance counters and other system stats, uses ML models to estimate workload energy consumption based on these stats, and exports the estimates as Prometheus metrics
skyplane-project/skyplane
🔥 Blazing fast bulk data transfers between any cloud 🔥
evanmiller/LLM-Reading-List
LLM papers I'm reading, mostly on inference and model compression
mirage-project/mirage
Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA
timohausmann/quadtree-js
A lightweight quadtree implementation for javascript
kubewharf/godel-scheduler
a unified scheduler for online and offline tasks
inkandswitch/tiny-essay-editor
simple markdown editor with inline comments, on latest automerge stack
kelseyhightower/standalone-kubelet-tutorial
Standalone Kubelet Tutorial
coreweave/tensorizer
Module, Model, and Tensor Serialization/Deserialization
containerd/rust-extensions
Rust crates to extend containerd
Nugine/s3s
S3 Service Adapter
bluefishjs/bluefish-archive
A SolidJS diagramming framework
crossroadsfpga/enso
Ensō is a high-performance streaming interface for NIC-application communication.
HewlettPackard/dockerfile-parser-rs
a Rust library for parsing, validating, and modifying Dockerfiles
novln/docker-parser
Docker image identifier parser.