xieus
Interested in building AI/ML core systems for scaling LLM and generative AI applications
@anyscale
xieus's Stars
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
crossplane/crossplane
The Cloud Native Control Plane
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
meta-llama/llama-agentic-system
Agentic components of the Llama Stack APIs
alpa-projects/alpa
Training and serving large-scale neural networks with auto parallelization.
turboderp/exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
HuaizhengZhang/AI-System-School
🚀 AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑💻 Video Tutorials.
ray-project/ray-llm
RayLLM - LLMs on Ray
p4lang/p4c
P4_16 reference compiler
ray-project/llmperf
LLMPerf is a library for validating and benchmarking LLMs
ray-project/llmperf-leaderboard
efficient/rdma_bench
A framework to understand RDMA
BlockLiu/ElasticSketchCode
in-ATP/ATP
futurewei-cloud/merak
Merak: Large-scale cloud emulator
futurewei-cloud/arion
Arion: An intelligent programmable data plane framework
xieus/alcor-int
Mizar Management plane
xieus/arion
Arion: An intelligent programmable data plane framework
xieus/arion-agent
Arion Agent: Local Network Agent on each Arion Wing
xieus/arion-dp
Arion DP: High Performance Cloud Scale Data Plane
xieus/arion-master
Arion Master: Regional GW Control Plane
xieus/chogori-platform
xieus/ingen-sdk
xieus/llama
Inference code for LLaMA models
xieus/llmperf
LLMPerf is a library for validating and benchmarking LLMs
xieus/merak
Merak: Large-scale cloud emulator
xieus/network_ml
Network ML tools and framework
xieus/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.