Pinned Repositories
kubernetes-scheduler-simulator
Kubernetes Scheduler Simulator
Metis
Metis: Learning to Schedule Long-Running Applications in Shared Container Clusters with at Scale
go-nvml
Go Bindings for the NVIDIA Management Library (NVML)
2D-bin-packing-heuristic
This repo contains a 2D bin packing strategy that is based on two greedy heuristics.
awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
clusterdata
cluster data collected from production clusters in Alibaba for cluster management research
clusterdata-cluster-trace-gpu-v2020-data
ColossalAI
Making large AI models cheaper, faster and more accessible
FastChat
An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5.
open-simulator
K8s cluster simulator for capacity planning
qzweng's Repositories
qzweng/clusterdata-cluster-trace-gpu-v2020-data
qzweng/open-gpu-share
qzweng/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
qzweng/clusterdata
cluster data collected from production clusters in Alibaba for cluster management research
qzweng/ColossalAI
Making large AI models cheaper, faster and more accessible
qzweng/credentials-nodejs
Alibaba Cloud Credentials for TypeScript/Node.js
qzweng/FastChat
An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5.
qzweng/graph-learn-tracing
qzweng/hkust-latex-thesis-template
A Better HKUST LaTeX Thesis Template
qzweng/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
qzweng/open-simulator
K8s cluster simulator for capacity planning
qzweng/qzweng.github.io
My Academic Personal Pages:
qzweng/skypilot
SkyPilot is a framework for easily running machine learning workloads on any cloud through a unified interface.
qzweng/credentials-python
qzweng/DeepPlan
Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)
qzweng/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
qzweng/HeliosArtifact
HeliosArtifact
qzweng/k8s-device-plugin
OpenAIOS vGPU device plugin for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory, in order to allow applications to access larger memory space than its physical capacity. It is designed for ease of use of extended device memory for AI workloads.
qzweng/k8s-vgpu-scheduler
OpenAIOS vGPU scheduler for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory.
qzweng/kdash
A simple and fast dashboard for Kubernetes
qzweng/kubernetes-scheduler-simulator
Kubernetes Scheduler Simulator
qzweng/modelzoo
qzweng/obsidian-things
An Obsidian theme inspired by the beautifully-designed app, Things.
qzweng/qzweng
qzweng/qzweng.github.io-202308
My Academic Personal Pages:
qzweng/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
qzweng/seed_rl
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
qzweng/typeset
自动修正中文、英文、代码混合排版中的全半角、空格等问题
qzweng/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
qzweng/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)