qzweng

Ph.D. in AI System and Cloud Computing

Shanghai AI Lab, Shanghai, China

Pinned Repositories

kubernetes-scheduler-simulator
Kubernetes Scheduler Simulator
Language: Shell 92 4 715
Metis
Metis: Learning to Schedule Long-Running Applications in Shared Container Clusters at Scale
Language: Jupyter Notebook 17 1 15
go-nvml
Go Bindings for the NVIDIA Management Library (NVML)
Language: C 327 17 5272
2D-bin-packing-heuristic
This repo contains a 2D bin packing strategy that is based on two greedy heuristics.
Language: Python 0 0 00
awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
0 0 00
clusterdata
cluster data collected from production clusters in Alibaba for cluster management research
Language: Jupyter Notebook 0 0 00
clusterdata-cluster-trace-gpu-v2020-data
Language: Shell 1 1 02
ColossalAI
Making large AI models cheaper, faster and more accessible
Language: Python 0 0 00
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
Language: Python 0 0 00
open-simulator
K8s cluster simulator for capacity planning
Language: Go 0 0 01

qzweng's Repositories

qzweng/clusterdata-cluster-trace-gpu-v2020-data
Language: Shell 1 1 02
qzweng/open-gpu-share
Language: Go 1 1 01
qzweng/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
0 0 00
qzweng/clusterdata
cluster data collected from production clusters in Alibaba for cluster management research
Language: Jupyter Notebook 0 0 00
qzweng/ColossalAI
Making large AI models cheaper, faster and more accessible
Language: Python 0 0 00
qzweng/credentials-nodejs
Alibaba Cloud Credentials for TypeScript/Node.js
Language: TypeScript 0 0 00
qzweng/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
Language: Python 0 0 00
qzweng/graph-learn-tracing
Language: Python 0 1 00
qzweng/hkust-latex-thesis-template
A Better HKUST LaTeX Thesis Template
Language: TeX 0 0 00
qzweng/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Language: Python 0 0 00
qzweng/open-simulator
K8s cluster simulator for capacity planning
Language: Go 0 0 01
qzweng/qzweng.github.io
My Academic Personal Pages:
Language: HTML 0 1 01
qzweng/skypilot
SkyPilot is a framework for easily running machine learning workloads on any cloud through a unified interface.
Language: Python 0 0 00
qzweng/credentials-python
Language: Python 0 0
qzweng/DeepPlan
Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)
Language: C++ 0 0
qzweng/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
Language: Python 0 0
qzweng/HeliosArtifact
HeliosArtifact
qzweng/k8s-device-plugin
The OpenAIOS vGPU device plugin for Kubernetes originated from the OpenAIOS project. It virtualizes GPU device memory so that applications can access more memory than the device's physical capacity, and it is designed to make extended device memory easy to use for AI workloads.
Language: Go 0 0
qzweng/k8s-vgpu-scheduler
The OpenAIOS vGPU scheduler for Kubernetes originated from the OpenAIOS project and virtualizes GPU device memory.
Language: Go 0 0
qzweng/kdash
A simple and fast dashboard for Kubernetes
Language: Rust 0 0
qzweng/kubernetes-scheduler-simulator
Kubernetes Scheduler Simulator
Language: Shell 0 0
qzweng/modelzoo
Language: Python 0 0
qzweng/obsidian-things
An Obsidian theme inspired by the beautifully-designed app, Things.
Language: CSS 0 0
qzweng/qzweng
1 0
qzweng/qzweng.github.io-202308
My Academic Personal Pages:
Language: JavaScript 1
qzweng/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
Language: Python 0 0
qzweng/seed_rl
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
Language: Python 0 0
qzweng/typeset
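Automatically fixes full-width/half-width characters, spacing, and similar issues in text that mixes Chinese, English, and code.
Language: Python 0 0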
qzweng/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python 0 01
qzweng/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Language: Python 0 0