wzhao18

MSCS Student @ Stanford University

Palo Alto

Pinned Repositories

tilt
Language:C++10 2 04
antlr4
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
Language:Java00
ava
Automatic virtualization of (general) accelerators.
Language:C++00
awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
00
benchmark
A microbenchmark support library
Language:C++00
clusterdata
cluster data collected from production clusters in Alibaba for cluster management research
Language:Jupyter Notebook00
cpp-ipc
C++ IPC Library: A high-performance inter-process communication using shared memory on Linux/Windows.
Language:C++00
cricket
cricket is a virtualization solution for GPUs
Language:C00
fault-tolerent-kv-store
Fault-tolerant distributed key-value store that consists of multiple key-value servers, each of which is responsible for a portion of the key space.
Language:C10
streambox
Language:Gnuplot10

wzhao18's Repositories

wzhao18/streambox
Language:Gnuplot10
wzhao18/ava
Automatic virtualization of (general) accelerators.
Language:C++00
wzhao18/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
00
wzhao18/clusterdata
cluster data collected from production clusters in Alibaba for cluster management research
Language:Jupyter Notebook00
wzhao18/cpp-ipc
C++ IPC Library: A high-performance inter-process communication using shared memory on Linux/Windows.
Language:C++00
wzhao18/cricket
cricket is a virtualization solution for GPUs
Language:C00
wzhao18/cuda-graph-with-dynamic-parameters
wzhao18/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Language:Python
wzhao18/finetune-gpt2xl
Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed
wzhao18/GPU-Virtualization-Benchmarks
Language:HTML
wzhao18/hidet
An open-source efficient deep learning framework.
Language:Python
wzhao18/HUVM
wzhao18/iceoryx
Eclipse iceoryx™ - true zero-copy inter-process-communication
wzhao18/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
wzhao18/Lucid
Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs
wzhao18/ml-cvnets
CVNets: A library for training computer vision networks
wzhao18/needle
Language:Python
wzhao18/open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
wzhao18/protobuf-messaging
C++ library for sending/receiving protobuf messages over various channels (pipe, socket, kafka, etc.)
Language:C++
wzhao18/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:Python
wzhao18/rocksdb
A library that provides an embeddable, persistent key-value store for fast storage.
wzhao18/Saber
Window-Based Hybrid CPU/GPU Stream Processing Engine
wzhao18/streambench
wzhao18/tenset
Language:Python
wzhao18/TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
wzhao18/tlp
Language:Python
wzhao18/triton
Development repository for the Triton language and compiler
wzhao18/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Language:Python
wzhao18/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
wzhao18/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Language:Python