Pinned Repositories
dgNN
[Mlsys'22] Understanding gnn computational graph: A coordinated computation, io, and memory perspective
2.5D_ROT
The HDL framework for our 2.5D root of trust.
abc
ABC: System for Sequential Logic Synthesis and Formal Verification
accelergy
Accelergy is an energy estimation infrastructure for accelerator energy estimations
ahead
AHEAD: A Tool for Projecting Next-Generation Hardware Enhancements on GPU-Accelerated Systems
Bloom
cacti
An integrated cache and memory access time, cycle time, area, leakage, and dynamic power model
dgNN
LLMCompass_ISCA_AE
awesome_ai4eda
HenryChang213's Repositories
HenryChang213/LLMCompass_ISCA_AE
HenryChang213/dgNN
HenryChang213/2.5D_ROT
The HDL framework for our 2.5D root of trust.
HenryChang213/accelergy
Accelergy is an energy estimation infrastructure for accelerator energy estimations
HenryChang213/Bloom
HenryChang213/cacti
An integrated cache and memory access time, cycle time, area, leakage, and dynamic power model
HenryChang213/COS598D-Serverless
HenryChang213/cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
HenryChang213/cva6
The CORE-V CVA6 is an Application class 6-stage RISC-V CPU capable of booting Linux
HenryChang213/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
HenryChang213/dgl
Python package built to ease deep learning on graph, on top of existing DL frameworks.
HenryChang213/dgl-lifesci
Python package for graph neural networks in chemistry and biology
HenryChang213/FasterTransformer
Transformer related optimization, including BERT, GPT
HenryChang213/gem5
This is an read-only mirror of the gem5 simulator. The upstream repository is stored in https://gem5.googlesource.com, code reviews should be submitted to https://gem5-review.googlesource.com/. The mirrors are synchronized every 15 minutes.
HenryChang213/GPU4GNN
HenryChang213/ILAng
A Modeling and Verification Platform for SoCs using ILAs
HenryChang213/LLMCompass_ISCA_AE_docker
HenryChang213/MG-GCN
MG-GCN: Scalable Multi-GPU GCN Training Framework
HenryChang213/nccl-tests
NCCL Tests
HenryChang213/nicsefc-readme
some docs for rookies in nics-efc
HenryChang213/ns3-datacenter
HenryChang213/ppo_libtorch
HenryChang213/pytorch_sparse
PyTorch Extension Library of Optimized Autograd Sparse Matrix Operations
HenryChang213/reinforcement-learning-for-per-flow-buffer-sizing
Implementation of the paper "LFQ: Online Learning of Per-Flow Queuing Policies Using Deep Reinforcement Learning", Contact: Maximilian Bachl
HenryChang213/scale-sim-v2
Repository to host and maintain scale-sim-v2 code
HenryChang213/splitwise-sim
LLM serving cluster simulator
HenryChang213/TestDocs
HenryChang213/timeloop
Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures.
HenryChang213/timeloop-accelergy-exercises
Exercises for exploring the Fibertree, Timeloop and Accelergy tools
HenryChang213/tiny-training
On-Device Training Under 256KB Memory [NeurIPS'22]