zhenghh04
Huihuo is currently a computer scientist at Argonne National Laboratory, working on high-performance computing, machine learning, and data management.
Argonne National Laboratory, Lemont, IL
Pinned Repositories
dlio_benchmark
An I/O benchmark for deep learning applications
vol-cache
HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O overhead.
ALCF_SDLWorkshop_2019
This folder contains the example files I used for the ALCF Simulation, Data, and Learning Workshop
dlio_ml_workloads
Reference workloads for DLIO Benchmark
dlio_profiling
This repo demonstrates how to profile I/O for deep learning applications. It is based on VaniDL.
E3SM-IO
Benchmark programs using the I/O pattern of E3SM
io_benchmarks
This is a set of tools for I/O benchmarks
MLPerf_training
Reference implementations of MLPerf™ training benchmarks
pyutils
This is a set of utils that I created throughout the years
QCWorkshop
Tutorials for ALCF Quantum Computing Workshop
zhenghh04's Repositories
zhenghh04/dlio_ml_workloads
Reference workloads for DLIO Benchmark
zhenghh04/MLPerf_training
Reference implementations of MLPerf™ training benchmarks
zhenghh04/pyutils
This is a set of utils that I created throughout the years
zhenghh04/test_dali
zhenghh04/ai-science-training-series
zhenghh04/dlio_profiling
This repo demonstrates how to profile I/O for deep learning applications. It is based on VaniDL.
zhenghh04/E3SM-IO
Benchmark programs using the I/O pattern of E3SM
zhenghh04/amrex
AMReX: Software Framework for Block Structured AMR
zhenghh04/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
zhenghh04/dl_scaling_hang
zhenghh04/dlio-profiler
A low-level profiler for capturing I/O calls from deep learning applications.
zhenghh04/dlio_benchmark
An I/O benchmark for deep learning applications
zhenghh04/dlio_microbenchmark
zhenghh04/E4S-Documenter
A tool to generate documentation for a project based on project metadata (README, Changelog, License, etc.) stored in a yaml file.
zhenghh04/exahdf5
zhenghh04/exahdf5_sdk
ExaHDF5 project build scripts
zhenghh04/fsdp_proposal
Compare FSDP with DeepSpeed
zhenghh04/h5bench
A benchmark suite for measuring HDF5 performance.
zhenghh04/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
zhenghh04/incubator-mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
zhenghh04/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
zhenghh04/mlperf_storage
zhenghh04/scorpio
A high-level Parallel I/O Library for structured grid applications
zhenghh04/test_pp_comm
Test communication for pipeline parallelism in LLM
zhenghh04/training_results_v1.1
zhenghh04/user-guides
ALCF Systems User Documentation
zhenghh04/vanidl
VaniDL is a tool for analyzing the I/O patterns and behavior of deep learning applications.
zhenghh04/vol-async
HDF5 Asynchronous I/O VOL connector that enables asynchronous I/O for HDF5 applications
zhenghh04/vol-cache
HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O overhead.
zhenghh04/vol-log-based