neozhang307
RIKEN-RCCS Post Doc Tokyo Tech Ph.D. Interested in parallel programming and GPU programming. Currently working on code generation and machine learning topic
RIKENTokyo
Pinned Repositories
asynccopybench
doctoral_showcase_materials
EBISU-ICS23
This is a repo to keep the experimental implementation of EBISU used in ICS23
Microbenchmark
neozhang307
Config files for my GitHub profile.
neozhang307.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
PERKS
persistent kernel sample implementation of iterative stencil solver and conjugate gradient solver
stencilImples
Try to track the available stencil implementations
SyncMicrobenchmark
This work aims at characterizing the synchronization methods in CUDA.
vec_mem
neozhang307's Repositories
neozhang307/PERKS
persistent kernel sample implementation of iterative stencil solver and conjugate gradient solver
neozhang307/SyncMicrobenchmark
This work aims at characterizing the synchronization methods in CUDA.
neozhang307/EBISU-ICS23
This is a repo to keep the experimental implementation of EBISU used in ICS23
neozhang307/vec_mem
neozhang307/stencilImples
Try to track the available stencil implementations
neozhang307/asynccopybench
neozhang307/doctoral_showcase_materials
neozhang307/Microbenchmark
neozhang307/neozhang307
Config files for my GitHub profile.
neozhang307/neozhang307.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
neozhang307/Reduction
This project is used to develop a high performance multi-gpu reduction kernel
neozhang307/test_concurrent_cooperative_launch
test_concurrent_cooperative
neozhang307/test_tensor
neozhang307/tpu_graphs