Pinned Repositories
ADEPT
revamping ADEPT from scratch to make it more usable in library form
adept-proxy
BabelStream
STREAM, for lots of devices written in many programming models
cg-hipex
CUDA's cooperative groups API has a HIP analogue for AMD GPUs, but HIP's variant is missing several functions. This repo provides analogues for those missing functions. They are implemented in software without any low-level tuning, so performance is not guaranteed.
charm
The Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information.
diBELLA.2D
Parallel String Graph Construction and Transitive Reduction for De Novo Genome Assembly
GPU-ArraySort
a GPU-based algorithm for sorting a large number of arrays. This version only sorts arrays of the same size. Details can be found in the original publication: http://scholarworks.wmich.edu/cgi/viewcontent.cgi?article=1004&context=pcds_reports
GPU-BSW
mhm2_staging
a capture of the mhm2 GPU local assembly work for the SC21 submission
nersc_cuda_tutorial
repo for hands-on exercises provided during the CUDA tutorial at NERSC's GPUs for Science event
mgawan's Repositories
mgawan/GPU-BSW
mgawan/mhm2_staging
a capture of the mhm2 GPU local assembly work for the SC21 submission
mgawan/ADEPT
revamping ADEPT from scratch to make it more usable in library form
mgawan/cg-hipex
CUDA's cooperative groups API has a HIP analogue for AMD GPUs, but HIP's variant is missing several functions. This repo provides analogues for those missing functions. They are implemented in software without any low-level tuning, so performance is not guaranteed.
mgawan/nersc_cuda_tutorial
repo for hands-on exercises provided during the CUDA tutorial at NERSC's GPUs for Science event
mgawan/adept-proxy
mgawan/BabelStream
STREAM, for lots of devices written in many programming models
mgawan/charm
The Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information.
mgawan/diBELLA.2D
Parallel String Graph Construction and Transitive Reduction for De Novo Genome Assembly
mgawan/GPU-ArraySort
a GPU-based algorithm for sorting a large number of arrays. This version only sorts arrays of the same size. Details can be found in the original publication: http://scholarworks.wmich.edu/cgi/viewcontent.cgi?article=1004&context=pcds_reports
mgawan/GPU-ArraySort-2.0
This version of GPU-ArraySort can sort a large number of variable-sized arrays and also includes some bug fixes. The original publication can be found here: http://scholarworks.wmich.edu/cgi/viewcontent.cgi?article=1004&context=pcds_reports
mgawan/gpu_local_ht
thread-local hashtable for GPUs
mgawan/linking_reproducer
mgawan/MaSS-Simulator
source for the MaSS-Simulator
mgawan/miniapp_kcount
miniapp for the GPU kcount portion of the MetaHipMer pipeline
mgawan/Modern-CPP-Programming
Modern C++ Programming Course (C++03/11/14/17/20/23/26)
mgawan/MS-REDUCE
a data reduction algorithm for mass spectrometry based proteomics. Details can be found in the original publication: http://bioinformatics.oxfordjournals.org/content/early/2016/01/21/bioinformatics.btw023.short
mgawan/MSREDUCE
source code for MSREDUCE
mgawan/nvbio
NVBIO is a library of reusable components designed to accelerate bioinformatics applications using CUDA.
mgawan/openacc-training-materials
Training materials provided by OpenACC.org.
mgawan/poggers
A library of high-performance data structures for CUDA. This library is header only and can be added to a project using the C package manager.
mgawan/STREAM
STREAM benchmarks from https://www.cs.virginia.edu/stream/ref.html#what
mgawan/Thermo-nuclear-network
mgawan/timemory
Cross-language (C, C++, CUDA, and/or Python) Utility for recording timing, memory, resource usage, and hardware counters
mgawan/toast3
Time Ordered Astrophysics Scalable Tools