jrmadsen
HPC C/C++/Python developer with interest in performance portable solutions. Background in radiation transport. Senior Member of Technical Staff at AMD
AMDAustin, TX
Pinned Repositories
compile-time-perf
Measures high-level timing and memory usage metrics during compilation
madthreading
A low-overhead, task-based threading API using a thread-pool of C++11 threads
omnitrace
Omnitrace: Application Profiling, Tracing, and Analysis
PTL
Parallel Tasking Library (PTL) - Lightweight C++11 mutilthreading tasking system featuring thread-pool, task-groups, and lock-free task queue
pyctest
Python bindings of select portions of CMake/CTest package -- enabling generation of CTest test files from Python without a CMake build system
timemory-testing
Scripts for extended testing
Vectorization-Example
An example testing SIMD with AVX/AVX2 Intrinsics vs. OpenMP SIMD vs. compiler (gcc) auto-vectorization
pykokkos-base
Python bindings for data interoperability with Kokkos (View, DynRankView)
timemory
Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
omnitrace
Omnitrace: Application Profiling, Tracing, and Analysis
jrmadsen's Repositories
jrmadsen/madthreading
A low-overhead, task-based threading API using a thread-pool of C++11 threads
jrmadsen/Vectorization-Example
An example testing SIMD with AVX/AVX2 Intrinsics vs. OpenMP SIMD vs. compiler (gcc) auto-vectorization
jrmadsen/boost-compute
Testing of Boost.Compute for OpenCL
jrmadsen/cdash-docker
jrmadsen/CMake
Mirror of CMake upstream repository
jrmadsen/dynamorio
Dynamic Instrumentation Tool Platform
jrmadsen/GOTCHA-tracer
Tracer generater example using GOTCHA
jrmadsen/gperftools
Main gperftools repository
jrmadsen/gprof2dot
Converts profiling output to a dot graph.
jrmadsen/kokkos-kernels
Kokkos C++ Performance Portability Programming EcoSystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels
jrmadsen/nersc-roofline
jrmadsen/ovis
OVIS High Performance Computing monitoring, analysis, and visualization project.
jrmadsen/pyasync
Python + C++11 - example using std::future and std::async to work around the GIL
jrmadsen/pyctest-feedstock
A conda-smithy repository for pyctest.
jrmadsen/pylikwid
Python interface for the LIKWID C API (https://github.com/RRZE-HPC/likwid)
jrmadsen/scikit-build-feedstock
A conda-smithy repository for scikit-build.
jrmadsen/stable_vector
STL-compliant stable vector container
jrmadsen/static_math
Compile time mathematic functions for C++14
jrmadsen/TECA
TECA, theToolkit for Extreme Climate Analysis, contains a collection of climate anlysis algorithms targetted at extreme event detection and analysis.
jrmadsen/time-machine
Time machine is a Jekyll theme for GitHub Pages
jrmadsen/TrackingBlocker
Interface to update and maintain a HOSTS file on Linux/macOS systems that block DNS queries from ad and tracking websites from every leaving your computer