jrmadsen
HPC C/C++/Python developer with interest in performance portable solutions. Background in radiation transport. Senior Member of Technical Staff at AMD
AMDAustin, TX
Pinned Repositories
compile-time-perf
Measures high-level timing and memory usage metrics during compilation
madthreading
A low-overhead, task-based threading API using a thread-pool of C++11 threads
omnitrace
Omnitrace: Application Profiling, Tracing, and Analysis
PTL
Parallel Tasking Library (PTL) - Lightweight C++11 mutilthreading tasking system featuring thread-pool, task-groups, and lock-free task queue
pyctest
Python bindings of select portions of CMake/CTest package -- enabling generation of CTest test files from Python without a CMake build system
timemory-testing
Scripts for extended testing
Vectorization-Example
An example testing SIMD with AVX/AVX2 Intrinsics vs. OpenMP SIMD vs. compiler (gcc) auto-vectorization
pykokkos-base
Python bindings for data interoperability with Kokkos (View, DynRankView)
timemory
Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
omnitrace
Omnitrace: Application Profiling, Tracing, and Analysis
jrmadsen's Repositories
jrmadsen/PTL
Parallel Tasking Library (PTL) - Lightweight C++11 mutilthreading tasking system featuring thread-pool, task-groups, and lock-free task queue
jrmadsen/pyctest
Python bindings of select portions of CMake/CTest package -- enabling generation of CTest test files from Python without a CMake build system
jrmadsen/omnitrace
Omnitrace: Application Profiling, Tracing, and Analysis
jrmadsen/Caliper
Caliper is a flexible application introspection system
jrmadsen/cereal
A C++11 library for serialization
jrmadsen/compiler-explorer
Run compilers interactively from your web browser and interact with the assembly
jrmadsen/coz
Coz: Causal Profiling
jrmadsen/dyninst
DyninstAPI: Tools for binary instrumentation, analysis, and modification.
jrmadsen/dyninst-external-tests
Tests for Dyninst that don't fit in the framework of the test suite
jrmadsen/dyninst-testsuite
The testsuite for Dyninst and Dyninst component libraries.
jrmadsen/ELFIO
ELFIO - ELF (Executable and Linkable Format) reader and producer implemented as a header only C++ library
jrmadsen/googletest
Googletest - Google Testing and Mocking Framework
jrmadsen/GOTCHA
GOTCHA is a library for wrapping function calls in shared libraries
jrmadsen/hatchet
Graph-indexed Pandas DataFrames for analyzing hierarchical performance data
jrmadsen/HIP
HIP: C++ Heterogeneous-Compute Interface for Portability
jrmadsen/hipamd
jrmadsen/kokkos
Kokkos C++ Performance Portability Programming EcoSystem: The Programming Model - Parallel Execution and Memory Abstraction
jrmadsen/kokkos-tools
Kokkos C++ Performance Portability Programming EcoSystem: Profiling and Debugging Tools
jrmadsen/libunwind
libunwind official github repo
jrmadsen/pybind11
Seamless operability between C++11 and Python
jrmadsen/pykokkos-base
Python bindings for interacting with Kokkos
jrmadsen/ROCclr
jrmadsen/rocm_smi_lib
ROCm SMI LIB
jrmadsen/rocprofiler
ROC profiler library. Profiling with perf-counters and derived metrics.
jrmadsen/spack
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
jrmadsen/timem
Command-line tool which provides the same capabilities of the UNIX command-line tool time but extends it to several additional metrics
jrmadsen/timemory
Cross-language (C, C++, CUDA, and/or Python) Utility for recording timing, memory, resource usage, and hardware counters
jrmadsen/tomlplusplus
Header-only TOML config file parser and serializer for C++17 (and later!).
jrmadsen/tomopy
Tomographic Reconstruction in Python
jrmadsen/yaml-cpp
A YAML parser and emitter in C++