amathews-amd's Stars
RobertElderSoftware/roberteldersoftwarediff
ROCm/rocprofiler-compute
Advanced Profiling and Analytics for AMD Hardware
ROCm/omnitrace
Omnitrace: Application Profiling, Tracing, and Analysis
facebookincubator/AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
aserio/switchboard
facebookresearch/FAMBench
Benchmarks to capture important workloads.
ezyang/nvprof2json
Convert nvprof profiles into about:tracing compatible JSON files