Pinned Repositories
composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
hcc
HCC is an open-source, optimizing C++ compiler for heterogeneous compute, currently targeting the ROCm GPU computing platform
HIP
HIP: C++ Heterogeneous-Compute Interface for Portability
HIPIFY
HIPIFY: Convert CUDA to Portable C++ Code
MIOpen
AMD's Machine Intelligence Library
rocBLAS
Next-generation BLAS implementation for the ROCm platform
ROCK-Kernel-Driver
AMDGPU driver with KFD, used by the ROCm project. Also contains the current Linux kernel that matches this base driver.
ROCm
AMD ROCm™ Software - GitHub Home
ROCm-docker
Dockerfiles for the various software layers defined in the ROCm software platform
tensorflow-upstream
TensorFlow ROCm port
AMD ROCm™ Software's Repositories
ROCm/LLVM-AMDGPU-Assembler-Extra
LLVM AMDGPU Assembler Helper Tools
ROCm/Experimental_ROC
Experimental and Intriguing Tools for ROCm
ROCm/ROCm-OpenCL-Driver
ROCm OpenCL Compiler Tool Driver
ROCm/clARMOR
OpenCL tool to detect buffer overflows in GPU kernels
ROCm/StatusMonitor
OpenCL multi-GPU sample for monitoring system health
ROCm/nnvm-rocm
NNVM for ROCm Examples
ROCm/DirectGMA_CL
Simple example showing how to use DGMA in OpenCL
ROCm/HCC-Example-Application
HCC Sample Applications
ROCm/OSU_Microbenchmarks
ROCm - UCX enabled OSU_Benchmarks
ROCm/clang
ROCm/rocm-papi-component
PAPI integration in ROCm profiling and tracking tools
ROCm/rocm-timeline-generator
ROCm/mshadow
Matrix Shadow: Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning
ROCm/CNTK-1
Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
ROCm/cublasgemm-benchmark
Code for benchmarking GPU performance based on cublasSgemm and cublasHgemm
ROCm/miopen-benchmark
Benchmarking MIOpen
ROCm/multi-gpu-programming-models
Examples demonstrating available options to program multiple GPUs in a single node or a cluster