Pinned Repositories
composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
hcc
HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform
HIP
HIP: C++ Heterogeneous-Compute Interface for Portability
HIPIFY
HIPIFY: Convert CUDA to Portable C++ Code
MIOpen
AMD's Machine Intelligence Library
rocBLAS
Next generation BLAS implementation for ROCm platform
ROCK-Kernel-Driver
AMDGPU Driver with KFD used by the ROCm project. Also contains the current Linux Kernel that matches this base driver
ROCm
AMD ROCm™ Software - GitHub Home
ROCm-docker
Dockerfiles for the various software layers defined in the ROCm software platform
tensorflow-upstream
TensorFlow ROCm port
AMD ROCm™ Software's Repositories
ROCm/ROCm
AMD ROCm™ Software - GitHub Home
ROCm/MIOpen
AMD's Machine Intelligence Library
ROCm/tensorflow-upstream
TensorFlow ROCm port
ROCm/composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
ROCm/rccl
ROCm Communication Collectives Library (RCCL)
ROCm/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
ROCm/aomp
AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
ROCm/AMDMIGraphX
AMD's graph optimization engine.
ROCm/rocPRIM
ROCm Parallel Primitives
ROCm/llvm-project
This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.
ROCm/rocSPARSE
Next generation SPARSE implementation for ROCm platform
ROCm/triton
Development repository for the Triton language and compiler
ROCm/rocWMMA
rocWMMA
ROCm/rocALUTION
Next generation library for iterative sparse solvers for ROCm platform
ROCm/hipSPARSE
ROCm SPARSE marshalling library
ROCm/hipBLASLt
hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
ROCm/ROCmValidationSuite
A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high-performance computing environments
ROCm/rocm-cmake
CMake modules used within the ROCm libraries
ROCm/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
ROCm/rocprofiler-sdk
ROCm/rocDecode
rocDecode is a high performance video decode SDK for AMD hardware
ROCm/TransformerEngine
ROCm/rocprofiler-systems
ROCm Systems Profiler
ROCm/Megatron-LM
Ongoing research training transformer models at scale
ROCm/rtg_tracer
ROCm/onnxruntime
ONNX Runtime: cross-platform, high performance scoring engine for ML models
ROCm/rocPyDecode
rocPyDecode is a set of Python bindings to rocDecode C++ library which provides full HW acceleration for video decoding on AMD GPUs.
ROCm/device-metrics-exporter
Device Metrics Exporter exports metrics from AMD devices (GPUs) to collectors like Prometheus.
ROCm/hipBLAS-common
Common files shared by hipBLAS and hipBLASLt
ROCm/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow