Pinned Repositories
composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
hcc
HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform
HIP
HIP: C++ Heterogeneous-Compute Interface for Portability
HIPIFY
HIPIFY: Convert CUDA to Portable C++ Code
MIOpen
AMD's Machine Intelligence Library
rocBLAS
Next generation BLAS implementation for ROCm platform
ROCK-Kernel-Driver
AMDGPU Driver with KFD used by the ROCm project. Also contains the current Linux Kernel that matches this base driver
ROCm
AMD ROCm™ Software - GitHub Home
ROCm-docker
Dockerfiles for the various software layers defined in the ROCm software platform
tensorflow-upstream
TensorFlow ROCm port
AMD ROCm™ Software's Repositories
ROCm/ROCK-Kernel-Driver
AMDGPU Driver with KFD used by the ROCm project. Also contains the current Linux Kernel that matches this base driver
ROCm/k8s-device-plugin
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
ROCm/rccl-tests
RCCL Performance Benchmark Tests
ROCm/rocm-blogs
ROCm/TransferBench
TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)
ROCm/MISA
Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)
ROCm/OpenFOAM_HMM
Refactoring OpenFOAM with OpenMP target offloading and use of HMM to offload work onto GPUs
ROCm/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
ROCm/pyrsmi
python package of rocm-smi-lib
ROCm/hip-python
HIP Python Low-level Bindings
ROCm/hipCollections
Header-only library of GPU-accelerated, concurrent data structures.
ROCm/Megatron-LM
Ongoing research training transformer models at scale
ROCm/MITuna
ROCm/Gromacs
ROCm's implementation of Gromacs
ROCm/onnxruntime
ONNX Runtime: cross-platform, high performance scoring engine for ML models
ROCm/hipBench
HIP Kernel Benchmarking Library
ROCm/MIFin
Tuna centric MIOpen client
ROCm/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
ROCm/builder
Continuous builder and binary build scripts for pytorch
ROCm/gpu-cluster-networking
Cluster networking documentation for AMD Instinct accelerators
ROCm/hipCOMP-core
hipCOMP is a library for fast lossless compression/decompression on the GPU. This repository contains the algorithms.
ROCm/jitify
A single-header C++ library for simplifying the use of HIP Runtime Compilation (HIPRTC).
ROCm/rmm-rocm
RMM: RAPIDS Memory Manager
ROCm/CTranslate2
Fast inference engine for Transformer models
ROCm/hipBLAS-common
Common files shared by hipBLAS and hipBLASLt
ROCm/rocm-install-on-windows
ROCm/.github
AMD ROCm™ Platform - GitHub Home
ROCm/ompi
Copy of the Open MPI repository
ROCm/text-generation-inference
Large Language Model Text Generation Inference
ROCm/tritoninferenceserver-vllm