Pinned Repositories
composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
hcc
HCC is an open-source, optimizing C++ compiler for heterogeneous compute, currently targeting the ROCm GPU computing platform
HIP
HIP: C++ Heterogeneous-Compute Interface for Portability
HIPIFY
HIPIFY: Convert CUDA to Portable C++ Code
MIOpen
AMD's Machine Intelligence Library
rocBLAS
Next generation BLAS implementation for ROCm platform
ROCK-Kernel-Driver
AMDGPU driver with KFD, used by the ROCm project. Also contains the current Linux kernel that matches this base driver.
ROCm
AMD ROCm™ Software - GitHub Home
ROCm-docker
Dockerfiles for the various software layers defined in the ROCm software platform
tensorflow-upstream
TensorFlow ROCm port
AMD ROCm™ Software's Repositories
ROCm/ROCm-OpenCL-Runtime
ROCm OpenCL Runtime
ROCm/gpufort
GPUFORT: source-to-source (S2S) translation tool for CUDA Fortran and Fortran+X, in the spirit of hipify
ROCm/HIP-CPU
An implementation of HIP that works on CPUs, across OSes.
ROCm/amd_matrix_instruction_calculator
A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators
ROCm/atmi
Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provides a consistent, declarative API to create task graphs on CPUs and GPUs (integrated and discrete).
ROCm/ROCclr
ROCm/hipamd
ROCm/roc-stdpar
ROCm/aws-ofi-rccl
ROCm/criu
Checkpoint/Restore tool
ROCm/FAMBench
Benchmarks to capture important workloads.
ROCm/gputt
gpuTT: GPU Tensor Transpose library
ROCm/kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
ROCm/og
OpenMP GCC support
ROCm/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
ROCm/blis
BLAS-like Library Instantiation Software Framework
ROCm/ClassyVision
An end-to-end PyTorch framework for image and video classification
ROCm/dask
Parallel computing with task scheduling
ROCm/faiss
A library for efficient similarity search and clustering of dense vectors.
ROCm/frugally-deep
Header-only library for using Keras (TensorFlow) models in C++.
ROCm/gloo
Collective communications library with various primitives for multi-machine training.
ROCm/libfabric
Open Fabric Interfaces
ROCm/libflame
High-performance object-based library for dense linear algebra (DLA) computations
ROCm/LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
ROCm/pytorch-examples
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
ROCm/pytorch_scatter
PyTorch Extension Library of Optimized Scatter Operations
ROCm/rocgputreeshap
ROCm support for GPUTreeShap
ROCm/tensorpipe
A tensor-aware point-to-point communication primitive for machine learning
ROCm/tiny-rocm-cxx
ROCm/ucx-py-rocm
Python bindings for UCX