This repo contains a hacked version of MIOpenGEMM that runs on NVIDIA GPU and theoretically any OpenCL device.
Note that MIOpenGEMM's demos require a hint for MIOpenGEMM to select shich OpenCL device to use. Currently it is setted to find a NVIDIA GPU in bench.cpp
and find a AMD GPU in other files. Please change them if needed. They look like this.
CLHint devhint({"780", "GTX", "Ti", "CUDA"})
An OpenCL general matrix multiplication (GEMM) API and kernel generator. More information is available on the wiki.
- OpenCL - OpenCL libraries and header files
- ROCm cmake modules
First create a build directory:
mkdir build; cd build;
Next configure cmake; if OpenCL is installed in one of the standard locations,
cmake ..
otherwise manually set OpenCL cmake variables, either in CMakeCache.txt
or
cmake -DOPENCL_LIBRARIES=<opencl-library-path> -DOPENCL_INCLUDE_DIRS<opencl-headers-path> ..
By default the install location is set to '/opt/rocm', this can be set by using CMAKE_INSTALL_PREFIX
:
cmake -DCMAKE_INSTALL_PREFIX=<miopen-installed-path> ..
The library can be built, from the build
directory
make miopengemm
And can be installed by using the 'install' target
make install
MIOpenGEMM provides an OpenCL GEMM API in gemm.hpp
, which should be included in your C++ source file
#include <miopengemm/gemm.hpp>
The key function is
template <typename T>
MIOpenGEMM::GemmStatus xgemm(...)
which provides the same functionality as clBLAS' clblasSgemm
and clblasDgemm
. Currently only T=float
and T=double
are supported. More information on xgemm
can be found on the wiki here.
To obtain just OpenCL kernel strings without executing GEMM, one can use miogemm.hpp
, as done by MIOpen.
Currently there is 1 basic test, which can be run with
make smallgeometrytests
./tests/smallgeometrytests
or
make check
All examples can be built with
make examples
or individually by name, for example
make find
The examples are described on the the wiki here.
HTML and PDF documentation can be built using:
make doc
This will build a local searchable web site inside the ./MIOpenGEMM/doc/html folder and a PDF document inside the ./MIOpenGEMM/doc/pdf folder.
Documentation is generated using Doxygen and should be installed separately.
HTML and PDFs are generated using Sphinx and Breathe, with the ReadTheDocs theme.
Requirements for both Sphinx, Breathe, and the ReadTheDocs theme can be filled for these in the MIOpenGEMM/doc folder:
pip install -r ./requirements.txt
Depending on your setup sudo
may be required for the pip install.