Simple BLAS Operations Accelerated in Raspberry Pi GPU using V3DLib.
Initially, this was a part of waffle and made stale due to some technical complexities.
v3dBLAS uses, V3DLib to program the Broadcom Videocore VI GPU in the BCM2711 SoC of the Raspberry Pi 4 single board computer.
Use the shell scrips to clone, build and test the kernels.