Pinned Repositories
aparapi
babylon
https://openjdk.org/projects/babylon
blis
BLAS-like Library Instantiation Software Framework
BLPSLibrary
chickensnake
clBLAS
a software library containing BLAS functions written in OpenCL
clFFT
a software library containing FFT functions written in OpenCL
clpeak
A tool which profiles OpenCL devices to find their peak capacities
freebsd-base-graphics
Fork of FreeBSD's base repository to work on graphics-stack-related projects
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
iotamudelta's Repositories
iotamudelta/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
iotamudelta/babylon
https://openjdk.org/projects/babylon
iotamudelta/blis
BLAS-like Library Instantiation Software Framework
iotamudelta/chickensnake
iotamudelta/dlrm
An implementation of a deep learning recommendation model (DLRM)
iotamudelta/faiss
A library for efficient similarity search and clustering of dense vectors.
iotamudelta/faiss_docker
iotamudelta/freeocl
Automatically exported from code.google.com/p/freeocl
iotamudelta/hcc-clang-upgrade
stage the upgrade of hcc-clang to clang ToT
iotamudelta/HIP
HIP : Convert CUDA to Portable C++ Code
iotamudelta/jgrapht
Master repository for the JGraphT project
iotamudelta/kokkos
Kokkos C++ Performance Portability Programming EcoSystem: The Programming Model - Parallel Execution and Memory Abstraction
iotamudelta/libflame
High-performance object-based library for DLA computations
iotamudelta/ogolem
This is the open-source ogolem framework for GA-based global optimization.
iotamudelta/ossci-job-dsl
Jenkins job definitions for OSSCI
iotamudelta/py-spy
Sampling profiler for Python programs
iotamudelta/remoteprocess
Cross platform process information in Rust
iotamudelta/roc-stdpar
iotamudelta/rocBLAS
Next generation BLAS implementation for ROCm platform
iotamudelta/rocm-docker
ROCm docker image
iotamudelta/ROCm.github.io
ROCm Website
iotamudelta/rocminfo
ROCm Application for Reporting System Info
iotamudelta/rocmProfileData_pub
iotamudelta/rocPRIM
ROCm Parallel Primitives
iotamudelta/ROCR-Runtime
ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime
iotamudelta/ROCT-Thunk-Interface
Radeon Open Compute Thunk Interface
iotamudelta/tornado_rusticl_docker
iotamudelta/TornadoVM
TornadoVM: A practical and efficient heterogeneous programming framework for managed languages
iotamudelta/UVM_benchmark
iotamudelta/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs