ozturkosu's Stars
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
triton-lang/triton
Development repository for the Triton language and compiler
python-trio/trio
Trio – a friendly Python library for async concurrency and I/O
andreasfertig/cppinsights
C++ Insights - See your source code with the eyes of a compiler
AnswerDotAI/gpu.cpp
A lightweight library for portable low-level GPU computation using WebGPU.
kokkos/kokkos
Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction
intel/intel-extension-for-pytorch
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
tensor-compiler/taco
The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs
ROCm/MIOpen
AMD's Machine Intelligence Library
ROCm/HIPIFY
HIPIFY: Convert CUDA to Portable C++ Code
LLNL/RAJA
RAJA Performance Portability Layer (C++)
UoB-HPC/BabelStream
STREAM, for lots of devices written in many programming models
kokkos/kokkos-tutorials
Tutorials for the Kokkos C++ Performance Portability Programming Ecosystem
ROCm/rccl
ROCm Communication Collectives Library (RCCL)
codeplaysoftware/portBLAS
Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.
ROCm/ROCR-Runtime
ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime
ROCm/MIVisionX
MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.
ROCm/rocPRIM
ROCm Parallel Primitives
ROCm/rocMLIR
kokkos/kokkos-tools
Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools
ROCm/rocSPARSE
Next generation SPARSE implementation for ROCm platform
ROCm/rocRAND
RAND library for HIP programming language
ROCm/hipCUB
Reusable software components for ROCm developers
ROCm/amd_matrix_instruction_calculator
A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators
ROCm/rpp
AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.
ROCm/amdsmi
AMD SMI
ROCm/hipTensor
AMD’s C++ library for accelerating tensor primitives
merthidayetoglu/HiCCL
A hierarchical collective communications library with portable optimizations
pmodels/mpi-tutorial-examples