Pinned Repositories
100DaysOfRTL
100 Days of RTL
ace
Ace: A Jekyll theme.
aditya4d1.github.io
AllSpark
C/C++/CUDA project generator
amdgpu-conv-asm
AMDIL
CopyRXVega64
Analyze performance of Copy kernels on RXVega64
gemm-vega64
Implement asm gemm on vega64 for 4096x4096 fp32 matrix
ia64go
Intel AVX and SSE extensions for Go-Lang
aditya4d1's Repositories
aditya4d1/gemm-vega64
Implement asm gemm on vega64 for 4096x4096 fp32 matrix
aditya4d1/ia64go
Intel AVX and SSE extensions for Go-Lang
aditya4d1/CopyRXVega64
Analyze performance of Copy kernels on RXVega64
aditya4d1/amdgpu-conv-asm
aditya4d1/100DaysOfRTL
100 Days of RTL
aditya4d1/ace
Ace: A Jekyll theme.
aditya4d1/aditya4d1.github.io
aditya4d1/AllSpark
C/C++/CUDA project generator
aditya4d1/AMDIL
aditya4d1/biodbms
This repo contains the code for the paper "A New Parallel Searching Model for Integration of Biological Databases"
aditya4d1/ByteMLPerf
AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.
aditya4d1/cutlass
CUDA Templates for Linear Algebra Subroutines
aditya4d1/docker-commands
Commands for Docker
aditya4d1/float-toy
Use this to build intuition for the IEEE floating-point format
aditya4d1/how-to-compile-your-language
An introduction to language design with building a compiler frontend on top of LLVM.
aditya4d1/hsa-runtime
aditya4d1/notebooks
Collection of notebook guides created by the Brev.dev team!
aditya4d1/ORF-betav1.1
aditya4d1/pycuda
CUDA integration for Python, plus shiny features
aditya4d1/pyopencl
OpenCL integration for Python, plus shiny features
aditya4d1/rccl
ROCm Communication Collectives Library
aditya4d1/Redox
aditya4d1/travis-template
Project template to use travis-ci