VimalWill's Stars
practical-tutorials/project-based-learning
Curated list of project-based tutorials
karpathy/llm.c
LLM training in simple, raw C/CUDA
triton-lang/triton
Development repository for the Triton language and compiler
aalhour/awesome-compilers
:sunglasses: Curated list of awesome resources on Compilers, Interpreters and Runtimes
dmlc/dlpack
common in-memory tensor structure
libxsmm/libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
ucb-bar/gemmini
Berkeley's Spatial Array Generator
ucb-bar/chisel-tutorial
chisel tutorial exercises and answers
mn416/QPULib
Language and compiler for the Raspberry Pi GPU
hermanhermitage/videocoreiv-qpu
Fun and Games with the Videocoreiv Quad Processor Units
clevercool/TileSparsity
mikeroyal/LLVM-Guide
LLVM (Low Level Virtual Machine) Guide. Learn all about the compiler infrastructure, which is designed for compile-time, link-time, run-time, and "idle-time" optimization of programs. Originally implemented for C/C++ , though, has a variety of front-ends, including Java, Python, etc.
microsoft/SparTA
makslevental/openhls
PyTorch model to RTL flow for low latency inference
cornell-zhang/allo
Allo: A Programming Model for Composable Accelerator Design
plaidml/tpp-mlir
TPP experimentation on MLIR for linear algebra
IntelLabs/FP8-Emulation-Toolkit
PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.
GATECH-EIC/ViTCoD
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
nod-ai/SHARK-Turbine
Unified compiler/runtime for interfacing with PyTorch Dynamo.
nod-ai/iree-amd-aie
IREE plugin repository for the AMD AIE accelerator
intel/intel-extension-for-openxla
undoio/l3
L3: Lightweight Logging Library. A very small 'C' library to generate low-footprint, non-intrusive, high-performance logging of trace messages in an mmap()'ed file. Tools are provided to unpack the binary log-data into human-readable traces.
iree-org/iree-turbine
IREE's PyTorch Frontend, based on Torch Dynamo.
iml130/iree-bare-metal-arm
Example for running IREE in a bare-metal Arm environment.
iml130/iree-template-cpp
IREE C++ Template
intel/graph-compiler
cohort-project/cohort-soc
Cohort Project
Xilinx/mlir-xten
plaidml/iree
👻
xerpi/Halide
a language for fast, portable data-parallel computation