VimalWill's Stars
ggerganov/llama.cpp
LLM inference in C/C++
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
llvm/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
AnthonyCalandra/modern-cpp-features
A cheatsheet of modern C++ language and library features.
ml-explore/mlx
MLX: An array framework for Apple silicon
halide/Halide
a language for fast, portable data-parallel computation
helblazer811/ManimML
ManimML is a project focused on providing animations and visualizations of common machine learning concepts with the Manim Community Library.
he-y/Awesome-Pruning
A curated list of neural network pruning resources.
merrymercy/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
neuralmagic/sparseml
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
hollance/neural-engine
Everything we actually know about the Apple Neural Engine (ANE)
llvm/torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
sophgo/tpu-mlir
Machine learning compiler based on MLIR for Sophgo TPU.
rajesh-s/computer-architecture-and-systems-resources
A curated list of Computer Architecture and Systems resources
openxla/stablehlo
Backward compatible ML compute opset inspired by HLO/MHLO
KEKE046/mlir-tutorial
Hands-On Practical MLIR Tutorial
spcl/gemm_hls
Scalable systolic array-based matrix-matrix multiplication implemented in Vivado HLS for Xilinx FPGAs.
boostorg/mp11
C++11 metaprogramming library
AlibabaResearch/flash-llm
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity
google/gcnn-survey-paper
intel/linux-npu-driver
Intel® NPU (Neural Processing Unit) Driver
synxlin/nn-compression
A Pytorch implementation of Neural Network Compression (pruning, deep compression, channel pruning)
Xilinx/mlir-air
matteocarnelos/microflow-rs
A robust and efficient TinyML inference engine.
apple/ml-upscale
Export utility for unconstrained channel pruned models
Xilinx/pyxir
podborski/GStreamerLatencyPlotter
A small node.js program that allows you to calculate and display the latency of each element of the GStreamer pipeline
wehu/c-mlir
A translator from c to MLIR
chuchu0512/image-processing-on-ZCU104
Using Xilinx Vitis, Vivado and Vitis HLS design program and running on Xilinx ZCU104 board
JamesTheZ/BladeDISC
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.