Duconnor's Stars
Neargye/nameof
Nameof operator for modern C++, simply obtain the name of a variable, type, function, macro, and enum
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
zpoint/CPython-Internals
Dive into CPython internals, trying to illustrate every detail of CPython implementation
triton-lang/triton-cpu
An experimental CPU backend for Triton
adam-maj/tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
karpathy/llm.c
LLM training in simple, raw C/CUDA
0xBYTESHIFT/fp16
class that represents 16-bit floating point (half)
netcan/asyncio
asyncio is a c++20 library to write concurrent code using the async/await syntax.
CppCon/CppCon2023
Slides and other materials from CppCon 2023
llvm/torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
microsoft/triton-shared
Shared Middle-Layer for Triton Compilation
ml-explore/mlx
MLX: An array framework for Apple silicon
triton-lang/triton
Development repository for the Triton language and compiler
saeed771/cpp_book
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
rui314/chibicc
A small C compiler
merrymercy/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
iree-org/iree
A retargetable MLIR-based machine learning compiler and runtime toolkit.
BBuf/tvm_mlir_learn
compiler learning resources collect.
heekhero/ACSR
microsoft/hummingbird
Hummingbird compiles trained ML models into tensor computation for faster inference.
yuhao318/UP-ViT
This is an official implementation for "A Unified Pruning Framework for Vision Transformers".
amix/vimrc
The ultimate Vim configuration (vimrc)
WXinlong/DenseCL
Dense Contrastive Learning (DenseCL) for self-supervised representation learning, CVPR 2021 Oral.
zhoushengisnoob/DeepClustering
Methods and Implements of Deep Clustering
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Jimver/cuda-toolkit
GitHub Action to install CUDA
actions/starter-workflows
Accelerating new GitHub Actions workflows
MCG-NJU/SSD-LT
[ICCV 2021] Self Supervision to Distillation for Long-Tailed Visual Recognition