alanzhai219's Stars
zed-industries/zed
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
carbon-app/carbon
🖤 Create and share beautiful images of your source code
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
chenzomi12/AISystem
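AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks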
me115/design_patterns
Design patterns explained with diagrams
DefTruth/lite.ai.toolkit
🛠 A lite C++ toolkit of 100+ Awesome AI models, support ORT, MNN, NCNN, TNN and TensorRT. 🎉🎉
gpu-mode/lectures
Material for gpu-mode lectures
KDE/heaptrack
A heap memory profiler for Linux
siduck/chadwm
Making dwm as beautiful as possible!
027xiguapi/code-box
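This plugin works on sites such as CSDN, Zhihu, jb51 (脚本之家), Cnblogs, and Juejin: download an article as an HTML or Markdown file with one click; copy code with one click without logging in; copy either by selecting the code or via the button at the top right of a code block; remove the "follow the blogger to read the full article" prompt; remove login pop-ups; remove "open in app" pop-ups.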
DefTruth/CUDA-Learn-Notes
📚150+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
kawre/leetcode.nvim
A Neovim plugin enabling you to solve LeetCode problems.
parallel101/cppguidebook
A Chinese-language encyclopedia of modern C++, written by a team led by Xiao Peng (小彭老师)
likejazz/llama3.cuda
llama3.cuda is a pure C/CUDA implementation for Llama 3 model.
DangJin/awesome-readme-generator-tools
A collection of tools for quickly creating polished readme.md files
usyd-fsalab/fp6_llm
Efficient GPU support for LLM inference with x-bit quantization (e.g. FP6, FP5).
intel/pti-gpu
Profiling Tools Interfaces for GPU (PTI for GPU) is a set of getting-started documentation and a tools library that make it easy to begin performance analysis on Intel(R) Processor Graphics
microsoft/microxcaling
PyTorch emulation library for Microscaling (MX)-compatible data formats
ling0322/libllm
Efficient inference of large language models.
nbergont/qgv
Interactive Qt graphViz display
andreasfertig/notebookcpp-tips-and-tricks-with-templates
tetzank/SIMDSetOperations
testbed for different SIMD implementations for set intersection and set union
qlibs/mem
C++20 Memory Allocators
shivance/minbpe.c
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, in pure C.
abdallah197/llama2-from-scratch
sibellavia/tinymalloc
A lightweight memory allocator in C
enp1s0/simple_fp8