alanzhai219

To be Respect, Enterprising and Kindly

USTCshanghai

alanzhai219's Stars

zed-industries/zed
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
Language:Rust53k 230 10.3k3.4k
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python38.5k 387 3206.2k
carbon-app/carbon
:black_heart: Create and share beautiful images of your source code
Language:JavaScript34.8k 245 6781.9k
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Language:Python20.5k 135 1.2k1.5k
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook14k 99 181.1k
chenzomi12/AISystem
AISystem 主要是指AI系统，包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Language:Jupyter Notebook11.9k 155 421.7k
me115/design_patterns
图说设计模式
Language:C++7k 269 271.8k
DefTruth/lite.ai.toolkit
🛠 A lite C++ toolkit of 100+ Awesome AI models, support ORT, MNN, NCNN, TNN and TensorRT. 🎉🎉
Language:C++3.7k 67 270706
gpu-mode/lectures
Material for gpu-mode lectures
Language:Jupyter Notebook3.5k 51 9348
KDE/heaptrack
A heap memory profiler for Linux
Language:C++3.4k 51 0206
siduck/chadwm
Making dwm as beautiful as possible!
Language:C2.5k 23 134180
027xiguapi/code-box
本插件可以用于CSDN/知乎/脚本之家/博客园/掘金等网站,一键下载文章html或markdown文件;实现无需登录一键复制代码;支持选中代码;或者代码右上角按钮的一键复制;解除关注博主即可阅读全文提示;去除登录弹窗;去除跳转APP弹窗.
Language:TypeScript2.1k 8 35145
DefTruth/CUDA-Learn-Notes
📚150+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
Language:Cuda2k 15 9206
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
Language:Cuda1.9k 34 3494
kawre/leetcode.nvim
A Neovim plugin enabling you to solve LeetCode problems.
Language:Lua1.3k 8 11757
parallel101/cppguidebook
小彭老师领衔编写，现代C++的中文百科全书
Language:Typst787 63 3756
likejazz/llama3.cuda
llama3.cuda is a pure C/CUDA implementation for Llama 3 model.
Language:Cuda318 6 522
DangJin/awesome-readme-generator-tools
收录了一些可以快速创建出精美readme.md的工具集合
289 1 04
usyd-fsalab/fp6_llm
An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).
Language:Cuda229 6 1116
intel/pti-gpu
Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysis on Intel(R) Processor Graphics easily
Language:C++212 17 5856
microsoft/microxcaling
PyTorch emulation library for Microscaling (MX)-compatible data formats
Language:Python186 8 2425
ling0322/libllm
Efficient inference of large language models.
Language:C++145 3 17
nbergont/qgv
Interactive Qt graphViz display
Language:C++97 7 1137
andreasfertig/notebookcpp-tips-and-tricks-with-templates
Language:C++50 6 012
tetzank/SIMDSetOperations
testbed for different SIMD implementations for set intersection and set union
Language:C++40 8 28
qlibs/mem
C++20 Memory Allocators
31 4 01
shivance/minbpe.c
a Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization in pure C.
Language:C21 1 02
abdallah197/llama2-from-scratch
Language:Python7 1 13
sibellavia/tinymalloc
A lightweight memory allocator in C
Language:C5 2 00
enp1s0/simple_fp8
Language:C++1

alanzhai219

alanzhai219's Stars

zed-industries/zed

karpathy/nanoGPT

carbon-app/carbon

unslothai/unsloth

naklecha/llama3-from-scratch

chenzomi12/AISystem

me115/design_patterns

DefTruth/lite.ai.toolkit

gpu-mode/lectures

KDE/heaptrack

siduck/chadwm

027xiguapi/code-box

DefTruth/CUDA-Learn-Notes

HazyResearch/ThunderKittens

kawre/leetcode.nvim

parallel101/cppguidebook

likejazz/llama3.cuda

DangJin/awesome-readme-generator-tools

usyd-fsalab/fp6_llm

intel/pti-gpu

microsoft/microxcaling

ling0322/libllm

nbergont/qgv

andreasfertig/notebookcpp-tips-and-tricks-with-templates

tetzank/SIMDSetOperations

qlibs/mem

shivance/minbpe.c

abdallah197/llama2-from-scratch

sibellavia/tinymalloc

enp1s0/simple_fp8