ywhhh's Stars
Tianzhengshuyuan/llamacpp_with_annotation
DefTruth/CUDA-Learn-Notes
📚150+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
Ewenwan/ShiYanLou
学习C & C++ & python&汇编语言 LLVM编译器 数据结构 算法 操作系统 单片机 linux 面试
Fazziekey/Fazziekey
gpu-mode/lectures
Material for gpu-mode lectures
d2l-ai/d2l-tvm
Dive into Deep Learning Compiler
gpu-mode/awesomeMLSys
An ML Systems Onboarding list
yuanzhoulvpi2017/zero_nlp
中文nlp解决方案(大模型、数据、模型、训练、推理)
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
TheAlgorithms/C-Plus-Plus
Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes.
OpenPPL/ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
adam-maj/tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
SafeAILab/EAGLE
Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
bentoml/OpenLLM
Run any open-source LLMs, such as Llama, Mistral, as OpenAI compatible API endpoint in the cloud.
CUTE-Project/CUTE
Hardware implementation of CUTE
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
ggerganov/llama.cpp
LLM inference in C/C++
YingkunZhou/EdgeTransformerBench
edge/mobile transformer based Vision DNN inference benchmark
dair-ai/ML-YouTube-Courses
📺 Discover the latest machine learning / AI courses on YouTube.
mit-pdos/xv6-riscv
Xv6 for RISC-V
torchpipe/torchpipe
Serving Inside Pytorch
stevencohn/OneMore
A OneNote add-in with simple, yet powerful and useful features
THU-DSP-LAB/ventus-gpgpu
GPGPU processor supporting RISCV-V extension, developed with Chisel HDL
apache/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
spdustin/ChatGPT-AutoExpert
🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).
skiselev/8088_bios
BIOS for Intel 8088 based computers