Pinned Repositories
aimet-model-zoo
awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
how-to-compile-your-language
An introduction to language design with building a compiler frontend on top of LLVM.
llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
PL-Compiler-Resource
程序语言与编译技术相关资料(持续更新中)
triton
Development repository for the Triton language and compiler
Triton-Compiler
Triton Compiler related materials.
triton-shared
Shared Middle-Layer for Triton Compilation
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
gfvvz's Repositories
gfvvz/Triton-Compiler
Triton Compiler related materials.
gfvvz/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
gfvvz/aimet-model-zoo
gfvvz/AISystem
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
gfvvz/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
gfvvz/PL-Compiler-Resource
程序语言与编译技术相关资料(持续更新中)
gfvvz/triton
Development repository for the Triton language and compiler
gfvvz/triton-shared
Shared Middle-Layer for Triton Compilation
gfvvz/Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.
gfvvz/cmake_example
Example pybind11 module built with a CMake-based build system
gfvvz/FlagGems
FlagGems is an operator library for large language models implemented in Triton Language.
gfvvz/gfvvz.github.io
Build a Jekyll blog in minutes, without touching the command line.
gfvvz/ggml
Tensor library for machine learning
gfvvz/lectures
Material for cuda-mode lectures
gfvvz/llama.cpp
LLM inference in C/C++
gfvvz/llama2.c
Inference Llama 2 in one file of pure C
gfvvz/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
gfvvz/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
gfvvz/llm-from-scratch
llama3 implementation one matrix multiplication at a time
gfvvz/LLM-workshop-2024
A 4-hour coding workshop to understand how LLMs are implemented and used
gfvvz/llm.c
LLM training in simple, raw C/CUDA
gfvvz/md-blogs
A blog where I write about research papers and blog posts I read.
gfvvz/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
gfvvz/mojo
The Mojo Programming Language
gfvvz/pytorch-transformer
Attention is all you need implementation
gfvvz/resource-stream
CUDA related news and material links
gfvvz/tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
gfvvz/triton-cpu
An experimental CPU backend for Triton (https//github.com/openai/triton)
gfvvz/Triton-Puzzles
Puzzles for learning Triton
gfvvz/youtube-rag