gfvvz

AI Compiler

Shanghai, China

Pinned Repositories

aimet-model-zoo
Language:Python
awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Language:C++
how-to-compile-your-language
An introduction to language design through building a compiler frontend on top of LLVM.
Language:HTML
llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
PL-Compiler-Resource
Resources on programming languages and compiler technology (continuously updated)
triton
Development repository for the Triton language and compiler
Language:C++
Triton-Compiler
Triton Compiler related materials.
triton-shared
Shared Middle-Layer for Triton Compilation
Language:MLIR
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Language:Python

gfvvz's Repositories

gfvvz/Triton-Compiler
Triton Compiler related materials.
gfvvz/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
gfvvz/aimet-model-zoo
Language:Python
gfvvz/AISystem
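AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.
Language:Jupyter Notebook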
gfvvz/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
gfvvz/PL-Compiler-Resource
Resources on programming languages and compiler technology (continuously updated)
gfvvz/triton
Development repository for the Triton language and compiler
Language:C++
gfvvz/triton-shared
Shared Middle-Layer for Triton Compilation
Language:MLIR
gfvvz/Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.
gfvvz/cmake_example
Example pybind11 module built with a CMake-based build system
gfvvz/FlagGems
FlagGems is an operator library for large language models implemented in Triton Language.
Language:Python
gfvvz/gfvvz.github.io
Build a Jekyll blog in minutes, without touching the command line.
Language:Jupyter Notebook
gfvvz/ggml
Tensor library for machine learning
gfvvz/lectures
Material for cuda-mode lectures
Language:Jupyter Notebook
gfvvz/llama.cpp
LLM inference in C/C++
gfvvz/llama2.c
Inference Llama 2 in one file of pure C
gfvvz/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook
gfvvz/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
gfvvz/llm-from-scratch
llama3 implementation one matrix multiplication at a time
gfvvz/LLM-workshop-2024
A 4-hour coding workshop to understand how LLMs are implemented and used
gfvvz/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda
gfvvz/md-blogs
A blog where I write about research papers and blog posts I read.
gfvvz/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
gfvvz/mojo
The Mojo Programming Language
Language:Mojo
gfvvz/pytorch-transformer
Attention is all you need implementation
gfvvz/resource-stream
CUDA related news and material links
gfvvz/tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
gfvvz/triton-cpu
An experimental CPU backend for Triton (https://github.com/openai/triton)
gfvvz/Triton-Puzzles
Puzzles for learning Triton
gfvvz/youtube-rag
Language:Jupyter Notebook