Pinned Repositories
compiler-and-arch
A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture
compiler_assignment
compiler assignment testing codes [PKU, Course: Compiler Design, 2019 Spring]
Domino
FlexTensor-Micro
FlexTensor for MICRO tutorial
MatmulTutorial
A Easy-to-understand TensorOp Matmul Tutorial
ZSZ_Samples
Benchmark & Study materials
mlc-llm
Universal LLM Deployment Engine with ML Compilation
AMOS
Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators
FlexTensor
Automatic Schedule Exploration and Optimization Framework for Tensor Computations
TileFlow
TileFlow is a performance analysis tool based on Timeloop for fusion dataflows
KnowingNothing's Repositories
KnowingNothing/compiler-and-arch
A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture
KnowingNothing/MatmulTutorial
A Easy-to-understand TensorOp Matmul Tutorial
KnowingNothing/Domino
KnowingNothing/ZSZ_Samples
Benchmark & Study materials
KnowingNothing/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
KnowingNothing/Beijing_Daxuexi_Simple
北京 青年大学习 使用Github Actions自动完成
KnowingNothing/mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
KnowingNothing/byteir
ByteIR
KnowingNothing/cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
KnowingNothing/cutlass
CUDA Templates for Linear Algebra Subroutines
KnowingNothing/cutlass-kernels
KnowingNothing/DeepEP
DeepEP: an efficient expert-parallel communication library
KnowingNothing/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
KnowingNothing/flashinfer
FlashInfer: Kernel Library for LLM Serving
KnowingNothing/FlashMLA
KnowingNothing/FlexFlow
A distributed deep learning framework.
KnowingNothing/generative-ai-for-beginners
12 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
KnowingNothing/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
KnowingNothing/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
KnowingNothing/nccl
Optimized primitives for collective multi-GPU communication
KnowingNothing/silo-lm
SILO Language Models code repository
KnowingNothing/tensorflow
An Open Source Machine Learning Framework for Everyone
KnowingNothing/tflite-micro
TensorFlow Lite for Microcontrollers
KnowingNothing/thrust
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
KnowingNothing/tilelang
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
KnowingNothing/triton
Development repository for the Triton language and compiler
KnowingNothing/tutorials
PyTorch tutorials.
KnowingNothing/uwsampl.github.io
The UW SAMPL group's website.
KnowingNothing/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
KnowingNothing/WorkScheduler
Work schedule app