SunflowerAries's Stars
f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
huggingface/candle
Minimalist ML framework for Rust
federico-busato/Modern-CPP-Programming
Modern C++ Programming Course (C++03/11/14/17/20/23/26)
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
NVIDIA/FasterTransformer
Transformer-related optimization, including BERT and GPT
huggingface/text-embeddings-inference
A blazing-fast inference solution for text embedding models
leptonai/leptonai
A Pythonic framework to simplify AI service building
dendibakh/perf-book
The book "Performance Analysis and Tuning on Modern CPUs"
shining1984/PL-Compiler-Resource
Resources on programming languages and compiler technology (continuously updated)
znck/grammarly
Grammarly for VS Code
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
sustcsonglin/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
cuda-mode/resource-stream
CUDA-related news and material links
stone-zeng/fduthesis
LaTeX thesis template for Fudan University
sampsyo/cs6120
Advanced compilers
moderncv/moderncv
A modern curriculum vitae class for LaTeX
mit-han-lab/TinyChatEngine
TinyChatEngine: On-Device LLM Inference Library
olcf/cuda-training-series
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
google/aqt
Shenggan/awesome-distributed-ml
A curated list of awesome projects and papers for distributed training or inference
MARD1NO/CUDA-PPT
alibaba/easydist
Automated Parallelization System and Infrastructure for Multiple Ecosystems
mangpo/swizzle-inventor
A framework that helps implement swizzle GPU kernels
nod-ai/techtalks
zhangjiang-compiler/tech-show
Zhangjiang Compiler Tech Show
mkongiv/model-driven-transf-pldi19
Patches for the PLDI'19 paper titled "Model-driven Transformations for etc."
mkongiv/polycross-comp-ics21
ICS'21 Artifact - A Polyhedral Cross-Compilation Approach for Tile Size Selection of Affine Programs on GPGPUs