sunlex0717's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
meta-llama/llama
Inference code for Llama models
isocpp/CppCoreGuidelines
The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++
lllyasviel/ControlNet
Let us control diffusion models!
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
ml-explore/mlx
MLX: An array framework for Apple silicon
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
triton-lang/triton
Development repository for the Triton language and compiler
FMInference/FlexiGen
Running large language models on a single GPU for throughput-oriented scenarios.
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
electronicarts/EASTL
EASTL stands for Electronic Arts Standard Template Library. It is an extensive and robust implementation that has an emphasis on high performance.
tpn/pdfs
Technically-oriented PDF Collection (Papers, Specs, Decks, Manuals, etc)
apple/corenet
CoreNet: A library for training deep neural networks
KhronosGroup/MoltenVK
MoltenVK is a Vulkan Portability implementation. It layers a subset of the high-performance, industry-standard Vulkan graphics and compute API over Apple's Metal graphics framework, enabling Vulkan applications to run on macOS, iOS and tvOS.
coala/coala
coala provides a unified command-line interface for linting and fixing all your code, regardless of the programming languages you use.
neuralmagic/deepsparse
Sparsity-aware deep learning inference runtime for CPUs
gpu-mode/lectures
Material for gpu-mode lectures
hollance/neural-engine
Everything we actually know about the Apple Neural Engine (ANE)
ELS-RD/transformer-deploy
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
BBuf/how-to-optim-algorithm-in-cuda
how to optimize some algorithm in cuda.
Voine/ChatWaifu_Mobile
移动版二次元 AI 老婆聊天器
mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
zeux/calm
CUDA/Metal accelerated language model inference
MomentsInGraphics/vulkan_renderer
A toy renderer written in C using Vulkan to perform real-time ray tracing research.
NVIDIA/Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
SJTU-ACA-Lab/blue-porcelain
Thinklab-SJTU/awesome-ai4eda
Awesome Artificial Intelligence for Electronic Design Automation Papers.
g-truc/sdk
LouiValley/RayTracing-Tech
This is a paper list about the most important techs and some hard core knowledge about ray tracing.