njh2001's Stars
PSAL-POSTECH/ONNXim
ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference
NVIDIA/jetson-rdma-picoevb
Minimal HW-based demo of GPUDirect RDMA on NVIDIA Jetson AGX Xavier running L4T
XiaoSong9905/CUDA-Optimization-Guide
Xiao's CUDA Optimization Guide [Actively Adding New Content]
microsoft/vidur
A large-scale simulation framework for LLM inference
stonne-simulator/sst-elements-with-stonne
STONNE Simulator integrated into SST Simulator
bowling233/dotfiles
Repository to sync my dotfiles
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
kvcache-ai/ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
triton-lang/triton
Development repository for the Triton language and compiler
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
LeiWang1999/ZYNQ-NVDLA
NVDLA (an open-source DL accelerator framework) implementation on FPGA.
icgw/ucas-beamer
:scroll: UCAS Beamer (LaTeX)
BoChen-Ye/Tiny_LeViT_Hardware_Accelerator
This is my hobby project in SystemVerilog to accelerate the LeViT network, which contains CNN and attention layers.
shrekuu/vimrc
My Vim config
yaoyao-liu/minimal-light
A simple and elegant Jekyll theme for an academic personal homepage
galeselee/Awesome_LLM_System-PaperList
Since the emergence of ChatGPT in 2022, accelerating large language models has become increasingly important. This is a list of papers on accelerating LLMs, currently focused mainly on inference acceleration; related work will be added gradually. Contributions are welcome!
AnswerDotAI/gpu.cpp
A lightweight library for portable low-level GPU computation using WebGPU.
liguodongiot/llm-action
This project aims to share the technical principles behind large models, along with hands-on experience.
BBuf/how-to-optim-algorithm-in-cuda
How to optimize some algorithms in CUDA.
gpu-mode/lectures
Material for gpu-mode lectures
amix/vimrc
The ultimate Vim configuration (vimrc)
wklken/vim-for-server
A simple .vimrc configuration for servers, without plugins.
kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
ryankillian/karpathy-lectures-notebooks
Jupyter notebooks accompanying Andrej Karpathy's neural network lectures. Includes extended notes and direct Colab links.
MK2112/nn-zero-to-hero-notes
Jupyter Notebook notes on Andrej Karpathy's tutorial series, "Neural Networks: Zero to Hero."
karpathy/LLM101n
LLM101n: Let's build a Storyteller
AmberLJC/LLMSys-PaperList
Large Language Model (LLM) Systems Paper List
karpathy/nn-zero-to-hero
Neural Networks: Zero to Hero
fengbintu/Neural-Networks-on-Silicon
This is originally a collection of papers on neural network accelerators. Now it's more like my selection of research on deep learning and computer architecture.
ggerganov/llama.cpp
LLM inference in C/C++