njh2001

Institute of Computing Technology, CAS, Beijing, China

njh2001's Stars

PSAL-POSTECH/ONNXim
ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference
Language:C++4610
NVIDIA/jetson-rdma-picoevb
Minimal HW-based demo of GPUDirect RDMA on NVIDIA Jetson AGX Xavier running L4T
Language:Tcl15744
XiaoSong9905/CUDA-Optimization-Guide
Xiao's CUDA Optimization Guide [Actively Adding New Content]
22716
microsoft/vidur
A large-scale simulation framework for LLM inference
Language:Python24631
stonne-simulator/sst-elements-with-stonne
STONNE Simulator integrated into SST Simulator
Language:C++151
bowling233/dotfiles
Repository to sync my dotfiles
Language:Shell1
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language:Python4.3k390
kvcache-ai/ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Language:Python69334
triton-lang/triton
Development repository for the Triton language and compiler
Language:C++12.9k1.6k
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++5.5k924
LeiWang1999/ZYNQ-NVDLA
NVDLA (An Opensource DL Accelerator Framework) implementation on FPGA.
Language:Verilog30262
icgw/ucas-beamer
UCAS Beamer (LaTeX)
Language:TeX17125
BoChen-Ye/Tiny_LeViT_Hardware_Accelerator
A hobby project in SystemVerilog to accelerate the LeViT network, which contains CNN and attention layers.
Language:SystemVerilog7
shrekuu/vimrc
My Vim config
Language:Vim Script4
yaoyao-liu/minimal-light
A simple and elegant Jekyll theme for an academic personal homepage
Language:CSS621513
galeselee/Awesome_LLM_System-PaperList
Since the emergence of ChatGPT in 2022, accelerating Large Language Models has become increasingly important. This is a list of papers on accelerating LLMs, currently focused mainly on inference acceleration; related work will be added gradually. Contributions are welcome!
1586
AnswerDotAI/gpu.cpp
A lightweight library for portable low-level GPU computation using WebGPU.
Language:C++3.7k176
liguodongiot/llm-action
This project aims to share the technical principles behind large models along with hands-on experience.
Language:HTML9.4k922
BBuf/how-to-optim-algorithm-in-cuda
How to optimize some algorithms in CUDA.
Language:Cuda1.5k122
gpu-mode/lectures
Material for gpu-mode lectures
Language:Jupyter Notebook2.6k264
amix/vimrc
The ultimate Vim configuration (vimrc)
Language:Vim Script30.6k7.3k
wklken/vim-for-server
.vimrc: a simple configuration for servers, without plugins.
Language:Vim script604266
kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
1.1k23
ryankillian/karpathy-lectures-notebooks
Jupyter notebooks accompanying Andrej Karpathy's neural network lectures. Includes extended notes and direct Colab links.
Language:Jupyter Notebook94
MK2112/nn-zero-to-hero-notes
Jupyter Notebook notes on Andrej Karpathy's tutorial series, "Neural Networks: Zero to Hero."
Language:Jupyter Notebook11710
karpathy/LLM101n
LLM101n: Let's build a Storyteller
29.2k1.6k
AmberLJC/LLMSys-PaperList
Large Language Model (LLM) Systems Paper List
59224
karpathy/nn-zero-to-hero
Neural Networks: Zero to Hero
Language:Jupyter Notebook11.7k1.5k
fengbintu/Neural-Networks-on-Silicon
This is originally a collection of papers on neural network accelerators. Now it's more like my selection of research on deep learning and computer architecture.
1.8k380
ggerganov/llama.cpp
LLM inference in C/C++
Language:C++65.9k9.5k
