LukeLIN-web's Stars
PlexPt/awesome-chatgpt-prompts-zh
ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。
Vonng/ddia
《Designing Data-Intensive Application》DDIA中文翻译
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
NVIDIA/cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
openmlsys/openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
conanhujinming/tips_for_interview
我的一些面试心得;自学CS历程分享;找工作求职经验分享
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
shengyp/doing_the_PhD
HeKun-NVIDIA/CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
AmberLJC/LLMSys-PaperList
Large Language Model (LLM) Systems Paper List
mbzuai-oryx/MobiLlama
MobiLlama : Small Language Model tailored for edge devices
NVIDIA/nvbench
CUDA Kernel Benchmarking Library
NVIDIA/DCGM
NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs
anyscale/llm-continuous-batching-benchmarks
zkysfls/20fall-
Hai-chao-Zhang/OOSTraj
[CVPR24] OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising