Zhang-kg's Stars
siyuan-note/siyuan
A privacy-first, self-hosted, fully open source personal knowledge management software, written in TypeScript and Go.
musescore/MuseScore
MuseScore is an open source and free music notation software. For support, contribution, bug reports, visit MuseScore.org. Fork and make pull requests!
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
antirez/smallchat
A minimal programming example for a chat server
hongleizhang/RSPapers
RSTutorials: A Curated List of Must-read Papers on Recommender Systems.
microsoft/DeepSpeedExamples
Example models using DeepSpeed
ivmm/Student-resources
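This document describes the student, educational, and faculty discounts available to people with student or staff status. While enjoying these benefits, please do not forget your obligations: do not sell or transfer your eligibility for student or educational discounts, since doing so keeps other students from benefiting.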
elves/elvish
Powerful scripting language & versatile interactive shell
neuralmagic/deepsparse
Sparsity-aware deep learning inference runtime for CPUs
IgorMundstein/WinMemoryCleaner
This free RAM cleaner uses native Windows features to optimize memory areas. It's a compact, portable, and smart application.
microsoft/BitBLAS
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
eglinuxer/study_cmake
Eglinux's CMake course notes
microsoft/ArchProbe
A profiler to disclose and quantify hardware features on GPUs.
hrcheng1066/awesome-pruning
ghimiredhikura/Awasome-Pruning
Awesome Papers and Resources in Deep Neural Network Pruning with Source Code.
Shigangli/Magicube
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
gunrock/loops
🎃 GPU load-balancing library for regular and irregular computations.
OpenCAEPlus/OpenCAEPoro_ASC2024
OpenCAEPoro for ASC 2024
UDC-GAC/venom
A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores
microsoft/ConvStencil
OpenCAEPlus/OpenCAEPoro
Open source simulator for porous media flow
marsupialtail/gpu-sparsert
lbarrios/algoritmos3-final
Repository for studying for the Algoritmos 3 final exam
pkusc/zaychik-power-controller
The Zaychik Power Controller server
LucasWilkinson/ASpT-mirror
Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding
Liu-Xiangzhi/CAMI
C Abstract Machine Interpreter
UDC-GAC/CLASP
CoLumn-vector pruning-Aware SPmm kernel
LeiWang1999/BitBLAS
LeiWang1999/MSBitBLAS
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
KouweiLee/learning-docs
Some study notes