Pinned Repositories
AMG
Algebraic multigrid benchmark
Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
awesome-model-quantization
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.
Batched-SpMM
New batched algorithm for sparse matrix-matrix multiplication (SpMM)
BLASTed
Fine-grain parallel iterative methods
cfs-spmv
Conflict-free symmetric SpMV library
CPP
Lecture notes, projects and other materials for Course 'CS205 C/C++ Program Design' at Southern University of Science and Technology.
cuFoam
cuFoam is a cuda based linear equations solver for OpenFoam.
HPC-Lab-Docs
Documentation for HPC course
professional-cuda-c-programming
MicroZHY's Repositories
MicroZHY/sputnik
A library of GPU kernels for sparse matrix operations.
MicroZHY/MpSpMV
Mixed Precision SpMV (MpSpMV)
MicroZHY/merge-spmm
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
MicroZHY/tSparse
A GPU algorithm for sparse matrix-matrix multiplication
MicroZHY/remifa
Reduced and mixed precision factorization algorithms
MicroZHY/cfs-spmv
Conflict-free symmetric SpMV library
MicroZHY/cuFoam
cuFoam is a cuda based linear equations solver for OpenFoam.
MicroZHY/I-SpMV
An iterative sparse matrix dense vector multiplication solver with MPI
MicroZHY/parilu-matlab
Incomplete factorizations constructed with fixed-point iterations -- matlab and mex implementations
MicroZHY/matrix_format_performance
MicroZHY/tcu_scope
MicroZHY/DarkIntegerMultiply_SunwayTaihulight
神威太湖之光上的大整数乘法。国产CPU并行应用挑战赛(CPC17)的神秘应用。
MicroZHY/Batched-SpMM
New batched algorithm for sparse matrix-matrix multiplication (SpMM)
MicroZHY/tcqr
Tensorコアを用いたBatched QR分解
MicroZHY/fp16Scaling
MicroZHY/AMG
Algebraic multigrid benchmark
MicroZHY/gpumembench
A GPU benchmark suite for assessing on-chip GPU memory bandwidth
MicroZHY/professional-cuda-c-programming
MicroZHY/MergePathOMP