Pinned Repositories
ABSA
CMakeTutorial
CMake中文实战教程
FuxiCTR
A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io/tutorials
Graph-Centric-Anomaly-Detection
how-to-optimize-gemm
How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
Informer2020
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Megatron-LM
Ongoing research training transformer models at scale
Netease
spongezz's Repositories
spongezz/ABSA
spongezz/CMakeTutorial
CMake中文实战教程
spongezz/FuxiCTR
A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io/tutorials
spongezz/Graph-Centric-Anomaly-Detection
spongezz/how-to-optimize-gemm
spongezz/How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
spongezz/Informer2020
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
spongezz/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
spongezz/Megatron-LM
Ongoing research training transformer models at scale
spongezz/Netease
spongezz/NN-CUDA-Example
Several simple examples for popular neural network toolkits calling custom CUDA operators.
spongezz/text-generation-inference
Large Language Model Text Generation Inference
spongezz/transformers-code
手把手带你实战Transformers 课程视频同步更新在B站与YouTube