spongezz

Pinned Repositories

ABSA
Language:Python00
CMakeTutorial
CMake中文实战教程
Language:C++00
FuxiCTR
A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io/tutorials
Language:Python0 0 00
Graph-Centric-Anomaly-Detection
Language:Python0 1 00
how-to-optimize-gemm
Language:C00
How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
Language:Cuda00
Informer2020
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
Language:Python00
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language:C++0 0 00
Megatron-LM
Ongoing research training transformer models at scale
Language:Python00
Netease
Language:C0 1 00

spongezz's Repositories

spongezz/ABSA
Language:Python00
spongezz/CMakeTutorial
CMake中文实战教程
Language:C++00
spongezz/FuxiCTR
A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io/tutorials
Language:Python0 0 00
spongezz/Graph-Centric-Anomaly-Detection
Language:Python0 1 00
spongezz/how-to-optimize-gemm
Language:C00
spongezz/How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
Language:Cuda00
spongezz/Informer2020
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
Language:Python00
spongezz/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language:C++0 0 00
spongezz/Megatron-LM
Ongoing research training transformer models at scale
Language:Python00
spongezz/Netease
Language:C0 1 00
spongezz/NN-CUDA-Example
Several simple examples for popular neural network toolkits calling custom CUDA operators.
Language:Python0 0
spongezz/text-generation-inference
Large Language Model Text Generation Inference
spongezz/transformers-code
手把手带你实战Transformers 课程视频同步更新在B站与YouTube