Pinned Repositories
Assignment01
AVXelerate
Chat_Room
DSAA
evaluatingMPS
A simple benchmark for comparing NVIDIA's MPS with NATIVE execution
H2O
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
LeetCode
Para-FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Teedy
Lightweight document management system packed with all the features you can expect from big expensive solutions
lerrorgk's Repositories
lerrorgk/Para-FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
lerrorgk/Assignment01
lerrorgk/AVXelerate
lerrorgk/Chat_Room
lerrorgk/DSAA
lerrorgk/evaluatingMPS
A simple benchmark for comparing NVIDIA's MPS with NATIVE execution
lerrorgk/H2O
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
lerrorgk/LeetCode
lerrorgk/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
lerrorgk/Teedy
Lightweight document management system packed with all the features you can expect from big expensive solutions