zwshan

zwshan's Stars

NVIDIA/cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
Language:C++45790
Hardware-Alchemy/cuDNN-sample
cuDNN sample codes provided by Nvidia
Language:C++4415
OpenPPL/ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
Language:Python1.6k236
Pinging-ZJU/Pytorch-Memory-Utils
pytorch memory track code
4
neural-boost/neural-boost
Neural Boost targeting to boost inference performance.
Language:Python61
Tiiiger/QPyTorch
Low Precision Arithmetic Simulation in PyTorch
Language:Python26574
godweiyang/NN-CUDA-Example
Several simple examples for popular neural network toolkits calling custom CUDA operators.
Language:Python1.3k189
stefbraun/rnn_benchmarks
RNN benchmarks of pytorch, tensorflow and theano
Language:Python8818
guanh01/CS692-mlsys
This is the (evolving) reading list for the seminar.
565
zwshan/grnn
1
howardlau1999/sysu-thesis-typst
中山大学学位论文 Typst 模板
Language:Typst554
exaloop/codon
A high-performance, zero-overhead, extensible Python compiler using LLVM
Language:C++15.2k520
kaixindelele/ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研，利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
Language:Python18.5k1.9k
marcpaga/nanopore_benchmark
Language:Python285
ziishaned/learn-regex
Learn regex the easy way
45.7k6.2k
nanoporetech/bonito
A PyTorch Basecaller for Oxford Nanopore Reads
Language:Python396122
fmfi-compbio/deepnano-blitz
Very fast ONT basecaller
Language:Rust5212
arcsysu/SYsU-lang
A mini, simple and modular compiler lab for SYsU/SysY(tiny C). Based on Clang/LLVM/ANTLR4/Bison/Flex.
Language:C20835
hbrunie/PyFloT
Mixed precision tuning tool
Language:C++52
ElegantLaTeX/ElegantPaper
Elegant LaTeX Template for Working Papers
Language:TeX1.3k256
MoZeWei/moTuner
Language:C++91
SciCompKL/CoDiPack
Fast gradient evaluation in C++ based on Expression Templates.
Language:C++9131
arcsysu/Weekly-Paper-Sharing-OSM
组会论文分享“OSM: Off-Chip Shared Memory for GPUs”的 $\LaTeX$ 展示源码
Language:TeX4
minhhn2910/CUDA-mixed-precision
Mixed precision between FP32 and FP16x2 in CUDA programs
Language:C3
raydongpub/GPU-FPtuner
Language:Python1
LLNL/adapt-fp
Language:C++146
chwan1016/awesome-gnn-systems
A list of awesome GNN systems.
Language:Python28726
ccfddl/ccf-deadlines
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
Language:Vue6.4k443