Pinned Repositories
2023-Project-117
Проект для курса «Моя первая научная статья», задача 117:: Поиск зависимостей биомеханических системах. Project for M1P, task 117: Search for dependencies in biomechanical systems
ArrayLSTM
GPU/CPU (CUDA) Implementation of "Recurrent Memory Array Structures", Simple RNN, LSTM, Array LSTM..
Basecalling-comparison
A comparison of different Oxford Nanopore basecallers
basecalling_architectures
bitsandbytes
8-bit CUDA functions for PyTorch
bonito
A PyTorch Basecaller for Oxford Nanopore Reads
buddy-benchmark
Benchmark Framework for Buddy Projects
grnn
libtorch_with_cuda_kernel
libtorch with custom cuda kernel
SYsU-lang-doc
提供 24年春季学期中山大学编译原理实验课程文档
zwshan's Repositories
zwshan/SYsU-lang-doc
提供 24年春季学期中山大学编译原理实验课程文档
zwshan/libtorch_with_cuda_kernel
libtorch with custom cuda kernel
zwshan/2023-Project-117
Проект для курса «Моя первая научная статья», задача 117:: Поиск зависимостей биомеханических системах. Project for M1P, task 117: Search for dependencies in biomechanical systems
zwshan/basecalling_architectures
zwshan/bitsandbytes
8-bit CUDA functions for PyTorch
zwshan/bonito
A PyTorch Basecaller for Oxford Nanopore Reads
zwshan/brocolli
Torch Fx Pytorch Model Converter
zwshan/buddy-benchmark
Benchmark Framework for Buddy Projects
zwshan/ChatPaper
Use ChatGPT to summarize the arXiv papers.
zwshan/ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
zwshan/composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
zwshan/cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
zwshan/cutlass-learning
the code of learning code
zwshan/cutlass_quant
Playing with quantization
zwshan/dm-ticket
大麦网自动购票, 支持docker一键部署。https://t.me/+2EELgNTYiMYxMTFl
zwshan/flash-linear-attention-pytorch
A Python implementation of flash linear attention operators in TransnormerLLM.
zwshan/gcc
zwshan/golsm
zwshan/HAWQ
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
zwshan/HIPIFY
HIPIFY: Convert CUDA to Portable C++ Code
zwshan/MSRCall
zwshan/nanopore_benchmark
zwshan/parallel-decoding
Repository of the paper "Accelerating Transformer Inference for Translation via Parallel Decoding"
zwshan/ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
zwshan/SYsU-lang2
中山大学编译原理课程实验(完全重构版本)
zwshan/TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
zwshan/tickets
一个基于 tauri + rust + vue 的抢票软件,大麦抢票软件。
zwshan/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
zwshan/tvm_gpu_gemm
play gemm with tvm
zwshan/zwshan.github.io
store my resume