Pinned Repositories
APNN-TC
APNN-TC-kernel
awesome-distributed-ml
A curated list of awesome projects and papers for distributed training or inference
awesome-model-quantization
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.
Awesome-Pruning
A curated list of neural network pruning resources.
bert
TensorFlow code and pre-trained models for BERT
bismo
BISMO: A Scalable Bit-Serial Matrix Multiplication Overlay for Reconfigurable Computing
bitfusion
Simulator for BitFusion
bitsandbytes
8-bit CUDA functions for PyTorch
chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
frankinwi's Repositories
frankinwi/awesome-distributed-ml
A curated list of awesome projects and papers for distributed training or inference
frankinwi/Awesome-Pruning
A curated list of neural network pruning resources.
frankinwi/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
frankinwi/chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
frankinwi/mamba
frankinwi/bitsandbytes
8-bit CUDA functions for PyTorch
frankinwi/venom
A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores
frankinwi/awesome-model-quantization
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.
frankinwi/wanda
A simple and effective LLM pruning approach.
frankinwi/WeiZheng.github.io
frankinwi/MNSIM-2.0
A Behavior-Level Modeling Tool for Memristor-based Neuromorphic Computing Systems
frankinwi/maestro
An analytical cost model evaluating DNN mappings (dataflows and tiling).
frankinwi/vision-transformers-cifar10
Let's train vision transformers (ViT) for cifar 10!
frankinwi/OBC
Code for the NeurIPS 2022 paper "Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning".
frankinwi/Systematic-Investigation-of-Sparse-Perturbed-Sharpness-Aware-Minimization-Optimizer
Systematic Investigation of Sparse Perturbed Sharpness-Aware Minimization Optimizer
frankinwi/DynamicViT
[NeurIPS 2021] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
frankinwi/PyHessian
PyHessian is a Pytorch library for second-order based analysis and training of Neural Networks
frankinwi/ViTCoD
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
frankinwi/pytorch-cifar
95.47% on CIFAR10 with PyTorch
frankinwi/LPF-SGD
frankinwi/Magicube
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
frankinwi/Sparse-Sharpness-Aware-Minimization
[NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation
frankinwi/transformers-benchmarks
real Transformer TeraFLOPS on various GPUs
frankinwi/sam
SAM: Sharpness-Aware Minimization (PyTorch)
frankinwi/loss-landscape
Code for visualizing the loss landscape of neural nets
frankinwi/WinogradAwareNets
frankinwi/darts
Differentiable architecture search for convolutional and recurrent networks
frankinwi/uSystolic-Sim
A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022.
frankinwi/ZAQ-code
CVPR 2021 : Zero-shot Adversarial Quantization (ZAQ)
frankinwi/LaMCTS
The release codes of LA-MCTS with its application to Neural Architecture Search.