Pinned Repositories
APNN-TC
APNN-TC-kernel
awesome-distributed-ml
A curated list of awesome projects and papers for distributed training or inference
awesome-model-quantization
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and we are continuously improving the project. PRs for works (papers, repositories) that the repo has missed are welcome.
Awesome-Pruning
A curated list of neural network pruning resources.
bert
TensorFlow code and pre-trained models for BERT
bismo
BISMO: A Scalable Bit-Serial Matrix Multiplication Overlay for Reconfigurable Computing
bitfusion
Simulator for BitFusion
bitsandbytes
8-bit CUDA functions for PyTorch
chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
frankinwi's Repositories
frankinwi/awesome-distributed-ml
A curated list of awesome projects and papers for distributed training or inference
frankinwi/awesome-model-quantization
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and we are continuously improving the project. PRs for works (papers, repositories) that the repo has missed are welcome.
frankinwi/Awesome-Pruning
A curated list of neural network pruning resources.
frankinwi/bitsandbytes
8-bit CUDA functions for PyTorch
frankinwi/chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
frankinwi/darts
Differentiable architecture search for convolutional and recurrent networks
frankinwi/DynamicViT
[NeurIPS 2021] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
frankinwi/LaMCTS
The released code for LA-MCTS and its application to Neural Architecture Search.
frankinwi/loss-landscape
Code for visualizing the loss landscape of neural nets
frankinwi/LPF-SGD
frankinwi/maestro
An analytical cost model evaluating DNN mappings (dataflows and tiling).
frankinwi/Magicube
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
frankinwi/mamba
frankinwi/MNSIM-2.0
A Behavior-Level Modeling Tool for Memristor-based Neuromorphic Computing Systems
frankinwi/OBC
Code for the NeurIPS 2022 paper "Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning".
frankinwi/PyHessian
PyHessian is a PyTorch library for second-order analysis and training of neural networks
frankinwi/pytorch-cifar
95.47% on CIFAR10 with PyTorch
frankinwi/sam
SAM: Sharpness-Aware Minimization (PyTorch)
frankinwi/Sparse-Sharpness-Aware-Minimization
[NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation
frankinwi/Systematic-Investigation-of-Sparse-Perturbed-Sharpness-Aware-Minimization-Optimizer
Systematic Investigation of Sparse Perturbed Sharpness-Aware Minimization Optimizer
frankinwi/transformers-benchmarks
Real Transformer TeraFLOPS on various GPUs
frankinwi/uSystolic-Sim
A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022.
frankinwi/venom
A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores
frankinwi/vision-transformers-cifar10
Let's train vision transformers (ViT) for CIFAR-10!
frankinwi/ViTCoD
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
frankinwi/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
frankinwi/wanda
A simple and effective LLM pruning approach.
frankinwi/WeiZheng.github.io
frankinwi/WinogradAwareNets
frankinwi/ZAQ-code
CVPR 2021: Zero-shot Adversarial Quantization (ZAQ)