Pinned Repositories
aisys2023
algorithm_practice
Awesome-LLM-Inference
📖 A curated list of Awesome LLM Inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
aws-lambda
Tests network bandwidth
Co-Boosting
[ICLR 2024] "Enhancing One-Shot Federated Learning Through Data and Ensemble Co-Boosting"
HyperparameterTuning
Useful Bash script for model hyperparameter search
medipipe-based-ASL-translation
Includes J and Z (video -> text)
snuspl_intern
In development
zTT_onDev
zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation
MaverickJune's Repositories
MaverickJune/aisys2023
MaverickJune/algorithm_practice
MaverickJune/Awesome-LLM-Inference
📖 A curated list of Awesome LLM Inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
MaverickJune/aws-lambda
Tests network bandwidth
MaverickJune/Co-Boosting
[ICLR 2024] "Enhancing One-Shot Federated Learning Through Data and Ensemble Co-Boosting"
MaverickJune/HyperparameterTuning
Useful Bash script for model hyperparameter search
MaverickJune/medipipe-based-ASL-translation
Includes J and Z (video -> text)
MaverickJune/snuspl_intern
In development
MaverickJune/zTT_onDev
zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation
MaverickJune/concurrent_inference
An example of how to use the multiprocessing package along with PyTorch.
MaverickJune/Discrete-time-Signal-Processing-Solution
Discrete-time Signal Processing 3rd edition (Oppenheim)
MaverickJune/fedavg
PyTorch implementation of federated learning on MNIST
MaverickJune/FedOV
Towards Addressing Label Skews in One-Shot Federated Learning (ICLR 2023)
MaverickJune/flash-attention
Fast and memory-efficient exact attention
MaverickJune/incubator-nemo
Apache Nemo Incubating
MaverickJune/InfiniGen
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
MaverickJune/Knowledge-Distillation-Zoo
PyTorch implementation of various Knowledge Distillation (KD) methods.
MaverickJune/Learning-from-Data-Solutions
Repository of my solutions to the problems of "Learning from Data"
MaverickJune/MaverickJune
Config files for my GitHub profile.
MaverickJune/mdistiller
The official implementation of [CVPR2022] Decoupled Knowledge Distillation https://arxiv.org/abs/2203.08679 and [ICCV2023] DOT: A Distillation-Oriented Trainer https://openaccess.thecvf.com/content/ICCV2023/papers/Zhao_DOT_A_Distillation-Oriented_Trainer_ICCV_2023_paper.pdf
MaverickJune/Paper-review
MaverickJune/py3iperf3
A native Python iPerf3 client
MaverickJune/StaticCodeAnalyzer
Feature extraction for OpenCL kernels
MaverickJune/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs