pdh930105

Korea University/ PhD Student

Korea Universitykorea, Seoul

pdh930105's Stars

ImagineAILab/ai-by-hand-excel
2.4k 65 5353
hemingkx/SpeculativeDecodingPapers
📰 Must-read papers and blogs on Speculative Decoding ⚡️
564 28 426
Haiyang-W/GiT
[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
Language:Python323 7 1415
gstoica27/ZipIt
A framework for merging models solving different tasks with different initializations into one multi-task model without any additional training
Language:Python291 3 2725
trevorpogue/algebraic-nnhw
Algebraic enhancements for deep learning accelerator architectures
Language:Python264 5 015
feizc/DiT-MoE
Scaling Diffusion Transformers with Mixture of Experts
Language:Python243 6 811
jxiw/MambaInLlama
[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models
Language:Python189 6 2414
CASE-Lab-UMD/LLM-Drop
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
Language:Python158 4 1317
ShoufaChen/Awesome-Diffusion-Transformers
https://www.shoufachen.com/Awesome-Diffusion-Transformers/
Language:HTML126 6 18
ChenMnZ/PrefixQuant
An algorithm for static activation quantization of LLMs
Language:Python109 8 235
thunlp/Ouroboros
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)
Language:Python85 7 69
ChangyuanWang17/QVLM
[NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.
Language:Python58 2 94
NVlabs/COAT
Language:Python52 7 01
smart-lty/ParallelSpeculativeDecoding
The official code for paper "parallel speculative decoding with adaptive draft length."
Language:Python31 1 11
Zhaoshixin-sky/CIM-MLC
[ASPLOS 2024] CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators
Language:Python25 2 24
Intelligent-Computing-Lab-Yale/TesseraQ
Language:Python18 1 22
PiggyJerry/DC-Net
The code for paper: "DC-Net: Divide-and-Conquer for Salient Object Detection"
Language:Python17 2 51
thunlp/Seq1F1B
Sequence-level 1F1B schedule for LLMs.
Language:Python16 0 11
ebby-s/MX-for-FPGA
Implementation of Microscaling data formats in SystemVerilog.
Language:SystemVerilog13 2 02
snu-comparch/Tender
Tender: Accelerating Large Language Models via Tensor Decompostion and Runtime Requantization (ISCA'24)
Language:Python12 3 01
BidyutSaha/TinyTNAS
TinyTNAS is a hardware-aware, multi-objective, time-bound Neural Architecture Search (NAS) tool designed for TinyML time series classification. Unlike GPU-based NAS methods, it runs efficiently on CPUs.
Language:Python11 2 00
Cheliosoops/BitQ
Language:Python11 2 00
abdelfattah-lab/shadow_llm
Language:Python7 3 11
lynn2089/SmartLite
Language:Python7 1 22
b-faye/OneEncoder
Language:Python60
ershang2/SlowTrack
Language:Python5 1 13
pingxue-hfut/DWR
Fast and Accurate Binary Neural Networks based on Depth-Width Reshaping
Language:Python5 2 0
yamilvindas/gdec
Language:Python2 1 1
dongwonjo/BinaryMoS
Language:Python1 1 0
naufalso/adversarial-manhole
Official code for "Adversarial Manholes: Challenging Monocular Depth Estimation and Semantic Segmentation with Physical Attacks"
Language:Python1 2 01

pdh930105

pdh930105's Stars

ImagineAILab/ai-by-hand-excel

hemingkx/SpeculativeDecodingPapers

Haiyang-W/GiT

gstoica27/ZipIt

trevorpogue/algebraic-nnhw

feizc/DiT-MoE

jxiw/MambaInLlama

CASE-Lab-UMD/LLM-Drop

ShoufaChen/Awesome-Diffusion-Transformers

ChenMnZ/PrefixQuant

thunlp/Ouroboros

ChangyuanWang17/QVLM

NVlabs/COAT

smart-lty/ParallelSpeculativeDecoding

Zhaoshixin-sky/CIM-MLC

Intelligent-Computing-Lab-Yale/TesseraQ

PiggyJerry/DC-Net

thunlp/Seq1F1B

ebby-s/MX-for-FPGA

snu-comparch/Tender

BidyutSaha/TinyTNAS

Cheliosoops/BitQ

abdelfattah-lab/shadow_llm

lynn2089/SmartLite

b-faye/OneEncoder

ershang2/SlowTrack

pingxue-hfut/DWR

yamilvindas/gdec

dongwonjo/BinaryMoS

naufalso/adversarial-manhole