jason-huh's Stars
huawei-noah/Efficient-NLP
FLHonker/Awesome-Knowledge-Distillation
Awesome Knowledge-Distillation. Knowledge distillation papers (2014–2021), organized by category.
bytedance/effective_transformer
Running BERT without Padding
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
cs217/cs217.github.io
Course Webpage for CS 217 Hardware Accelerators for Machine Learning, Stanford University
apache/tvm
Open deep learning compiler stack for CPU, GPU, and specialized accelerators
gilshm/sparq
Post-training sparsity-aware quantization
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
t-vi/pytorch-tvmisc
Totally Versatile Miscellanea for Pytorch
microsoft/onnxruntime
ONNX Runtime: cross-platform, high-performance ML inferencing and training accelerator
intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
kssteven418/I-BERT
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
kingoflolz/mesh-transformer-jax
Model parallel transformers in JAX and Haiku
amirgholami/ZeroQ
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
Efficient-ML/Awesome-Model-Quantization
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research and is continuously improving. PRs for works (papers, repositories) missed by the repo are welcome.
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
parachutel/cs224n-stanford-winter2021
Stanford Winter 2021
leehanchung/cs224n
Stanford CS224n: Natural Language Processing with Deep Learning, Winter 2020
rishikksh20/LightSpeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
chester256/Model-Compression-Papers
Papers for deep neural network compression and acceleration
thunlp/PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
facebookresearch/LAMA
LAnguage Model Analysis
hiaoxui/soft-prompts
Yeachan-Heo/HSC2021-AlphaSolar
fastai/course-nlp
A Code-First Introduction to NLP course
deeppavlov/DeepPavlov
An open source library for deep learning end-to-end dialog systems and chatbots.
google-research/bert
TensorFlow code and pre-trained models for BERT
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.