Pinned Repositories
ThinK
ThinK: Thinner Key Cache by Query-Driven Pruning
3D-Machine-Learning
A learning resource repository for 3D machine learning
attention-transfer
Improving Convolutional Networks via Attention Transfer (ICLR 2017)
BNET
Batch Normalization with Enhanced Linear Transformation
channel-pruning
Channel Pruning for Accelerating Very Deep Neural Networks
gptqlora
GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ
PC-DARTS
PC-DARTS:Partial Channel Connections for Memory-Efficient Differentiable Architecture Search
qa-lora
Official PyTorch implementation of QA-LoRA
Trained-Rank-Pruning
Pytorch implementation of TRP
WLQ
caffe implementation of single level quantization
yuhuixu1993's Repositories
yuhuixu1993/PC-DARTS
PC-DARTS:Partial Channel Connections for Memory-Efficient Differentiable Architecture Search
yuhuixu1993/qa-lora
Official PyTorch implementation of QA-LoRA
yuhuixu1993/BNET
Batch Normalization with Enhanced Linear Transformation
yuhuixu1993/Trained-Rank-Pruning
Pytorch implementation of TRP
yuhuixu1993/WLQ
caffe implementation of single level quantization
yuhuixu1993/gptqlora
GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ
yuhuixu1993/3D-Machine-Learning
A learning resource repository for 3D machine learning
yuhuixu1993/attention-transfer
Improving Convolutional Networks via Attention Transfer (ICLR 2017)
yuhuixu1993/channel-pruning
Channel Pruning for Accelerating Very Deep Neural Networks
yuhuixu1993/diracnets
Training Very Deep Neural Networks Without Skip-Connections
yuhuixu1993/GPTQ-for-LLaMa
4 bits quantization of LLaMA using GPTQ, zeros fp16
yuhuixu1993/img_classification_pk_pytorch
Quickly comparing your image classification models with the state-of-the-art models (such as DenseNet, ResNet, ...)
yuhuixu1993/KVCache-Factory
Unified KV Cache Compression Methods for Auto-Regressive Models
yuhuixu1993/kvpress
LLM KV cache compression made easy
yuhuixu1993/mmdetection
OpenMMLab Detection Toolbox and Benchmark
yuhuixu1993/o3de
Open 3D Engine (O3DE) is an Apache 2.0-licensed multi-platform 3D engine that enables developers and content creators to build AAA games, cinema-quality 3D worlds, and high-fidelity simulations without any fees or commercial obligations.
yuhuixu1993/pdarts
Codes for our paper "Progressive Differentiable Architecture Search:Bridging the Depth Gap between Search and Evaluation"
yuhuixu1993/Search-R1
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
yuhuixu1993/Switchable-Normalization
Code for Switchable Normalization from "Differentiable Learning-to-Normalize via Switchable Normalization", https://arxiv.org/abs/1806.10779
yuhuixu1993/Tiny-DSOD
Tiny-DSOD: Lightweight Object Detection for Resource-Restricted Usage
yuhuixu1993/variational-dropout-sparsifies-dnn
Sparse Variational Dropout, ICML 2017
yuhuixu1993/xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding
yuhuixu1993/yuhuixu1993.github.io