microsoft/TransformerCompression
For releasing code related to compression methods for transformers, accompanying our publications
PythonMIT
Issues
- 1
Convert Layernorm to RMSnorm
#110 opened by lihuang258 - 0
Issues with LLAMA 3-8B-Instruct model
#177 opened by madhusrivatsav - 0
NaN weights after rotating and slicing
#176 opened by sriyachakravarthy - 9
How to export entire model?
#114 opened by YuanzeSun - 0
need help
#175 opened by jiajunsun68 - 2
NotImplementedError: xx is neither a Hugging Face model nor a supported local model.
#138 opened by qxpBlog - 0
Quarot: DeepSeek-V2 Support
#174 opened by RanchiZhao - 0
How to evaluate the sliced model
#173 opened by 1250826219 - 2
What is the number of parameters afrer slicing?
#165 opened by yaya-sy - 1
- 0
- 8
can't reproduce result
#127 opened by MrGGLS - 1
How to load the tuned slicemodel
#140 opened by qxpBlog - 2
- 1
Command R and R+ support
#133 opened by Steel-skull - 2
- 2
PHI包导入问题
#164 opened by 1250826219 - 0
model inference
#160 opened by ChrisXULC - 0
error when Fine-tuning a sliced model llama 3
#155 opened by ChrisXULC - 4
How to run with llama-1?
#128 opened by liuxiaozhu01 - 2
Question of `RMSNorm`'s `forward` function
#131 opened by zhaoyang-star - 2
Layer fusion with Llama
#129 opened by kiucho - 11
why the generation speed of the pruned model by SliceGPT is slower than the original model?
#115 opened by joyce0105-ops - 1
question about the cal_dataset
#113 opened by tu2022 - 1
Proof of Equation 2
#121 opened by kiucho - 3
Recovery the drop in accuracy
#119 opened by zhaoyang-star - 2
run_finetuning.py issue on non-sliced private models
#123 opened by taaviv0 - 2
failures when evaluating the model
#117 opened by YuanzeSun - 0
Is there any inference demo for sliced model?
#116 opened by zhaoyang-star - 1
- 1
Reproducing the speed up results in table 2
#112 opened by Ahmed-Roushdy - 0
- 1
c4 dataset download fails
#56 opened by nailimixaM - 0
- 1
parameter count is innacurate
#72 opened by jameshensman - 0
Replace <pad> with <eos> for Llama2 and Phi2
#60 opened by nailimixaM - 5
Mistral Support
#81 opened by fakerybakery - 0
- 0
Scrub keys from commit history
#76 opened by nailimixaM - 0
--distribute fails for calculating perplexity
#66 opened by nailimixaM - 0
- 1
hf token should be retrivable from env
#58 opened by jameshensman - 1
llamadecoder in transformers 4.36
#49 opened by jameshensman - 0
Common command-line args
#52 opened by nailimixaM - 6
Action required: migrate or opt-out of migration to GitHub inside Microsoft
#30 opened by microsoft-github-policy-service