horseee/LLM-Pruner

Loading a pruned model as a causal LM


Hi! After pruning a model with LLM-Pruner, how do we load it so that the result is equivalent to loading with `AutoModelForCausalLM` in Hugging Face Transformers?