Loading pruned model for causal llm
Opened this issue · 0 comments
sriyachakravarthy commented
Hi! While loading the pruned model (using this llm pruner) how do we load the model which will result in equivalent loading like AutoModelforCausalLlm as in hugging face transformers?