dvmazur/mixtral-offloading

Implementation of benchmarks (C4 perplexity, Wikitext perplexity)

Opened this issue · 0 comments

Hey,

Great repo! I'm trying to reproduce some of the benchmarks in your technical report, but having trouble evaluating it. Would you be able to share your code for evaluating the perplexity score of the model on C4, etc.?

Thanks!