Will it compare performance with llama-moe?

Question

ccccj opened this issue 8 months ago · 1 comment

llama-moe:
https://github.com/pjlab-sys4nlp/llama-moe/tree/main

Or will a training framework be released with llama as the base model?

Answer 1 · 2024-02-04T03:13:03.000Z

Our DeepSeekMoE model is trained from scratch; it is not resumed or initialized from LLaMA/LLaMA2 checkpoints.
For absolute performance numbers, please refer to our paper and the llama-moe paper.