Add Tencent Hunyuan-Large

Question

Add Tencent Hunyuan-Large

Opened this issue a month ago · 2 comments

They claim an overall MMLU-Pro score of 60.2.

The currently unveiled Hunyuan-Large (Hunyuan-MoE-A52B) model is the largest open-source Transformer-based MoE model in the industry, featuring a total of 389 billion parameters and 52 billion active parameters.

https://huggingface.co/tencent/Tencent-Hunyuan-Large

Answer 1 · 2024-11-13T22:30:37.000Z

Agreed, this would be very interesting to see

Answer 2 · 2024-11-14T23:47:18.000Z

The 60.2 is for the base model; the instruct version probably scores better.