huggingface/transformers

New model request: DeepSeek-V2

Atry opened this issue · 1 comment

Atry commented

Model description

Tech Report: https://arxiv.org/abs/2405.04434
Code: https://huggingface.co/deepseek-ai/DeepSeek-V2/blob/e0828e3cc0a03408724b80c3cc92c8e072db8d01/modeling_deepseek.py

Open source status

  • The model implementation is available
  • The model weights are available
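
Until native support is added, the implementation linked above lives in the model repository as custom code, so it can presumably be exercised through the `trust_remote_code` path of the Auto classes. The following is a minimal sketch, not a tested recipe: it assumes the pinned commit also ships a configuration file wired into `auto_map`, and the helper name `load_remote_config` is hypothetical. It fetches only the config, since the full weights are very large.

```python
# Repo id and commit hash are copied from the "Code" link in this issue.
REPO_ID = "deepseek-ai/DeepSeek-V2"
REVISION = "e0828e3cc0a03408724b80c3cc92c8e072db8d01"


def load_remote_config(repo_id: str = REPO_ID, revision: str = REVISION):
    """Hypothetical helper: fetch the model config via the repo's custom code."""
    # Imported lazily so the constants above can be reused without transformers.
    from transformers import AutoConfig

    # trust_remote_code=True executes Python shipped in the model repo;
    # pinning `revision` keeps that code fixed to the commit linked above.
    # Assumption: the repo's config.json maps AutoConfig to a custom class.
    return AutoConfig.from_pretrained(
        repo_id,
        revision=revision,
        trust_remote_code=True,
    )
```

Calling `load_remote_config()` requires network access to the Hub. Swapping `AutoConfig` for `AutoModelForCausalLM` would follow the same pattern but downloads the full checkpoint.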

Provide useful links for the implementation

No response

I am taking this up