new model request: DeepSeek-V2
Atry opened this issue · 1 comments
Atry commented
Model description
Tech Report: https://arxiv.org/abs/2405.04434
Code: https://huggingface.co/deepseek-ai/DeepSeek-V2/blob/e0828e3cc0a03408724b80c3cc92c8e072db8d01/modeling_deepseek.py
Open source status
- The model implementation is available
- The model weights are available
Provide useful links for the implementation
No response
swapdewalkar commented
I am taking this up