Pinned Repositories
3FS
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
awesome-deepseek-integration
Integrate the DeepSeek API into popular softwares
DeepEP
DeepEP: an efficient expert-parallel communication library
DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
DeepSeek-R1
DeepSeek-V3
DeepSeek-VL2
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
FlashMLA
FlashMLA: Efficient MLA kernels
Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
open-infra-index
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
DeepSeek's Repositories
deepseek-ai/DeepSeek-V3
deepseek-ai/DeepSeek-R1
deepseek-ai/awesome-deepseek-integration
Integrate the DeepSeek API into popular softwares
deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
deepseek-ai/Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
deepseek-ai/FlashMLA
FlashMLA: Efficient MLA kernels
deepseek-ai/3FS
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
deepseek-ai/DeepEP
DeepEP: an efficient expert-parallel communication library
deepseek-ai/open-infra-index
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
deepseek-ai/DeepSeek-LLM
DeepSeek LLM: Let there be answers
deepseek-ai/DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
deepseek-ai/DeepGEMM
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
deepseek-ai/DeepSeek-VL2
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
deepseek-ai/smallpond
A lightweight data processing framework built on DuckDB and 3FS.
deepseek-ai/DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
deepseek-ai/DreamCraft3D
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
deepseek-ai/DeepSeek-Math
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
deepseek-ai/DualPipe
A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
deepseek-ai/DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
deepseek-ai/EPLB
Expert Parallelism Load Balancer
deepseek-ai/DeepSeek-Prover-V2
deepseek-ai/profile-data
Analyze computation-communication overlap in V3/R1.
deepseek-ai/awesome-deepseek-coder
A curated list of open-source projects related to DeepSeek Coder
deepseek-ai/ESFT
Expert Specialized Fine-Tuning
deepseek-ai/DeepSeek-Prover-V1.5