DeepSeek
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference
DeepSeek-OCR 2: Visual Causal Flow
DeepSeek-Prover-Series
-
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 7.59k • 682 -
deepseek-ai/DeepSeek-Coder-V2-Base
Text Generation • 236B • Updated • 199 • 81 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Base
Text Generation • 16B • Updated • 2.31k • 105 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
Text Generation • 16B • Updated • 276k • • 565
DeepSeek-VL model series
DeepSeek LLM series
DeepSeek MoE series
-
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation • Updated • 88.8k • • 972 -
deepseek-ai/DeepSeek-V3.2-Exp-Base
Text Generation • 685B • Updated • 247 • 66 -
deepseek-ai/DeepSeek-V3.2
Text Generation • 685B • Updated • 293k • • 1.33k -
deepseek-ai/DeepSeek-V3.2-Speciale
Text Generation • Updated • 15.1k • 688
-
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated • 1.65M • • 13.1k -
deepseek-ai/DeepSeek-R1-Zero
Text Generation • Updated • 5.2k • 945 -
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • Updated • 98.7k • • 756 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Text Generation • 33B • Updated • 1,000k • • 1.53k
DeepSeek Math series
Janus is a novel autoregressive framework that unifies multimodal understanding and generation.
-
deepseek-ai/DeepSeek-V2-Chat-0628
Text Generation • 236B • Updated • 3.45k • 177 -
deepseek-ai/DeepSeek-V2-Chat
Text Generation • 236B • Updated • 13.9k • 461 -
deepseek-ai/DeepSeek-V2
Text Generation • 236B • Updated • 12.8k • 333 -
deepseek-ai/DeepSeek-V2-Lite
Text Generation • 16B • Updated • 246k • 168
models for paper expert-specialized fine-tuning
DeepSeek Coder series
-
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 14.1k • 566 -
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation • 7B • Updated • 118k • 485 -
deepseek-ai/deepseek-coder-7b-instruct-v1.5
Text Generation • 7B • Updated • 9.98k • 146 -
deepseek-ai/deepseek-coder-1.3b-instruct
Text Generation • Updated • 89.8k • 159
-
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation • Updated • 88.8k • • 972 -
deepseek-ai/DeepSeek-V3.2-Exp-Base
Text Generation • 685B • Updated • 247 • 66 -
deepseek-ai/DeepSeek-V3.2
Text Generation • 685B • Updated • 293k • • 1.33k -
deepseek-ai/DeepSeek-V3.2-Speciale
Text Generation • Updated • 15.1k • 688
-
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated • 1.65M • • 13.1k -
deepseek-ai/DeepSeek-R1-Zero
Text Generation • Updated • 5.2k • 945 -
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • Updated • 98.7k • • 756 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Text Generation • 33B • Updated • 1,000k • • 1.53k
DeepSeek Math series
Janus is a novel autoregressive framework that unifies multimodal understanding and generation.
DeepSeek-Prover-Series
-
deepseek-ai/DeepSeek-V2-Chat-0628
Text Generation • 236B • Updated • 3.45k • 177 -
deepseek-ai/DeepSeek-V2-Chat
Text Generation • 236B • Updated • 13.9k • 461 -
deepseek-ai/DeepSeek-V2
Text Generation • 236B • Updated • 12.8k • 333 -
deepseek-ai/DeepSeek-V2-Lite
Text Generation • 16B • Updated • 246k • 168
-
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 7.59k • 682 -
deepseek-ai/DeepSeek-Coder-V2-Base
Text Generation • 236B • Updated • 199 • 81 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Base
Text Generation • 16B • Updated • 2.31k • 105 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
Text Generation • 16B • Updated • 276k • • 565
models for paper expert-specialized fine-tuning
DeepSeek-VL model series
DeepSeek Coder series
-
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 14.1k • 566 -
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation • 7B • Updated • 118k • 485 -
deepseek-ai/deepseek-coder-7b-instruct-v1.5
Text Generation • 7B • Updated • 9.98k • 146 -
deepseek-ai/deepseek-coder-1.3b-instruct
Text Generation • Updated • 89.8k • 159
DeepSeek LLM series
DeepSeek MoE series