deepseek-ai
/

DeepSeek-R1-Distill-Llama-70B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Resources

View closed (2)

Update README.md

#16 opened 3 days ago by

chnsmth

#15 opened 4 days ago by

Does DeepSeek-Llama-70B support tensor parallelism for multi-GPU inference?

#14 opened 10 days ago by

weight files naming is not regular rule

#13 opened 18 days ago by

How much vram do you need?

#12 opened 21 days ago by

Upload IMG_4815.jpeg

#11 opened 24 days ago by

Amazon Sagemaker deployment failing with CUDA OutOfMemory error

#10 opened 27 days ago by

<thinking> is the proper tag?

#8 opened 27 days ago by

Add pipeline tag

#7 opened about 1 month ago by

Template

#6 opened about 1 month ago by

Lora

#4 opened about 1 month ago by

SFT (Non-RL) distillation is this good on a sub-100B model?

#2 opened about 1 month ago by

Lfg

#1 opened about 1 month ago by