---
license: mit
tags:
- unsloth
- trl
- sft
language:
- zh
base_model:
- deepseek-ai/DeepSeek-R1-Distill-Llama-8B
---