--- base_model: unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit tags: - text-generation-inference - transformers - unsloth - llama - trl - prashasst license: apache-2.0 language: - en datasets: - FreedomIntelligence/medical-o1-reasoning-SFT pipeline_tag: text-generation library_name: peft --- # Uploaded model - **Developed by:** Prashasst - **License:** apache-2.0 - **Finetuned from model :** unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit