axolotl-ai-co
/

qwen2-3b-instruct-code-grpo

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

Metrics Training metrics Community

smohammadi commited on Apr 1

Commit

4773419

·

verified ·

1 Parent(s): 54c45d0

Update README.md

Files changed (1) hide show

README.md +5 -1

README.md CHANGED Viewed

@@ -10,10 +10,14 @@ tags:
 licence: license
 ---
 # Model Card for qwen2-3b-instruct-code-grpo-v4
 This model is a fine-tuned version of [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct).
-It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start

 licence: license
 ---
+[<img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/axolotl-ai-cloud/axolotl)
 # Model Card for qwen2-3b-instruct-code-grpo-v4
 This model is a fine-tuned version of [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct).
+It has been trained using [TRL](https://github.com/huggingface/trl) and the [grpo_code](https://github.com/axolotl-ai-cloud/grpo_code) repository.
 ## Quick start