Update README.md
README.md
CHANGED
@@ -10,10 +10,14 @@ tags:
 licence: license
 ---
 
+
+[<img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/axolotl-ai-cloud/axolotl)
+
+
 # Model Card for qwen2-3b-instruct-code-grpo-v4
 
 This model is a fine-tuned version of [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct).
-It has been trained using [TRL](https://github.com/huggingface/trl).
+It has been trained using [TRL](https://github.com/huggingface/trl) and the [grpo_code](https://github.com/axolotl-ai-cloud/grpo_code) repository.
 
 ## Quick start
 