Improve model card: Add pipeline tag, base model, code, and project page links
Browse filesThis PR improves the model card by:
- Adding the `pipeline_tag: text-generation` to the metadata for better discoverability on the Hugging Face Hub.
- Updating the `base_model` in the metadata to `Qwen-Qwen2.5-1.5B-Instruct` as indicated in the merge details.
- Adding a link to the associated GitHub repository: https://github.com/rohitgandikota/erasing-llm
- Adding a link to the project page: https://elm.baulab.info
- Restructuring the paper link to be more prominent at the top of the model card.
README.md
CHANGED
@@ -1,14 +1,20 @@
|
|
1 |
---
|
2 |
-
base_model:
|
|
|
3 |
library_name: transformers
|
4 |
tags:
|
5 |
- mergekit
|
6 |
- merge
|
7 |
-
|
8 |
---
|
|
|
9 |
# qwen25-instruct-with-gsm8k-reasoning
|
10 |
|
11 |
-
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
|
|
|
|
|
|
|
|
12 |
|
13 |
## Merge Details
|
14 |
### Merge Method
|
@@ -36,6 +42,4 @@ models:
|
|
36 |
weight: -1.0
|
37 |
merge_method: task_arithmetic
|
38 |
dtype: bfloat16
|
39 |
-
```
|
40 |
-
|
41 |
-
paper : https://arxiv.org/abs/2509.01363
|
|
|
1 |
---
|
2 |
+
base_model:
|
3 |
+
- Qwen-Qwen2.5-1.5B-Instruct
|
4 |
library_name: transformers
|
5 |
tags:
|
6 |
- mergekit
|
7 |
- merge
|
8 |
+
pipeline_tag: text-generation
|
9 |
---
|
10 |
+
|
11 |
# qwen25-instruct-with-gsm8k-reasoning
|
12 |
|
13 |
+
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit) as described in the paper "Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic".
|
14 |
+
|
15 |
+
- Paper: [Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic](https://arxiv.org/abs/2509.01363)
|
16 |
+
- Code: [https://github.com/rohitgandikota/erasing-llm](https://github.com/rohitgandikota/erasing-llm)
|
17 |
+
- Project page: [https://elm.baulab.info](https://elm.baulab.info)
|
18 |
|
19 |
## Merge Details
|
20 |
### Merge Method
|
|
|
42 |
weight: -1.0
|
43 |
merge_method: task_arithmetic
|
44 |
dtype: bfloat16
|
45 |
+
```
|
|
|
|