Update README.md
Browse files
README.md
CHANGED
@@ -14,6 +14,12 @@ tags:
|
|
14 |
|
15 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
16 |
|
|
|
|
|
|
|
|
|
|
|
|
|
17 |
## Merge Details
|
18 |
### Merge Method
|
19 |
|
|
|
14 |
|
15 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
16 |
|
17 |
+
Testing training data validation:
|
18 |
+
|
19 |
+
* Model Stock Loss: 0.451
|
20 |
+
|
21 |
+
My hypothesis that the pretraining was dragging down the stock merge's performance on training data in any way seems inaccurate.
|
22 |
+
|
23 |
## Merge Details
|
24 |
### Merge Method
|
25 |
|