rene-contango
/

test-model-output

Generated from Trainer

Model card Files Files and versions

rene-contango commited on 20 days ago

Commit

9a696f8

·

verified ·

1 Parent(s): a821d05

End of training

Files changed (2) hide show

README.md +4 -4
adapter_model.bin +1 -1

README.md CHANGED Viewed

@@ -97,7 +97,7 @@ xformers_attention: null
 This model is a fine-tuned version of [Qwen/Qwen2-0.5B](https://huggingface.co/Qwen/Qwen2-0.5B) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.4922
 ## Model description
@@ -135,9 +135,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 1.5645        | 0.0336 | 1    | 1.5611          |
-| 1.5509        | 0.1008 | 3    | 1.5592          |
-| 1.5323        | 0.2017 | 6    | 1.5367          |
-| 1.2915        | 0.3025 | 9    | 1.4922          |
 ### Framework versions

 This model is a fine-tuned version of [Qwen/Qwen2-0.5B](https://huggingface.co/Qwen/Qwen2-0.5B) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.4914
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 1.5645        | 0.0336 | 1    | 1.5611          |
+| 1.551         | 0.1008 | 3    | 1.5586          |
+| 1.5317        | 0.2017 | 6    | 1.5365          |
+| 1.2912        | 0.3025 | 9    | 1.4914          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2dabe9b7ef29317705bef452aa34677283bffa38637d9534a8b1c669ac1cfe3b
 size 17717130

 version https://git-lfs.github.com/spec/v1
+oid sha256:8537e087e8d503fe8518defa67ae4993c01b6471cd69f7d10b4a3abc649b49fb
 size 17717130