End of training

Browse files

Files changed (5) hide show

README.md +10 -16
model.safetensors +1 -1
runs/May21_21-23-47_46d9a0ca97b3/events.out.tfevents.1747862636.46d9a0ca97b3.7177.0 +3 -0
runs/May21_21-37-31_46d9a0ca97b3/events.out.tfevents.1747863460.46d9a0ca97b3.10767.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [answerdotai/ModernBERT-large](https://huggingface.co/answerdotai/ModernBERT-large) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7424
-- Accuracy: 0.5687
 ## Model description
@@ -42,27 +42,21 @@ The following hyperparameters were used during training:
 - train_batch_size: 6
 - eval_batch_size: 1
 - seed: 42
-- gradient_accumulation_steps: 2
-- total_train_batch_size: 12
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: constant
-- training_steps: 550
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy |
 |:-------------:|:------:|:----:|:---------------:|:--------:|
-| 1.327         | 0.1497 | 50   | 0.6950          | 0.5640   |
-| 1.5088        | 0.2994 | 100  | 0.6833          | 0.5687   |
-| 1.2018        | 0.4491 | 150  | 0.6867          | 0.5687   |
-| 1.3053        | 0.5988 | 200  | 0.6923          | 0.5687   |
-| 1.2024        | 0.7485 | 250  | 0.6879          | 0.5687   |
-| 1.774         | 0.8982 | 300  | 0.7197          | 0.4313   |
-| 1.4274        | 1.0479 | 350  | 0.6862          | 0.5687   |
-| 1.199         | 1.1976 | 400  | 0.6995          | 0.5687   |
-| 1.4548        | 1.3473 | 450  | 0.7085          | 0.4313   |
-| 1.581         | 1.4970 | 500  | 0.6845          | 0.5687   |
-| 1.8775        | 1.6467 | 550  | 0.7424          | 0.5687   |
 ### Framework versions

 This model is a fine-tuned version of [answerdotai/ModernBERT-large](https://huggingface.co/answerdotai/ModernBERT-large) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.7135
+- Accuracy: 0.6114
 ## Model description
 - train_batch_size: 6
 - eval_batch_size: 1
 - seed: 42
+- gradient_accumulation_steps: 10
+- total_train_batch_size: 60
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: constant
+- training_steps: 100
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy |
 |:-------------:|:------:|:----:|:---------------:|:--------:|
+| 6.6012        | 0.2994 | 20   | 0.6613          | 0.6114   |
+| 5.841         | 0.5988 | 40   | 0.7060          | 0.6114   |
+| 7.3284        | 0.8982 | 60   | 0.6671          | 0.6114   |
+| 6.9088        | 1.2096 | 80   | 0.6861          | 0.4834   |
+| 7.1679        | 1.5090 | 100  | 0.7135          | 0.6114   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:93532a65a301ad51358f5f9a8a09b4b3d9c77b2f56eb0b012f83328dfe4ad879
 size 1583351632

 version https://git-lfs.github.com/spec/v1
+oid sha256:92732eadea9f4b14c0fb4e9fdb011ed695dc6997e34940e9c4ccfb05fa6a401f
 size 1583351632

runs/May21_21-23-47_46d9a0ca97b3/events.out.tfevents.1747862636.46d9a0ca97b3.7177.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8856e34d67c3764449b2333638c5587503d6fa54e4080f0a79e776fb4aa1eb6f
+size 59924

runs/May21_21-37-31_46d9a0ca97b3/events.out.tfevents.1747863460.46d9a0ca97b3.10767.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d1d0d5f24049561e8f4599042c1a31bf3f40ee241cea50d990eda82dc4c8ed54
+size 28359

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ec4e9356ffe35231cc836704851a4c82a8a26682883a5eb2a2e265e2a961ba07
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:58e58db89359895ac052107c3f315961b71117e0b19a4bc5ddbe44107bdb19e9
 size 5304