End of training

Files changed (5)

README.md CHANGED

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [kiranpantha/whisper-large-v3-nepali](https://huggingface.co/kiranpantha/whisper-large-v3-nepali) on the OpenSLR54 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1471
+- Loss: 0.1504
 ## Model description
@@ -53,9 +53,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.4836        | 1.0   | 28   | 0.1722          |
-| 0.114         | 2.0   | 56   | 0.1454          |
-| 0.0737        | 3.0   | 84   | 0.1471          |
+| 0.4858        | 1.0   | 28   | 0.1794          |
+| 0.1158        | 2.0   | 56   | 0.1456          |
+| 0.0711        | 3.0   | 84   | 0.1504          |
 ### Framework versions
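
In the updated table, the reported evaluation loss (0.1504) matches the last row (step 84), while the lowest validation loss is at step 56. The sketch below picks that row out of the table; the rows are copied from the new side of the diff, and the helper name `best_checkpoint` is hypothetical, not part of the training script.

```python
# Rows copied from the updated README table:
# (training loss, epoch, step, validation loss)
rows = [
    (0.4858, 1.0, 28, 0.1794),
    (0.1158, 2.0, 56, 0.1456),
    (0.0711, 3.0, 84, 0.1504),
]


def best_checkpoint(rows):
    """Return the row with the lowest validation loss (hypothetical helper)."""
    return min(rows, key=lambda row: row[3])


best = best_checkpoint(rows)
final = rows[-1]
print(f"lowest validation loss: {best[3]} at step {best[2]}")
print(f"final validation loss:  {final[3]} at step {final[2]}")
```

Whether the step-56 weights were saved is not shown in this commit, so this only illustrates how to read the table.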

adapter_config.json CHANGED

@@ -26,8 +26,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "k_proj",
-    "q_proj"
+    "q_proj",
+    "k_proj"
   ],
   "task_type": null,
   "use_dora": true,
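
The only change in this hunk is the order of the two `target_modules` entries. A minimal sketch of an order-insensitive comparison follows; the JSON fragments contain only fields visible in the hunk, the helper name `same_targets` is hypothetical, and treating the list as unordered is an assumption that this diff does not confirm.

```python
import json

# Fragments rebuilt from the hunk; the real file has more fields than shown here.
old_fragment = json.loads('{"target_modules": ["k_proj", "q_proj"], "use_dora": true}')
new_fragment = json.loads('{"target_modules": ["q_proj", "k_proj"], "use_dora": true}')


def same_targets(a, b):
    """Compare target_modules as sets, ignoring list order (hypothetical helper)."""
    return set(a["target_modules"]) == set(b["target_modules"])


print(same_targets(old_fragment, new_fragment))  # True
```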

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3eb03281fd69f01b9da21ce080e463e2ea7cf6f1c855ff57e679949db5240509
+oid sha256:c30b53053f903a98447c3c9ad72452733818637894d3575785ddfa3cd62c0372
 size 32523696
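
The pointer's `oid` changed while `size` stayed at 32523696, so the adapter weights differ but the file has the same byte length. Below is a minimal sketch of checking a downloaded file against such a pointer, assuming the three-line `key value` layout shown in the diff; the function names `parse_lfs_pointer` and `matches_pointer` are hypothetical.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and SHA-256 digest."""
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as handle:
        # Read in chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:c30b53053f903a98447c3c9ad72452733818637894d3575785ddfa3cd62c0372\n"
    "size 32523696\n"
)
print(pointer["algo"], pointer["size"])
```

With a local copy of the file, `matches_pointer("adapter_model.safetensors", pointer)` would report whether the download corresponds to this commit.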

runs/Feb07_11-56-49_idc-training-gpu-compute-03/events.out.tfevents.1738929410.idc-training-gpu-compute-03.144735.11 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:a192421d2153b7a7afe4ff6f45b8a98dee6e9d519cb8ed7d49f3690b7dbca6cb
+size 7880
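
The event file name appears to embed a Unix timestamp (`1738929410`) after `tfevents.`. The sketch below decodes it under that assumption; the helper name `event_file_time` is hypothetical. The result is one second after the `Feb07_11-56-49` in the run directory name, which is consistent with a host clock set to UTC, although the diff does not state the time zone.

```python
from datetime import datetime, timezone

name = "events.out.tfevents.1738929410.idc-training-gpu-compute-03.144735.11"


def event_file_time(filename):
    """Decode the field after 'tfevents.' as Unix seconds (assumed layout)."""
    stamp = filename.split("tfevents.", 1)[1].split(".", 1)[0]
    return datetime.fromtimestamp(int(stamp), tz=timezone.utc)


print(event_file_time(name).isoformat())  # 2025-02-07T11:56:50+00:00
```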

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:185646876ba24c61e7c35f1906643f8aa1a31d4df4c6993dcbe7e2a0f5425c7c
+oid sha256:9a8c7d5e707ccca7846d52f5b61240cf40c7288c9f04d9d0633fe8e3aee7a8f6
 size 5624