vishal2304
/

whisper-small-ta

@@ -11,7 +11,7 @@ datasets:
 metrics:
 - wer
 model-index:
-- name: Whisper Small Ta - Vishal Sankar Ram
   results:
   - task:
       name: Automatic Speech Recognition
@@ -20,23 +20,23 @@ model-index:
       name: Common Voice 17.0
       type: mozilla-foundation/common_voice_11_0
       config: ta
-      split: None
       args: 'config: en, split: test'
     metrics:
     - name: Wer
       type: wer
-      value: 75.03805175038052
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# Whisper Small Ta - Vishal Sankar Ram
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 17.0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5508
-- Wer: 75.0381
 ## Model description
@@ -56,25 +56,29 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 1
-- eval_batch_size: 1
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 5
-- training_steps: 20
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer     |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|
-| No log        | 0.02  | 10   | 0.5789          | 78.8432 |
-| No log        | 0.04  | 20   | 0.5508          | 75.0381 |
 ### Framework versions
 - Transformers 4.49.0
-- Pytorch 2.6.0
 - Datasets 3.3.2
 - Tokenizers 0.21.0

 metrics:
 - wer
 model-index:
+- name: Whisper Small En - Vishal Sankar Ram
   results:
   - task:
       name: Automatic Speech Recognition
       name: Common Voice 17.0
       type: mozilla-foundation/common_voice_11_0
       config: ta
+      split: test
       args: 'config: en, split: test'
     metrics:
     - name: Wer
       type: wer
+      value: 67.42770167427702
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# Whisper Small En - Vishal Sankar Ram
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 17.0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4155
+- Wer: 67.4277
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 5
+- training_steps: 50
+- mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer     |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|
+| No log        | 0.08  | 10   | 0.5279          | 73.3638 |
+| No log        | 0.16  | 20   | 0.4622          | 70.7763 |
+| 0.45          | 0.24  | 30   | 0.4298          | 69.2542 |
+| 0.45          | 0.32  | 40   | 0.4193          | 67.1233 |
+| 0.334         | 0.4   | 50   | 0.4155          | 67.4277 |
 ### Framework versions
 - Transformers 4.49.0
+- Pytorch 2.5.1+cu124
 - Datasets 3.3.2
 - Tokenizers 0.21.0

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0ea761fdb2e877f6bdb3c08bb8c77de60087c9b241318d3e3d04fc8213d875bf
 size 966995080

 version https://git-lfs.github.com/spec/v1
+oid sha256:a8bf41943e5b18ccf3a6ac6f73416dfca2052797db1ac872aaab7ed373d177d5
 size 966995080