Ashkchamp/blip-finetuned-kag100

Image-Text-to-Text

Generated from Trainer


Ashkchamp committed 22 days ago

Commit d791ad4 · verified · 1 Parent(s): cab5988

Ashkchamp/Pokemon-Image-Captioning

Files changed (3)

README.md +12 -8
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Salesforce/blip-image-captioning-base](https://huggingface.co/Salesforce/blip-image-captioning-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7570
 ## Model description
@@ -35,11 +35,11 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 3e-06
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
-- gradient_accumulation_steps: 2
 - total_train_batch_size: 16
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
@@ -51,9 +51,13 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| No log        | 2.1277 | 100  | 0.8710          |
-| No log        | 4.2553 | 200  | 0.7686          |
-| No log        | 6.3830 | 300  | 0.7570          |
 ### Framework versions

 This model is a fine-tuned version of [Salesforce/blip-image-captioning-base](https://huggingface.co/Salesforce/blip-image-captioning-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.7098
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 2e-06
+- train_batch_size: 2
+- eval_batch_size: 2
 - seed: 42
+- gradient_accumulation_steps: 8
 - total_train_batch_size: 16
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 1.3371        | 1.1130 | 50   | 1.0989          |
+| 0.9289        | 2.2260 | 100  | 0.8548          |
+| 0.7662        | 3.3390 | 150  | 0.8150          |
+| 0.677         | 4.4520 | 200  | 0.7596          |
+| 0.6306        | 5.5650 | 250  | 0.7650          |
+| 0.61          | 6.6780 | 300  | 0.7456          |
+| 0.5978        | 7.7910 | 350  | 0.7098          |
 ### Framework versions
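
The README diff above changes the per-device batch size and the gradient accumulation steps while leaving `total_train_batch_size` at 16. A minimal sketch of that arithmetic follows, together with a lookup of the lowest validation loss in the new results table. The helper name `effective_batch_size` and the single-device default are assumptions for illustration; the model card does not state how many devices were used.

```python
# Hyperparameters copied from the README diff (old commit vs. new commit).
runs = {
    "before": {"train_batch_size": 8, "gradient_accumulation_steps": 2},
    "after": {"train_batch_size": 2, "gradient_accumulation_steps": 8},
}


def effective_batch_size(per_device_batch, accumulation_steps, num_devices=1):
    """Number of samples that contribute to one optimizer step.

    num_devices=1 is an assumption; the model card does not give the device count.
    """
    return per_device_batch * accumulation_steps * num_devices


for name, hp in runs.items():
    total = effective_batch_size(
        hp["train_batch_size"], hp["gradient_accumulation_steps"]
    )
    print(f"{name}: total_train_batch_size = {total}")

# (step, validation loss) pairs from the new commit's results table.
eval_log = [
    (50, 1.0989),
    (100, 0.8548),
    (150, 0.8150),
    (200, 0.7596),
    (250, 0.7650),
    (300, 0.7456),
    (350, 0.7098),
]

# The row with the smallest validation loss.
best_step, best_loss = min(eval_log, key=lambda row: row[1])
print(f"lowest validation loss: {best_loss} at step {best_step}")
```

Both configurations give the same effective batch size, so the optimizer sees the same number of samples per update. The smaller per-device batch mainly trades memory for more forward and backward passes per step.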

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f5f1303175bff20588318a29349ff3c4cddee46d3b10593302ec2dbee566e1dc
 size 989717056

 version https://git-lfs.github.com/spec/v1
+oid sha256:dc68880c85c895a02953574489fd546430f7ef655a60d15a35604a6e8d5f14ba
 size 989717056

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4139f1f28e64b59df96ede484c116447fef86749be869e6caab59f4fa92520af
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:1112357f696fc8205962d0b760cd0f98dabebe4188e1cf3c266ba71c35e27c7a
 size 5304
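
The `model.safetensors` and `training_args.bin` entries above are Git LFS pointer files: three `key value` lines giving the spec version, the SHA-256 object ID, and the size in bytes. The sketch below parses such a pointer and shows one way to check a downloaded file against it. The helper names `parse_lfs_pointer` and `sha256_of_file` are hypothetical and not part of any library.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def sha256_of_file(path, chunk_size=1 << 20):
    """Stream a local file through SHA-256 so large weights need not fit in memory."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


# Pointer contents for model.safetensors, copied from the diff (old and new commit).
old = parse_lfs_pointer("""version https://git-lfs.github.com/spec/v1
oid sha256:f5f1303175bff20588318a29349ff3c4cddee46d3b10593302ec2dbee566e1dc
size 989717056""")
new = parse_lfs_pointer("""version https://git-lfs.github.com/spec/v1
oid sha256:dc68880c85c895a02953574489fd546430f7ef655a60d15a35604a6e8d5f14ba
size 989717056""")

print("same size:", old["size"] == new["size"])
print("content changed:", old["digest"] != new["digest"])
```

An unchanged size with a different digest is consistent with the same tensor shapes and dtypes holding new values, which is what one would expect when the same base model is fine-tuned again. To check a local download, compare `sha256_of_file(path)` with `new["digest"]`.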