Update pipeline tag and add link to GitHub
This PR updates the pipeline tag to `image-to-text` to better reflect the model's capabilities. The model can be used for image-text retrieval and as a vision encoder for VLMs, in addition to zero-shot image classification.
I also link to the Big Vision GitHub repository, where the model was developed.
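The metadata change can be sanity-checked with a few lines of Python. This is only a minimal sketch: `read_front_matter` is a hypothetical helper (not part of this PR or of any Hugging Face library), it assumes the front matter is delimited by `---` lines, and it only picks up top-level `key: value` scalars, skipping lists such as `tags` and `widget`.

```python
# Hypothetical helper: collects top-level scalar fields from a model card's
# YAML front matter. A full YAML parser would be needed for nested fields.
def read_front_matter(text: str) -> dict[str, str]:
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter block at the top of the file
    fields = {}
    for line in lines[1:]:
        if line.strip() == "---":
            break  # closing delimiter of the front matter
        # Skip list items and indented (nested) entries.
        if line.startswith((" ", "-", "\t")) or ":" not in line:
            continue
        key, _, value = line.partition(":")
        value = value.strip()
        if value:  # keys that only open a list (e.g. "tags:") have no value
            fields[key.strip()] = value
    return fields


# Excerpt of the front matter as it reads after this PR.
NEW_CARD = """---
library_name: transformers
license: apache-2.0
pipeline_tag: image-to-text
tags:
- vision
---

# SigLIP 2 So400m
"""

metadata = read_front_matter(NEW_CARD)
print(metadata["pipeline_tag"])
```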
README.md
CHANGED
@@ -1,22 +1,23 @@
 ---
+library_name: transformers
 license: apache-2.0
+pipeline_tag: image-to-text
 tags:
 - vision
 widget:
-
-
-
-  example_title: Bee
-library_name: transformers
-pipeline_tag: zero-shot-image-classification
+- src: https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/bee.jpg
+  candidate_labels: bee in the sky, bee on the flower
+  example_title: Bee
 ---

 # SigLIP 2 So400m

-[SigLIP 2](https://
-[SigLIP](https://
+[SigLIP 2](https://arxiv.org/abs/2502.14786) extends the pretraining objective of
+[SigLIP](https://arxiv.org/abs/2303.15343) with prior, independently developed techniques
 into a unified recipe, for improved semantic understanding, localization, and dense features.

+The codebase for SigLIP 2 is available at [Big Vision](https://github.com/google-research/big_vision).
+
 ## Intended uses

 You can use the raw model for tasks like zero-shot image classification and
@@ -99,4 +100,4 @@ Evaluation of SigLIP 2 is shown below (taken from the paper).
 primaryClass={cs.CV},
 url={https://arxiv.org/abs/2502.14786},
 }
-```
+```