Add initial model card for OpenVision 2 with metadata and links
#1
by
nielsr
HF Staff
- opened
This pull request adds a comprehensive initial model card for the OpenVision 2 model to its Hugging Face repository. This update significantly improves the documentation and discoverability of the model by:
- Adding the
pipeline_tag: image-text-to-text
to the metadata, which helps users discover the model at https://huggingface.co/models?pipeline_tag=image-text-to-text. - Including
library_name: open_clip
in the metadata, based on theopen_clip_config.json
file, which indicates compatibility with theopen_clip
library and allows for automated usage snippets. - Linking directly to the official paper: OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning.
- Adding a link to the official project page: https://ucsc-vlaa.github.io/OpenVision2/.
- Including a link to the GitHub repository for the code: https://github.com/UCSC-VLAA/OpenVision/blob/main/src/main_openvision2.py.
- Providing the paper's abstract, model details, and a citation guide in the markdown content.