UCSC-VLAA/openvision2-vit-large-patch14-336-vision-only · Add initial model card for OpenVision 2 with metadata and links

This pull request adds a comprehensive initial model card for the OpenVision 2 model to its Hugging Face repository. This update significantly improves the documentation and discoverability of the model by:

Adding the pipeline_tag: image-text-to-text to the metadata, which helps users discover the model at https://huggingface.co/models?pipeline_tag=image-text-to-text.
Including library_name: open_clip in the metadata, based on the open_clip_config.json file, which indicates compatibility with the open_clip library and allows for automated usage snippets.
Linking directly to the official paper: OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning.
Adding a link to the official project page: https://ucsc-vlaa.github.io/OpenVision2/.
Including a link to the GitHub repository for the code: https://github.com/UCSC-VLAA/OpenVision/blob/main/src/main_openvision2.py.
Providing the paper's abstract, model details, and a citation guide in the markdown content.