Add model card metadata and links to paper, code and project page
This PR adds a model card with metadata for better discoverability, including the pipeline tag, library name, and license. It also includes links to the paper, code, and project page.
README.md
ADDED
@@ -0,0 +1,10 @@
+---
+license: cc-by-nc-4.0
+pipeline_tag: image-text-to-text
+library_name: transformers
+---
+
+VLAA-Thinker is a vision-language model that takes an image and text as input and outputs text, as described in [SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models](https://huggingface.co/papers/2504.11468).
+
+Project Page: https://ucsc-vlaa.github.io/VLAA-Thinking/
+Code: https://github.com/UCSC-VLAA/VLAA-Thinking
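
Since the card declares `library_name: transformers` and `pipeline_tag: image-text-to-text`, the model should be loadable through the corresponding `transformers` pipeline. Below is a minimal usage sketch based only on that metadata; the checkpoint id `UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-3B` and the image URL are hypothetical placeholders, not taken from this PR.

```python
# Minimal sketch: querying the model via the image-text-to-text pipeline
# declared in the card metadata above.
from transformers import pipeline

pipe = pipeline(
    "image-text-to-text",
    model="UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-3B",  # hypothetical checkpoint id
)

# Chat-style input: one image plus a text prompt, matching the pipeline tag.
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://example.com/sample.jpg"},  # placeholder image
            {"type": "text", "text": "Describe this image."},
        ],
    }
]

result = pipe(text=messages, max_new_tokens=128)
print(result[0]["generated_text"])
```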