Improve model card: Add pipeline tag, library, links, and usage example

#1
by nielsr HF Staff - opened

This PR improves the model card for PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning by:

  • Adding the pipeline_tag: image-text-to-text, which accurately categorizes the model as a multimodal large language model capable of processing images and text to generate text. This improves its discoverability on the Hugging Face Hub (e.g., at https://huggingface.co/models?pipeline_tag=image-text-to-text).
  • Specifying library_name: transformers, enabling the convenient "Use in Transformers" widget directly on the model page and providing standard loading instructions for users.
  • Updating the paper link to the official Hugging Face Papers page (https://huggingface.co/papers/2507.06448) for better integration within the Hub's ecosystem.
  • Including direct links to the project page (https://mikewangwzhl.github.io/PAPO/) and the GitHub repository (https://github.com/mikewangwzhl/PAPO) for users to find more context and the source code.
  • Adding a practical Python code snippet for inference using the transformers library, allowing users to quickly get started with the model.
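
For reference, the two metadata changes listed above would typically appear in the YAML front matter at the top of the model card's README.md. This is a minimal sketch showing only the two keys named in this PR; any other fields the actual card carries (license, base model, tags, and so on) are not stated here and are left out rather than guessed.

```yaml
---
# Sketch of the model card front matter; only these two keys are taken from the PR description.
pipeline_tag: image-text-to-text   # lists the model under the image-text-to-text filter on the Hub
library_name: transformers         # marks the model as loadable with the transformers library
---
```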

These updates aim to provide a more comprehensive, user-friendly, and discoverable model card.

PAPOGalaxy changed pull request status to merged
