Update README.md
README.md CHANGED
@@ -30,12 +30,12 @@ tags:
</p>


-
## 📖 Introduction

**UniPic2-Metaquery-9B** is a unified multimodal model built on Qwen2.5-VL-Instruct and SD3.5-Medium. It delivers end-to-end image understanding, text-to-image (T2I) generation, and image editing, and runs smoothly on a single 16 GB consumer GPU.
<div align="center"> <img src="teaser.png" alt="Model Teaser" width="720"> </div>

+
## 📊 Benchmarks

**UniPic2-Metaquery-9B** w/o GRPO achieves competitive results across a variety of vision-language tasks:
@@ -47,7 +47,7 @@ tags:
| ✂️ **GEditBench-EN** | 6.90 |
| 🧪 **ImgEdit-Bench** | 4.10 |

-
+---

## 🧠 Usage

@@ -203,10 +203,11 @@ edited_image = pipeline(
edited_image.save("image_editing.png")
```

-## 📄 License

+## 📄 License
This model is released under the MIT License.

+
## Citation
If you use Skywork-UniPic in your research, please cite:
```
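For reference, the snippet the last hunk touches is the README's image-editing example. Below is a minimal sketch of its call shape; only the `edited_image = pipeline(` and `edited_image.save("image_editing.png")` lines appear in the diff itself, while the loader `load_unipic2_pipeline`, the model path, and the keyword arguments are hypothetical placeholders rather than the README's documented API.

```python
# Minimal sketch of the editing call touched by the last hunk.
# Only the `edited_image = pipeline(...)` / `edited_image.save(...)` shape
# comes from the diff context; everything else is an assumed placeholder.
import torch
from PIL import Image

MODEL_PATH = "UniPic2-Metaquery-9B"  # local path or Hub id; not shown in this diff

# Hypothetical helper standing in for the README's real setup code, which
# builds the pipeline from Qwen2.5-VL-Instruct and SD3.5-Medium components.
pipeline = load_unipic2_pipeline(MODEL_PATH, torch_dtype=torch.bfloat16).to("cuda")

source = Image.open("image.png").convert("RGB")  # input image path is illustrative

# Assumed keyword arguments; the real signature lives in the README's
# Usage section, which this diff does not display.
edited_image = pipeline(
    image=source,
    prompt="Replace the background with a snowy street",
)
edited_image.save("image_editing.png")  # shown verbatim in the diff context
```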