view article Article Vision Language Model Alignment in TRL ⚡️ By sergiopaniego and 4 others • 14 days ago • 69
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated 6 days ago • 198
Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning Paper • 2507.14137 • Published Jul 18 • 33
PS3: Scaling Vision Pre-Training to 4K Resolution Collection Enabling 4k resolution for VLMs, CVPR 2025, https://nvlabs.github.io/PS3/ • 14 items • Updated 7 days ago • 4
view article Article Fine-Tuning SigLIP 2 for Single Label Image Classification By prithivMLmods • Mar 5 • 16