SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards Paper • 2511.07403 • Published Nov 10, 2025 • 14
Olympus: A Universal Task Router for Computer Vision Tasks Paper • 2412.09612 • Published Dec 12, 2024 • 4
IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation Paper • 2506.03150 • Published Jun 3, 2025 • 21