a's picture

7

a

Wws0512

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

upvoted a paper about 2 months ago

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks

upvoted a paper 2 months ago

Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published Nov 17 • 42

upvoted a paper about 2 months ago

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks

Paper • 2510.25760 • Published Oct 29 • 16

upvoted 2 papers 2 months ago

Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1

Paper • 2510.19600 • Published Oct 22 • 68

AI for Service: Proactive Assistance with AI Glasses

Paper • 2510.14359 • Published Oct 16 • 74

upvoted 3 papers 3 months ago

Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods

Paper • 2510.07143 • Published Oct 8 • 12

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Paper • 2510.00515 • Published Oct 1 • 39

PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era

Paper • 2509.12989 • Published Sep 16 • 28