DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks Paper • 2509.01396 • Published 5 days ago • 44
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks Paper • 2509.01396 • Published 5 days ago • 44
Mimicking the Physicist's Eye:A VLM-centric Approach for Physics Formula Discovery Paper • 2508.17380 • Published 13 days ago • 4
PhysUniBench: An Undergraduate-Level Physics Reasoning Benchmark for Multimodal Models Paper • 2506.17667 • Published Jun 21 • 3
Examining the Source of Defects from a Mechanical Perspective for 3D Anomaly Detection Paper • 2505.05901 • Published May 9 • 1
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers Paper • 2508.21148 • Published 9 days ago • 129
Mimicking the Physicist's Eye:A VLM-centric Approach for Physics Formula Discovery Paper • 2508.17380 • Published 13 days ago • 4
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers Paper • 2508.21148 • Published 9 days ago • 129
PhysUniBench: An Undergraduate-Level Physics Reasoning Benchmark for Multimodal Models Paper • 2506.17667 • Published Jun 21 • 3
Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning Paper • 2506.10521 • Published Jun 12 • 74