PDE-Controller: LLMs for Autoformalization and Reasoning of PDEs Paper • 2502.00963 • Published 21 days ago • 16
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 11 days ago • 181
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • Updated about 8 hours ago • 1.07M • 1.16k
Revealing the Barriers of Language Agents in Planning Paper • 2410.12409 • Published Oct 16, 2024 • 26
Exploring Model Kinship for Merging Large Language Models Paper • 2410.12613 • Published Oct 16, 2024 • 21
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception Paper • 2410.12628 • Published Oct 16, 2024 • 35
HumanEval-V: Benchmarking High-Level Visual Reasoning with Complex Diagrams in Coding Tasks Paper • 2410.12381 • Published Oct 16, 2024 • 44
Towards Natural Image Matting in the Wild via Real-Scenario Prior Paper • 2410.06593 • Published Oct 9, 2024 • 3
Empirical Study of Mutual Reinforcement Effect and Application in Few-shot Text Classification Tasks via Prompt Paper • 2410.09745 • Published Oct 13, 2024 • 3
GS^3: Efficient Relighting with Triple Gaussian Splatting Paper • 2410.11419 • Published Oct 15, 2024 • 12
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning Paper • 2410.09754 • Published Oct 13, 2024 • 8
Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation Paper • 2410.08001 • Published Oct 10, 2024 • 4
EchoPrime: A Multi-Video View-Informed Vision-Language Model for Comprehensive Echocardiography Interpretation Paper • 2410.09704 • Published Oct 13, 2024 • 13
NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models Paper • 2410.11805 • Published Oct 15, 2024 • 13
SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI Paper • 2410.11096 • Published Oct 14, 2024 • 13