Submitted by axxkaya 75 Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization · 9 authors 2
Submitted by Eric3200 46 On the Compositional Generalization of Multimodal LLMs for Medical Imaging · 9 authors 4
Submitted by akhaliq 39 Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs · 14 authors 2
Submitted by akhaliq 24 TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization · 7 authors 4
Submitted by Jiayi-Pan 22 Training Software Engineering Agents and Verifiers with SWE-Gym · 7 authors 2
Submitted by HyunsooCha 18 PERSE: Personalized 3D Generative Avatars from A Single Portrait · 3 authors 3
Submitted by Ningyu 18 OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System · 13 authors 2
Submitted by RefalMachine 17 Facilitating large language model Russian adaptation with Learned Embedding Propagation · 2 authors 2
Submitted by zjy2001 14 HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation · 4 authors 3