Submitted by akhaliq 43 AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation · 7 authors 1
Submitted by akhaliq 31 Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models · 5 authors 2
Submitted by akhaliq 24 PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation · 8 authors 1
Submitted by akhaliq 12 LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency · 5 authors 1