Submitted by zsytony 42 HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models · 14 authors 5
Submitted by akhaliq 33 MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling · 4 authors 2
Submitted by yizhilll 28 OmniBench: Towards The Future of Universal Omni-Language Models · 20 authors 2
Submitted by WenhaoWang 18 MonoFormer: One Transformer for Both Diffusion and Autoregression · 8 authors 4
Submitted by mhamilton723 17 Seeing Faces in Things: A Model and Dataset for Pareidolia · 7 authors 2
Submitted by akhaliq 14 Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts · 7 authors 2
Submitted by akhaliq 8 Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation · 10 authors 2