Solaris: Building a Multiplayer Video World Model in Minecraft Paper • 2602.22208 • Published 4 days ago • 24
Understanding vs. Generation: Navigating Optimization Dilemma in Multimodal Models Paper • 2602.15772 • Published 12 days ago • 6
jina-embeddings-v5-text: Task-Targeted Embedding Distillation Paper • 2602.15547 • Published 12 days ago • 24
Revisiting the Platonic Representation Hypothesis: An Aristotelian View Paper • 2602.14486 • Published 14 days ago • 11
Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook Paper • 2602.14299 • Published 14 days ago • 26
Quantifying the Gap between Understanding and Generation within Unified Multimodal Models Paper • 2602.02140 • Published 27 days ago • 12
Quantifying the Gap between Understanding and Generation within Unified Multimodal Models Paper • 2602.02140 • Published 27 days ago • 12
TSRBench: A Comprehensive Multi-task Multi-modal Time Series Reasoning Benchmark for Generalist Models Paper • 2601.18744 • Published Jan 26 • 10
Pretraining Frame Preservation in Autoregressive Video Memory Compression Paper • 2512.23851 • Published Dec 29, 2025 • 25