20250902 - a ShiqiangWoo Collection

ShiqiangWoo 's Collections

AI-generaed code

EO

20250902

updated 3 days ago

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Paper • 2508.21104 • Published 9 days ago • 27
T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables

Paper • 2508.19813 • Published 10 days ago • 20
No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes

Paper • 2508.19060 • Published 11 days ago • 8
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents

Paper • 2508.17198 • Published 13 days ago • 6
How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench

Paper • 2508.20931 • Published 9 days ago • 15
Democracy-in-Silico: Institutional Design as Alignment in AI-Governed Polities

Paper • 2508.19562 • Published 10 days ago • 2