Zhoues's picture

Zhoues

Zhoues

·

https://zhoues.github.io/

Zhoues

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control

upvoted a paper 11 days ago

VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space

upvoted a paper 20 days ago

DINOv3

View all activity

Organizations

upvoted a paper 6 days ago

EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control

Paper • 2508.21112 • Published 9 days ago • 72

upvoted a paper 11 days ago

VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space

Paper • 2508.19247 • Published 11 days ago • 39

upvoted 2 papers 20 days ago

DINOv3

Paper • 2508.10104 • Published 24 days ago • 238

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published 23 days ago • 91

upvoted 2 collections about 2 months ago

MineDreamer

Model weights and Dataset • 5 items • Updated 18 days ago • 1

RoboRefer & RefSpatial

RoboRefer weights, RefSpatial Dataset and RefSpatial-Bench • 8 items • Updated 18 days ago • 3

upvoted a paper about 2 months ago

Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data

Paper • 2507.07095 • Published Jul 9 • 54

upvoted 4 papers 2 months ago

RoboBrain 2.0 Technical Report

Paper • 2507.02029 • Published Jul 2 • 30

Spatial Mental Modeling from Limited Views

Paper • 2506.21458 • Published Jun 26 • 13

Use Property-Based Testing to Bridge LLM Code Generation and Validation

Paper • 2506.18315 • Published Jun 23 • 10

AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models

Paper • 2506.19851 • Published Jun 24 • 59

upvoted 5 papers 3 months ago

VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning

Paper • 2506.09049 • Published Jun 10 • 36

Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence

Paper • 2506.15677 • Published Jun 18 • 24

VideoMolmo: Spatio-Temporal Grounding Meets Pointing

Paper • 2506.05336 • Published Jun 5 • 10

RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics

Paper • 2506.04308 • Published Jun 4 • 43

What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?

Paper • 2505.22129 • Published May 28 • 15

upvoted a paper 4 months ago

A Survey of Interactive Generative Video

Paper • 2504.21853 • Published Apr 30 • 47

upvoted a paper 5 months ago

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published Mar 31 • 39

upvoted 2 papers 6 months ago

Position: Interactive Generative Video as Next-Generation Game Engine

Paper • 2503.17359 • Published Mar 21 • 62

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

Paper • 2503.16408 • Published Mar 20 • 41