RoboGround: Robotic Manipulation with Grounded Vision-Language Priors Paper • 2504.21530 • Published Apr 30
CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation Paper • 2506.19816 • Published Jun 24
GenManip: LLM-driven Simulation for Generalizable Instruction-Following Manipulation Paper • 2506.10966 • Published Jun 12
MM-ACT: Learn from Multimodal Parallel Generation to Act Paper • 2512.00975 • Published Dec 2025 • 6
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning Paper • 2512.05111 • Published Dec 2025 • 45
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy Paper • 2510.13778 • Published Oct 15 • 16
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models Paper • 2510.11341 • Published Oct 13 • 34
MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use Paper • 2509.24002 • Published Sep 28 • 173
F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions Paper • 2509.06951 • Published Sep 8 • 32
The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner Paper • 2507.13332 • Published Jul 17 • 48
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2 • 147
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning Paper • 2503.07365 • Published Mar 10 • 61