2 31 9

Beichen Zhang

BeichenZhang

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

upvoted a paper about 1 month ago

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

upvoted a paper about 1 month ago

Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models

View all activity

Organizations

None yet

upvoted a paper 11 days ago

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Paper • 2508.20096 • Published 11 days ago • 35

upvoted 2 papers about 1 month ago

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

Paper • 2508.04700 • Published Aug 6 • 51

Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models

Paper • 2508.00819 • Published Aug 1 • 62

upvoted 2 papers about 2 months ago

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction

Paper • 2507.15852 • Published Jul 21 • 38

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

Paper • 2507.07984 • Published Jul 10 • 41

upvoted a paper 2 months ago

ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing

Paper • 2506.19848 • Published Jun 24 • 26

liked a model 3 months ago

zer0int/LongCLIP-GmP-ViT-L-14

Zero-Shot Image Classification • 0.4B • Updated Jul 16 • 39.4k • 76

upvoted a paper 3 months ago

Video World Models with Long-term Spatial Memory

Paper • 2506.05284 • Published Jun 5 • 53

upvoted a paper 4 months ago

Visual Agentic Reinforcement Fine-Tuning

Paper • 2505.14246 • Published May 20 • 32

authored 2 papers 4 months ago

Long-CLIP: Unlocking the Long-Text Capability of CLIP

Paper • 2403.15378 • Published Mar 22, 2024 • 4

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 287

upvoted a paper 4 months ago

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published May 6 • 94

upvoted 2 papers 5 months ago

MM-IFEngine: Towards Multimodal Instruction Following

Paper • 2504.07957 • Published Apr 10 • 34

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published Apr 10 • 49

upvoted a paper 6 months ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7 • 124

authored a paper 6 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

upvoted a paper 6 months ago

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published Feb 25 • 75

upvoted 3 papers 7 months ago

Beichen Zhang

AI & ML interests

Recent Activity

Organizations

BeichenZhang's activity