Juncheng Yan's picture

22 7

Juncheng Yan

JonsonYan

·

AI & ML interests

3D computer vision

Recent Activity

upvoted a paper about 2 months ago

Scaling RL to Long Videos

upvoted a paper about 2 months ago

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

upvoted a paper about 2 months ago

SpatialTrackerV2: 3D Point Tracking Made Easy

View all activity

Organizations

None yet

upvoted 4 papers about 2 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10 • 157

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Paper • 2507.06261 • Published Jul 7 • 60

SpatialTrackerV2: 3D Point Tracking Made Easy

Paper • 2507.12462 • Published Jul 16 • 16

π^3: Scalable Permutation-Equivariant Visual Geometry Learning

Paper • 2507.13347 • Published Jul 17 • 64

upvoted a paper 2 months ago

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published Jun 23 • 75

upvoted 3 papers 4 months ago

Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published May 2 • 42

In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer

Paper • 2504.20690 • Published Apr 29 • 19

TesserAct: Learning 4D Embodied World Models

Paper • 2504.20995 • Published Apr 29 • 21

upvoted 5 papers 5 months ago

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published Apr 11 • 130

VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

Paper • 2504.07960 • Published Apr 10 • 50

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7 • 110

OmniCaptioner: One Captioner to Rule Them All

Paper • 2504.07089 • Published Apr 9 • 20

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

Paper • 2503.24379 • Published Mar 31 • 77

upvoted a paper 6 months ago

Unified Video Action Model

Paper • 2503.00200 • Published Feb 28 • 14

upvoted 2 papers 7 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 418

SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

Paper • 2501.18427 • Published Jan 30 • 21

upvoted 4 papers 8 months ago

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Paper • 2501.09755 • Published Jan 16 • 37

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published Jan 16 • 72

ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning

Paper • 2501.06590 • Published Jan 11 • 11

VideoAuteur: Towards Long Narrative Video Generation

Paper • 2501.06173 • Published Jan 10 • 34