chawdoe

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding

Paper • 2508.20478 • Published 9 days ago • 15

liked a dataset 9 days ago

dorni/SpeakerVid-5M-Dataset

Updated Aug 4 • 813 • 8

upvoted a paper 10 days ago

VibeVoice Technical Report

Paper • 2508.19205 • Published 11 days ago • 120

liked a model 11 days ago

microsoft/VibeVoice-1.5B

Text-to-Speech • 3B • Updated 5 days ago • 218k • 1.51k

upvoted a paper 17 days ago

Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer

Paper • 2508.09131 • Published 25 days ago • 16

upvoted a paper 23 days ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published 23 days ago • 141

upvoted 2 papers about 2 months ago

SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation

Paper • 2507.09862 • Published Jul 14 • 49

Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning

Paper • 2507.05255 • Published Jul 7 • 73

upvoted a paper 7 months ago

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published Feb 14 • 56

upvoted a paper 8 months ago

Taming Teacher Forcing for Masked Autoregressive Video Generation

Paper • 2501.12389 • Published Jan 21 • 10