tingting gao

TinaGao

AI & ML interests

MLLMs | Diffusion Models | Computer Vision

Recent Activity

authored a paper 3 days ago

DragAnything: Motion Control for Anything using Entity Representation

authored a paper 3 days ago

Learning Multi-dimensional Human Preference for Text-to-Image Generation

authored a paper 3 days ago

CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation

authored 14 papers 3 days ago

DragAnything: Motion Control for Anything using Entity Representation

Paper • 2403.07420 • Published Mar 12, 2024 • 15

Learning Multi-dimensional Human Preference for Text-to-Image Generation

Paper • 2405.14705 • Published May 23, 2024

CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation

Paper • 2406.10462 • Published Jun 15, 2024

Decouple Content and Motion for Conditional Image-to-Video Generation

Paper • 2311.14294 • Published Nov 24, 2023

Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization

Paper • 2502.01051 • Published Feb 3

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published Feb 14 • 35

Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning

Paper • 2505.21067 • Published May 27 • 3

InstructEngine: Instruction-driven Text-to-Image Alignment

Paper • 2504.10329 • Published Apr 14

OneRec Technical Report

Paper • 2506.13695 • Published Jun 16 • 16

TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types

Paper • 2502.09925 • Published Feb 14

Thyme: Think Beyond Images

Paper • 2508.11630 • Published 22 days ago • 79

Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models

Paper • 2504.08809 • Published Apr 9

OneRec-V2 Technical Report

Paper • 2508.20900 • Published 9 days ago • 19

Kwai Keye-VL 1.5 Technical Report

Paper • 2509.01563 • Published 5 days ago • 29

authored a paper 2 months ago

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2 • 130