Ming-Omni: A Unified Multimodal Model for Perception and Generation Paper • 2506.09344 • Published Jun 11 • 27 • 4
Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction Paper • 2505.02471 • Published May 5 • 12 • 1
Animate-X: Universal Character Image Animation with Enhanced Motion Representation Paper • 2410.10306 • Published Oct 14, 2024 • 57 • 5
Animate-X: Universal Character Image Animation with Enhanced Motion Representation Paper • 2410.10306 • Published Oct 14, 2024 • 57 • 5
Mimir: Improving Video Diffusion Models for Precise Text Understanding Paper • 2412.03085 • Published Dec 4, 2024 • 12 • 2
Animate-X: Universal Character Image Animation with Enhanced Motion Representation Paper • 2410.10306 • Published Oct 14, 2024 • 57 • 5
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation Paper • 2311.15841 • Published Nov 27, 2023 • 2 • 2
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation Paper • 2311.15773 • Published Nov 27, 2023 • 4 • 2