Collections
Collections including paper arxiv:2412.09856
- DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes
  Paper • 2412.11100 • Published • 7
- LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity
  Paper • 2412.09856 • Published • 10
- DisPose: Disentangling Pose Guidance for Controllable Human Image Animation
  Paper • 2412.09349 • Published • 8
- MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation
  Paper • 2412.04448 • Published • 10

- MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model
  Paper • 2405.20222 • Published • 11
- ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation
  Paper • 2406.00908 • Published • 11
- CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation
  Paper • 2406.02509 • Published • 9
- I4VGen: Image as Stepping Stone for Text-to-Video Generation
  Paper • 2406.02230 • Published • 17

- WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
  Paper • 2401.09985 • Published • 17
- CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects
  Paper • 2401.09962 • Published • 9
- Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution
  Paper • 2401.10404 • Published • 10
- ActAnywhere: Subject-Aware Video Background Generation
  Paper • 2401.10822 • Published • 13