Submitted by akhaliq 94 Design2Code: How Far Are We From Automating Front-End Engineering? · 5 authors 2
Submitted by akhaliq 61 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis · 17 authors 3
Submitted by akhaliq 29 OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on · 4 authors 2
Submitted by akhaliq 27 MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies · 7 authors 6
Submitted by akhaliq 16 DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models · 7 authors 2
Submitted by akhaliq 15 InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding · 10 authors 1
Submitted by akhaliq 14 ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models · 10 authors 1
Submitted by akhaliq 8 ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models · 8 authors 1
Submitted by akhaliq 7 Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation · 7 authors 1
Submitted by akhaliq 5 3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos · 6 authors