Submitted by akhaliq 18 NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models · 7 authors
Submitted by akhaliq 16 I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models · 5 authors
Submitted by akhaliq 15 $\textit{Trans-LoRA}$: towards data-free Transferable Parameter Efficient Finetuning · 7 authors
Submitted by akhaliq 14 Looking Backward: Streaming Video-to-Video Translation with Feature Banks · 6 authors 2
Submitted by akhaliq 14 Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer · 5 authors
Submitted by akhaliq 11 Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels · 6 authors 3
Submitted by akhaliq 10 LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters · 4 authors 2
Submitted by akhaliq 10 Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control · 7 authors
Submitted by akhaliq 7 Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models · 24 authors