Submitted by akhaliq 29 Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think · 6 authors 2
Submitted by akhaliq 26 Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion · 6 authors 2
Submitted by alexmartin1722 23 Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models · 6 authors 2
Submitted by akhaliq 19 EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer · 7 authors 3
Submitted by leejaymin 17 A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B · 5 authors 3
Submitted by akhaliq 14 OSV: One Step is Enough for High-Quality Image to Video Generation · 8 authors 2
Submitted by akhaliq 9 SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction · 7 authors 2
Submitted by soujanyaporia 7 Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse · 6 authors 2
Submitted by moein99 5 Implicit Neural Representations with Fourier Kolmogorov-Arnold Networks · 4 authors 2
Submitted by moein99 5 Single-Layer Learnable Activation for Implicit Neural Representation (SL$^{2}$A-INR) · 6 authors 2
Submitted by ZacharyNovack 5 PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing · 4 authors 2