new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

by AK and the research community

Sep 18

Submitted by

akhaliq

OmniGen: Unified Image Generation

·
9 authors

Submitted by

akhaliq

NVLM: Open Frontier-Class Multimodal LLMs

·
10 authors

Submitted by

akhaliq

Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

·
6 authors

Submitted by

akhaliq

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

·
6 authors

Submitted by

alexmartin1722

Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models

·
6 authors

Submitted by

akhaliq

EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer

·
7 authors

Submitted by

leejaymin

A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B

·
5 authors

Submitted by

akhaliq

OSV: One Step is Enough for High-Quality Image to Video Generation

·
8 authors

Submitted by

akhaliq

On the limits of agency in agent-based models

·
5 authors

Submitted by

akhaliq

Agile Continuous Jumping in Discontinuous Terrains

·
11 authors

Submitted by

akhaliq

SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction

·
7 authors

Submitted by

soujanyaporia

Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse

·
6 authors

Submitted by

obiwan96

Human-like Affective Cognition in Foundation Models

·
8 authors

Submitted by

moein99

Implicit Neural Representations with Fourier Kolmogorov-Arnold Networks

·
4 authors

Submitted by

moein99

Single-Layer Learnable Activation for Implicit Neural Representation (SL$^{2}$A-INR)

·
6 authors

Submitted by

ZacharyNovack

PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing

·
4 authors