new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

by AK and the research community

Jul 9

Submitted by

yichaodu

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

·
19 authors

Submitted by

FeYuan

LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages

·
5 authors

Submitted by

mbur

Associative Recurrent Memory Transformer

·
4 authors

Submitted by

xhluca

Learning Action and Reasoning-Centric Image Editing from Videos and Simulations

·
7 authors

Submitted by

ethanchern

ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation

·
4 authors

Submitted by

fredsala

Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction

·
4 authors

Submitted by

akhaliq

UltraEdit: Instruction-based Fine-Grained Image Editing at Scale

·
10 authors

Submitted by

akhaliq

Compositional Video Generation as Flow Equalization

·
2 authors

Submitted by

wyt2000

InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct

·
16 authors

Submitted by

myownskyW7

Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images

·
10 authors

Submitted by

jedyang97

Multi-Object Hallucination in Vision-Language Models

·
8 authors

Submitted by

fairyang

PAS: Data-Efficient Plug-and-Play Prompt Augmentation System

·
19 authors

Submitted by

luohy

Training Task Experts through Retrieval Based Distillation

·
5 authors

Submitted by

ThomasFEL

Understanding Visual Feature Reliance through the Lens of Complexity

·
5 authors

Submitted by

kamwoh

PartCraft: Crafting Creative Objects by Parts

·
4 authors

Submitted by

TranSirius

LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking

·
8 authors

Submitted by

vanilla1116

ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models

·
6 authors