Daily Papers

by AK and the research community

Nov 14

Submitted by

akhaliq

Music ControlNet: Multiple Time-varying Controls for Music Generation

·
4 authors

Submitted by

akhaliq

ChatAnything: Facetime Chat with LLM-Enhanced Personas

·
7 authors

Submitted by

akhaliq

Story-to-Motion: Synthesizing Infinite and Controllable Character Animation from Long Text

·
4 authors

Submitted by

akhaliq

Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models

·
14 authors

Submitted by

akhaliq

To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning

·
6 authors

Submitted by

akhaliq

GOAT: GO to Any Thing

·
13 authors

Submitted by

akhaliq

SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models

·
16 authors

Submitted by

akhaliq

MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks

·
11 authors

Submitted by

akhaliq

GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

·
12 authors

Submitted by

akhaliq

The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4

·
2 authors

Submitted by

akhaliq

Trusted Source Alignment in Large Language Models

·
7 authors

Submitted by

akhaliq

LayoutPrompter: Awaken the Design Ability of Large Language Models

·
6 authors

Submitted by

akhaliq

Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer

·
6 authors

Submitted by

akhaliq

Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data

·
9 authors

Submitted by

akhaliq

Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?"

·
31 authors