new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Aug 28

Submitted by

netag

Beyond Transcription: Mechanistic Interpretability in ASR

·
9 authors

4

Submitted by

wyu1

Self-Rewarding Vision-Language Model via Reasoning Decomposition

·
11 authors

Submitted by

Zery

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

·
11 authors

Submitted by

XingweiT

Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?

·
4 authors

2

Submitted by

Liang-ZX

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

·
10 authors

3

Submitted by

zParquet

MIDAS: Multimodal Interactive Digital-human Synthesis via Real-time Autoregressive Video Generation

·
8 authors

Submitted by

pengxiang

Diffusion Language Models Know the Answer Before Decoding

·
9 authors

2

Submitted by

wybertwang

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

·
7 authors

Submitted by

zaydzuhri

Predicting the Order of Upcoming Tokens Improves Language Modeling

·
3 authors

2

Submitted by

wentingzhao

StepWiser: Stepwise Generative Judges for Wiser Reasoning

·
7 authors

Submitted by

blinoff

Gaze into the Heart: A Multi-View Video Dataset for rPPG and Health Biomarkers Estimation

·
7 authors

Submitted by

Jungang

Mind the Third Eye! Benchmarking Privacy Awareness in MLLM-powered Smartphone Agents

·
6 authors

Submitted by

taesiri

MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment

·
5 authors

Submitted by

lilvjosephtang

SEAM: Semantically Equivalent Across Modalities Benchmark for Vision-Language Models

·
4 authors

Submitted by

taesiri

DeepScholar-Bench: A Live Benchmark and Automated Evaluation for Generative Research Synthesis

·
7 authors

Submitted by

taesiri

Taming the Chaos: Coordinated Autoscaling for Heterogeneous and Disaggregated LLM Inference

·
11 authors

Submitted by

teddykoker

Training a Foundation Model for Materials on a Budget

·
2 authors