Delong Chen

chendelong

https://chendelong.world/

AI & ML interests

Vision-language, Music AI

Recent Activity

authored a paper 1 day ago

Planning with Reasoning using Vision Language World Model

upvoted a paper 1 day ago

Planning with Reasoning using Vision Language World Model

commented on a paper 1 day ago

Planning with Reasoning using Vision Language World Model

authored a paper 1 day ago

Planning with Reasoning using Vision Language World Model

Paper • 2509.02722 • Published 3 days ago • 13

authored 7 papers over 1 year ago

Few-shot Adaptation of Multi-modal Foundation Models: A Survey

Paper • 2401.01736 • Published Jan 3, 2024

The Pyramid of Captions

Paper • 2405.00485 • Published May 1, 2024

High-Dimension Human Value Representation in Large Language Models

Paper • 2404.07900 • Published Apr 11, 2024 • 1

Subobject-level Image Tokenization

Paper • 2402.14327 • Published Feb 22, 2024 • 18

VirtualConductor: Music-driven Conducting Video Generation System

Paper • 2108.04350 • Published Jul 28, 2021

Taming Diffusion Models for Music-driven Conducting Motion Generation

Paper • 2306.10065 • Published Jun 15, 2023

ProtoCLIP: Prototypical Contrastive Language Image Pretraining

Paper • 2206.10996 • Published Jun 22, 2022

authored a paper almost 2 years ago

Towards Joint Modeling of Dialogue Response and Speech Synthesis based on Large Language Model

Paper • 2309.11000 • Published Sep 20, 2023 • 2

authored 2 papers about 2 years ago

Visual Instruction Tuning with Polite Flamingo

Paper • 2307.01003 • Published Jul 3, 2023 • 1

RemoteCLIP: A Vision Language Foundation Model for Remote Sensing

Paper • 2306.11029 • Published Jun 19, 2023 • 1