
Zeyue PRO

Zeyue7
  • ZeyueT

AI & ML interests

None yet

Organizations

  • Multimodal Art Projection
  • HKUST Audio

authored 10 papers 6 months ago

ChatMusician: Understanding and Generating Music Intrinsically with LLM

Paper • 2402.16153 • Published Feb 25, 2024 • 61

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

Paper • 2402.17723 • Published Feb 27, 2024 • 16

ComposerX: Multi-Agent Symbolic Music Composition with LLMs

Paper • 2404.18081 • Published Apr 28, 2024 • 2

Mixed Neural Voxels for Fast Multi-view Video Synthesis

Paper • 2212.00190 • Published Dec 1, 2022

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

Paper • 2407.20962 • Published Jul 30, 2024

Foundation Models for Music: A Survey

Paper • 2408.14340 • Published Aug 26, 2024 • 45

Audio-FLAN: A Preliminary Release

Paper • 2502.16584 • Published Feb 23, 2025 • 37

VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling

Paper • 2406.04321 • Published Jun 6, 2024 • 1

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11, 2025 • 70

AudioX: Diffusion Transformer for Anything-to-Audio Generation

Paper • 2503.10522 • Published Mar 13, 2025 • 27