OpenTSLab's Collections
Neural Signals
Audio
Scientific Time Series

Audio

updated Nov 12

  • rookie9/PicoAudio2

    Updated Sep 29 • 27

  • PicoAudio2

    Space • Sleeping • 4

    Online inference for PicoAudio2


  • PicoAudio2: Temporal Controllable Text-to-Audio Generation with Natural Language Description

    Paper • 2509.00683 • Published Aug 31

  • wsntxxn/UniFlow-Audio-large

    0.8B • Updated 25 days ago • 30

  • wsntxxn/UniFlow-Audio-medium

    0.4B • Updated 25 days ago • 10

  • wsntxxn/UniFlow-Audio-small

    0.2B • Updated 25 days ago • 11

  • UniFlow-Audio

    Space • Running on Zero • 4

    Generate audio from omni-modalities in a single model.


  • UniFlow-Audio: Unified Flow Matching for Audio Generation from Omni-Modalities

    Paper • 2509.24391 • Published Sep 29

  • Bayesian Speech Synthesizers Can Learn from Multiple Teachers

    Paper • 2510.24372 • Published Oct 28

  • marcoyang/spear-base-speech

    93.3M • Updated Nov 3 • 9

  • marcoyang/spear-base-speech-audio

    93.3M • Updated Nov 3 • 43

  • marcoyang/spear-large-speech

    0.3B • Updated Nov 3 • 6

  • marcoyang/spear-large-speech-audio

    0.3B • Updated Nov 3 • 50

  • marcoyang/spear-xlarge-speech-audio

    0.6B • Updated Nov 3 • 531 • 1

  • SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations

    Paper • 2510.25955 • Published Oct 29