Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
OpenSpeechHub 's Collections
⭐ July(1-7) 2025 - Open Speech Models
Japanese
Orpheus-Japanese
Italian
Thai

⭐ July(1-7) 2025 - Open Speech Models

updated Jul 4
Upvote
-

  • kyutai/tts-1.6b-en_fr

    Text-to-Speech • Updated Jul 8 • 61.6k • 318

  • MuteSwap: Silent Face-based Voice Conversion

    Paper • 2507.00498 • Published Jul 1

  • A Dataset for Automatic Assessment of TTS Quality in Spanish

    Paper • 2507.01805 • Published Jul 2

  • Voice Conversion for Likability Control via Automated Rating of Speech Synthesis Corpora

    Paper • 2507.01356 • Published Jul 2

  • SpeechAccentLLM: A Unified Framework for Foreign Accent Conversion and Text to Speech

    Paper • 2507.01348 • Published Jul 2

  • Multi-interaction TTS toward professional recording reproduction

    Paper • 2507.00808 • Published Jul 1

  • Teaching Audio-Aware Large Language Models What Does Not Hear: Mitigating Hallucinations through Synthesized Negative Samples

    Paper • 2505.14518 • Published May 20

  • Adaptability of ASR Models on Low-Resource Language: A Comparative Study of Whisper and Wav2Vec-BERT on Bangla

    Paper • 2507.01931 • Published Jul 2

  • DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

    Paper • 2507.02768 • Published Jul 3 • 3

  • JoyTTS: LLM-based Spoken Chatbot With Voice Cloning

    Paper • 2507.02380 • Published Jul 3
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略