vibevoice-gguf / README.md
calcuis's picture
Update README.md
197bcd9 verified
metadata
license: mit
base_model:
  - microsoft/VibeVoice-1.5B
pipeline_tag: text-to-speech
tags:
  - gguf-connector

vibevoice-gguf

  • run it with gguf-connector; simply execute the command below in console/terminal
ggc v6

GGUF file(s) available. Select which one to use:

  1. vibevoice-1.5b-iq4_nl.gguf
  2. vibevoice-1.5b-q4_0.gguf
  3. vibevoice-1.5b-q8_0.gguf

Enter your choice (1 to 3): _

  • opt a gguf file in your current directory to interact with; nothing else

screenshot

  • note: for the latest update, you should be able to adjust the speech pacing (see picture above)
Prompt* Audio Sample
Speaker 1: Hey, why you folks always act together like a wolf pack?
Speaker 2: Oh, really? We just hang out for good food and share the bills.
Speaker 1: Wow. Amazing. A pig pack then!
Speaker 2: You must be the smartest joke maker in this universe.
🎧 audio-output-demo

*for this demo prompt, drag audio1.wav and audio2.wav inside the upload voice samples, then it will be taken as voice reference for speakers 1 and 2 (voice cloning)

reference