muhtasham/hifigan-ar-v2

	---
	language: ar
	license: mit
	tags:
	- speech
	- audio
	- vocoder
	- hifigan
	- tts
	---

	# HiFiGAN Arabic Vocoder

	A standalone implementation of the HiFi-GAN vocoder for Arabic text-to-speech. It converts mel spectrograms into audio waveforms and is based on the paper "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis" (https://arxiv.org/pdf/2010.05646.pdf).

	## Usage

	```python
	import torch
	from hifigan_ar_v2 import HiFiGANArabicGenerator

	# Load the generator weights and its configuration
	model = HiFiGANArabicGenerator.from_pretrained("generator.ckpt", "config.json")

	# Random values stand in for a real mel spectrogram: [batch, n_mels, frames]
	mel = torch.rand(1, 80, 122)

	# Gradients are not needed for inference
	with torch.no_grad():
	    audio = model(mel)  # Shape: [1, 1, 8448]
	```
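
The example tensor above is laid out as `[batch, 80, frames]`. A minimal sketch of a shape check that could run before calling the model is shown here. `check_mel_shape` is a hypothetical helper and not part of `hifigan_ar_v2`, and the three-dimensional layout is an assumption inferred from the example tensor rather than a documented contract.

```python
def check_mel_shape(shape, n_mels=80):
    """Validate a [batch, n_mels, frames] shape and return the frame count.

    Hypothetical helper: not part of hifigan_ar_v2.
    """
    if len(shape) != 3:
        raise ValueError(
            f"expected 3 dimensions [batch, n_mels, frames], got {len(shape)}"
        )
    batch, channels, frames = shape
    if channels != n_mels:
        raise ValueError(f"expected {n_mels} mel channels, got {channels}")
    if batch < 1 or frames < 1:
        raise ValueError("batch and frames must both be at least 1")
    return frames


# Same shape as the example tensor: torch.rand(1, 80, 122)
print(check_mel_shape((1, 80, 122)))  # prints 122
```

With a real tensor, the shape can be passed as `tuple(mel.shape)`.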

	## Model Details
	- Sample Rate: 22050 Hz
	- Input: Mel spectrogram (80 channels)
	- Output: Audio waveform (1 channel)
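
To listen to the output, the waveform can be written to a WAV file at the 22050 Hz sample rate listed above. The following is a minimal sketch using only the Python standard library. `write_wav` is a hypothetical helper, the 16-bit PCM format is a choice made for this example, and the sketch assumes the samples are floats roughly in `[-1.0, 1.0]`, which this model card does not state.

```python
import struct
import wave

SAMPLE_RATE = 22050  # from the Model Details list


def write_wav(path, samples, sample_rate=SAMPLE_RATE):
    """Write mono float samples as 16-bit PCM and return the duration in seconds.

    Assumption: samples are floats roughly in [-1.0, 1.0]; values outside
    that range are clipped.
    """
    clipped = [max(-1.0, min(1.0, float(s))) for s in samples]
    # Scale to the signed 16-bit range and pack as little-endian integers
    pcm = struct.pack(
        f"<{len(clipped)}h", *(int(round(s * 32767)) for s in clipped)
    )
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(1)  # output is a single channel
        wav_file.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(pcm)
    return len(clipped) / sample_rate
```

Assuming `audio` is the PyTorch tensor from the Usage section, a call might look like `write_wav("out.wav", audio.detach().squeeze().cpu().tolist())`. At 22050 Hz, the 8448 samples in that example correspond to about 0.38 seconds of audio.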