| | --- |
| | license: apache-2.0 |
| | language: |
| | - ca |
| | pipeline_tag: text-to-speech |
| | tags: |
| | - TTS |
| | - speech-synthesis |
| | - Catalan |
| | - VITS |
| | --- |
| | # Aholab TTS Synthesis models - Pau [ca] |
| | ## Description |
| | This repository contains Pau TTS model in Catalan. |
| | This model is part of a [collection](https://huggingface.co/collections/HiTZ/tts) of text-to-speech (TTS) models in Basque (eu), Galician (gl), Catalan (ca) and Spanish (es). All voices in this collections are based on the VITS architecture proposed by [Kim et al. (2021)](https://arxiv.org/abs/2106.06103). |
| |
|
| | * Basque [eu]: |
| | - antton |
| | - maider |
| | * Galician [gl]: |
| | - brais |
| | - celtia |
| | - iago |
| | - icia |
| | - paulo |
| | - sabela |
| | * Catalan [ca]: |
| | - pau |
| | - ona |
| | * Spanish [es]: |
| | - laura |
| | - alejandro |
| |
|
| | ## Uses |
| | These models are intented to be used for speech synthesis in Basque, Galician, Catalan and Spanish. |
| |
|
| | ### How to use |
| | If you want to use this model for synthesis please go to the following Github repo: [aHoTTS](https://github.com/hitz-zentroa/aHoTTS) |
| |
|
| | ## Additional information |
| | ### Voice Resource Licenses and references |
| | * Galician |
| | - Celtia |
| | Public Creative Commond Attribution 4.0 International License |
| | [Vázquez Abuín, M., García Díaz, N., Vladu, A. I., Magariños, C., Vidal Miguéns, A., & Fernández Rei, E. (2023). Nos_Celtia-GL: Galician TTS corpus (1.0.0.) [Data set]. Zenodo.](https://doi.org/10.5281/zenodo.7716958) |
| | - Brais |
| | Public Creative Commond Attribution 4.0 International License |
| | [Vladu, A. I., García Díaz, N., Regueira Fernández, X. L., Magariños, C., Moscoso Sánchez, A., Fernández López, D., Fernández Rei, E., & Dubert-García, F. (2025). Nos_Brais-GL: Galician TTS corpus [Data set]. Zenodo](https://doi.org/10.5281/zenodo.14265241) |
| | - Sabela/Icia/Iago/Paulo |
| | Public Creative Commond Attribution 4.0 International License |
| | [Centro Ramón Piñeiro para a Investigación en Humanidades (CRPIH), & Multimedia Technology Group (GTM) – atlanTTic Research Center for Telecommunication Technologies. (2023). CRPIH_UVigo-GL-Voices: Galician TTS dataset (1.0.0.) [Data set]. Zenodo.](https://doi.org/10.5281/zenodo.8027725) |
| | * Catalan |
| | - Creative Commons Attribution-ShareAlike 4.0 International Public License [festcat_trimmed_denoised](https://huggingface.co/datasets/projecte-aina/festcat_trimmed_denoised) |
| | * Basque |
| | - Maider, Antton: developed by HiTZ with funding from Project ILENIA. Public Creative Commond Attribution 4.0 |
| | * Spanish |
| | - Alejandro: Developed in HiTZ from [openSLR dataset.](https://openslr.org/94/) |
| | - Laura: Acquired in [ELRA ID: ELRA-S0309](https://catalog.elra.info/en-us/repository/browse/ELRA-S0309/) |
| | ### Authors |
| | HiTZ Basque Center for Language Technology - Aholab Signal Processing Laboratory, University of the Basque Country EHU. |
| | ### Contact information |
| | Ibon Saratxaga: ibon.saratxaga@ehu.eus |
| | ### Licensing Information |
| | [Apache License, Version 2.0](https://www.apache.org/licenses/LICENSE-2.0) |
| | ### Funding |
| | Catalan and Galician have been funded by the project with reference numbers 2022/TL22/00215337, 2022/TL22/00215336, 2022/TL22/00215335, and 2022/TL22/00215334 is funded by the Ministry of Digital Transformation and by the Recovery, Transformation and Resilience Plan – Funded by the European Union – NextGenerationEU. |
| | ### Citation information |
| | Hernaez, I., Navas, E., Murugarren, J.L., Etxebarria, B. (2001) Description of the AhoTTS system for the Basque language. Proc. 4th ISCA ITRW on Speech Synthesis (SSW 4), paper 202 |