arxiv:2508.19721

CAMÕES: A Comprehensive Automatic Speech Recognition Benchmark for European Portuguese

Published on Aug 27

Abstract

CAMÕES is an open framework for European Portuguese and other Portuguese varieties. It provides an evaluation benchmark and a collection of models, and its best models achieve relative WER reductions above 35% over the strongest zero-shot foundation model.

AI-generated summary

Existing resources for Automatic Speech Recognition (ASR) in Portuguese are mostly focused on Brazilian Portuguese, leaving European Portuguese (EP) and other varieties under-explored. To bridge this gap, we introduce CAMÕES, the first open framework for EP and other Portuguese varieties. It consists of (1) a comprehensive evaluation benchmark, including 46h of EP test data spanning multiple domains; and (2) a collection of state-of-the-art models. For the latter, we consider multiple foundation models, evaluating their zero-shot and fine-tuned performance, as well as E-Branchformer models trained from scratch. A curated set of 425h of EP data was used for both fine-tuning and training. Our results show comparable performance on EP between the fine-tuned foundation models and the E-Branchformer. Furthermore, the best-performing models achieve relative word error rate (WER) improvements above 35% compared to the strongest zero-shot foundation model, establishing a new state of the art for EP and other varieties.
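The headline figure in the abstract is a relative WER improvement, which is easy to confuse with an absolute difference in WER. The sketch below is a minimal illustration, not the authors' evaluation code: it computes WER as word-level edit distance divided by reference length, and the relative reduction between two systems. The function names and the example sentences and values are hypothetical and are not taken from the paper; real evaluations usually also apply text normalization, which is omitted here.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


def relative_wer_reduction(baseline_wer: float, new_wer: float) -> float:
    """Fraction by which new_wer improves on baseline_wer (0.35 means 35%)."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return (baseline_wer - new_wer) / baseline_wer


if __name__ == "__main__":
    # Hypothetical sentences: one substituted word out of five.
    print(word_error_rate("o gato dorme no sofá", "o gato dormiu no sofá"))
    # Hypothetical WERs for a zero-shot baseline and a fine-tuned model.
    print(relative_wer_reduction(0.20, 0.13))
```

With these made-up values, a drop from 20% to 13% WER is a 7-point absolute reduction but a relative reduction of about 35%, which is the kind of quantity the abstract reports.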
