Japanese QuartzNet Large

A QuartzNet Large model trained on ReazonSpeech Large (5000h).

It is a very lightweight model.

Training status

Training curves and related details are summarized in a WandB report.

All of these ASR models are evaluated by character error rate (CER).
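For readers unfamiliar with the metric, the following is a minimal sketch of how CER is commonly computed: the character-level Levenshtein distance between reference and hypothesis, summed over the corpus and divided by the total number of reference characters. The function names `edit_distance` and `cer` are hypothetical, and the text normalization and averaging used to produce the numbers on this card are not stated, so this may differ from the actual evaluation code.

```python
from typing import Sequence


def edit_distance(ref: str, hyp: str) -> int:
    """Character-level Levenshtein distance (substitutions, insertions, deletions)."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion from ref
                            curr[j - 1] + 1,      # insertion into ref
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def cer(references: Sequence[str], hypotheses: Sequence[str]) -> float:
    """Corpus-level CER: total edits divided by total reference characters.

    Assumption: corpus-level aggregation with no text normalization.
    """
    if len(references) != len(hypotheses):
        raise ValueError("references and hypotheses must have the same length")
    total_chars = sum(len(r) for r in references)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    total_edits = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    return total_edits / total_chars


if __name__ == "__main__":
    refs = ["今日は晴れです"]
    hyps = ["今日は晴です"]
    print(f"CER: {cer(refs, hyps) * 100:.3f} %")
```

Because Japanese text has no whitespace word boundaries, a character-level metric avoids depending on a tokenizer, which is one reason CER is usually preferred over word error rate for Japanese ASR.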

Training set : 19.619 %

Validation set : 17.909 %

Test set :

Coming soon...
