update long-form inference
Signed-off-by: monica-sekoyan <[email protected]>
README.md
CHANGED
@@ -391,12 +391,12 @@ Number of characters per minute on [MUSAN](https://www.openslr.org/17) \[16] 48
 
 ### Long-form Inference
 
-`Canary-1b-v2` achieves strong performance on long-form transcription by using dynamic chunking with 1-second overlap between chunks, allowing for efficient parallel processing. This feature is automatically enabled when calling `.transcribe()`
+`Canary-1b-v2` achieves strong performance on long-form transcription by using dynamic chunking with 1-second overlap between chunks, allowing for efficient parallel processing. This dynamic chunking feature is automatically enabled when calling `.transcribe()` on a single audio file, or when using `batch_size=1` with multiple audio files that are longer than 40 seconds.
 
 | **Dataset** | **WER ↓** |
 | ----------------------- | --------- |
-| Earnings-22 | 13.
-| This American Life |
+| Earnings-22 | 13.78% |
+| This American Life | 9.87% |
 
 **Note:** Presented WERs do not include Punctuation and Capitalization errors.
 
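
The 1-second overlap mentioned in the changed paragraph can be pictured with a small sketch. This is not NeMo's implementation: the function name `chunk_boundaries` is hypothetical, the chunks here have a fixed length rather than a dynamically chosen one, and the 40-second chunk length is an assumption borrowed from the 40-second threshold in the README text, which does not state the actual chunk size.

```python
from typing import List, Tuple


def chunk_boundaries(
    duration_s: float,
    chunk_s: float = 40.0,   # assumed chunk length, not confirmed by the README
    overlap_s: float = 1.0,  # overlap between consecutive chunks, per the README
) -> List[Tuple[float, float]]:
    """Return (start, end) times in seconds for fixed-length overlapping chunks."""
    if duration_s <= 0:
        return []
    if not 0 <= overlap_s < chunk_s:
        raise ValueError("overlap_s must be in [0, chunk_s)")
    if duration_s <= chunk_s:
        # Short audio fits in a single chunk, so no splitting is needed.
        return [(0.0, duration_s)]

    step = chunk_s - overlap_s  # each new chunk starts overlap_s before the previous end
    chunks: List[Tuple[float, float]] = []
    start = 0.0
    while True:
        end = min(start + chunk_s, duration_s)
        chunks.append((start, end))
        if end >= duration_s:
            break
        start += step
    return chunks


if __name__ == "__main__":
    for start, end in chunk_boundaries(100.0):
        print(f"{start:6.1f} s to {end:6.1f} s")
```

Because every chunk is defined only by its start and end time, the chunks are independent of one another and can be decoded in parallel, which is the efficiency benefit the paragraph refers to. How the overlapping transcripts are merged is not described in the README and is left out of the sketch.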