msekoyan commited on
Commit
75a41cc
·
1 Parent(s): 784a4f3

update long-form inference

Browse files

Signed-off-by: monica-sekoyan <[email protected]>

Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -391,12 +391,12 @@ Number of characters per minute on [MUSAN](https://www.openslr.org/17) \[16] 48
391
 
392
  ### Long-form Inference
393
 
394
- `Canary-1b-v2` achieves strong performance on long-form transcription by using dynamic chunking with 1-second overlap between chunks, allowing for efficient parallel processing. This feature is automatically enabled when calling `.transcribe()` with `batch_size=1` on audio exceeding 40 seconds.
395
 
396
  | **Dataset** | **WER ↓** |
397
  | ----------------------- | --------- |
398
- | Earnings-22 | 13.51% |
399
- | This American Life | 8.65% |
400
 
401
  **Note:** Presented WERs do not include Punctuation and Capitalization errors.
402
 
 
391
 
392
  ### Long-form Inference
393
 
394
+ `Canary-1b-v2` achieves strong performance on long-form transcription by using dynamic chunking with 1-second overlap between chunks, allowing for efficient parallel processing. This dynamic chunking feature is automatically enabled when calling `.transcribe()` on a single audio file, or when using `batch_size=1` with multiple audio files that are longer than 40 seconds.
395
 
396
  | **Dataset** | **WER ↓** |
397
  | ----------------------- | --------- |
398
+ | Earnings-22 | 13.78% |
399
+ | This American Life | 9.87% |
400
 
401
  **Note:** Presented WERs do not include Punctuation and Capitalization errors.
402