Speaker diarization
Relies on pyannote.audio 2.0, which is currently in development; see its installation instructions.
from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained("AMITKESARI2000/pyannote_SD1")
output = pipeline("audio.wav")
for turn, _, speaker in output.itertracks(yield_label=True):
    # speaker speaks between turn.start and turn.end
    ...
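One common thing to do with the turns produced by the loop above is to total the speaking time per speaker. The following is a minimal sketch rather than part of the model's API: it assumes the turns have already been collected as plain `(turn.start, turn.end, speaker)` tuples in seconds, and the helper name `speaking_time` and the sample values are hypothetical.

```python
from collections import defaultdict


def speaking_time(segments):
    """Sum the duration of each speaker's turns.

    `segments` is an iterable of (start, end, speaker) tuples, in seconds.
    (Hypothetical helper, not provided by pyannote.audio.)
    """
    totals = defaultdict(float)
    for start, end, speaker in segments:
        totals[speaker] += end - start
    return dict(totals)


# Hypothetical values standing in for real pipeline output.
segments = [
    (0.0, 2.5, "SPEAKER_00"),
    (2.5, 4.0, "SPEAKER_01"),
    (4.0, 7.0, "SPEAKER_00"),
]
totals = speaking_time(segments)
```

With real output, the list could be built inside the loop by appending `(turn.start, turn.end, speaker)` for each turn.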
Benchmark
Dataset | Diarization error rate |
---|---|
AMI only_words evaluation set | 21.4% |
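For readers unfamiliar with the metric, diarization error rate is conventionally defined as the sum of false alarm, missed detection, and speaker confusion durations divided by the total duration of reference speech. The sketch below only illustrates that standard definition; this card does not state the evaluation settings (for example collar or overlap handling) behind the figure above, and the function name and input durations are hypothetical.

```python
def diarization_error_rate(false_alarm, missed_detection, confusion, total):
    """Standard DER definition; all arguments are durations in seconds.

    (Hypothetical helper for illustration, not the code used for the benchmark.)
    """
    if total <= 0:
        raise ValueError("total reference speech duration must be positive")
    return (false_alarm + missed_detection + confusion) / total


# Hypothetical durations, unrelated to the benchmark figure above.
der = diarization_error_rate(
    false_alarm=2.0, missed_detection=5.0, confusion=3.0, total=100.0
)
```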