S4D: Speaker Diarization Toolkit in Python - Le Mans Université Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

S4D: Speaker Diarization Toolkit in Python

Résumé

In this paper, we present S4D, a new open-source Python toolkit dedicated to speaker diarization. S4D provides various state-of-the-art components and the possibility to easily develop end-to-end diarization prototype systems. S4D offers a large panel of clustering, segmentation, scoring and visualization algorithms. S4D has been thought to be easily understood, installed, modified and used in order to allow fast transfers of diarization technologies to industry and facilitate development of new approaches. Examples, benchmarks on standard tasks and tutori-als are provided in this paper. S4D is an extension of the open-source toolkit for speaker recognition: SIDEKIT.
Fichier principal
Vignette du fichier
s4d-speaker-diarization.pdf (277.92 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02280162 , version 1 (06-09-2019)

Identifiants

  • HAL Id : hal-02280162 , version 1

Citer

Pierre-Alexandre Broux, Florent Desnous, Anthony Larcher, Simon Petitrenaud, Jean Carrive, et al.. S4D: Speaker Diarization Toolkit in Python. Interspeech, Sep 2018, Hyderabad, India. ⟨hal-02280162⟩
1197 Consultations
2800 Téléchargements

Partager

Gmail Facebook X LinkedIn More