S4D: Speaker Diarization Toolkit in Python

Abstract: In this paper, we present S4D, a new open-source Python toolkit dedicated to speaker diarization. S4D provides various state-of-the-art components and makes it easy to develop end-to-end diarization prototype systems. It offers a wide range of clustering, segmentation, scoring and visualization algorithms. S4D is designed to be easy to understand, install, modify and use, in order to allow fast transfer of diarization technologies to industry and to facilitate the development of new approaches. Examples, benchmarks on standard tasks and tutorials are provided in this paper. S4D is an extension of SIDEKIT, the open-source toolkit for speaker recognition.
Document type:
Conference paper
Interspeech 2018, Sep 2018, Hyderabad, India. 〈http://interspeech2018.org/〉

Cited literature: 3 references

https://hal-univ-lemans.archives-ouvertes.fr/hal-01818313
Contributor: Pierre-Alexandre Broux
Submitted on: Tuesday, 19 June 2018 - 00:04:35
Last modified on: Wednesday, 19 September 2018 - 17:23:21
Document(s) archived on: Tuesday, 25 September 2018 - 10:50:13

File

[Article] Computer-assisted Sp...
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01818313, version 1

Citation

Pierre-Alexandre Broux, Florent Desnous, Anthony Larcher, Simon Petitrenaud, Jean Carrive, et al. S4D: Speaker Diarization Toolkit in Python. Interspeech 2018, Sep 2018, Hyderabad, India. 〈http://interspeech2018.org/〉. 〈hal-01818313〉

Metrics

Record views

167

File downloads

166