S. Arora, E. Nyberg, R. , and C. P. , Estimating annotation cost for active learning in a multi-annotator environment, Proceedings of the NAACL HLT 2009 Workshop on Active Learning for Natural Language Processing, HLT '09, pp.18-26, 2009.
DOI : 10.3115/1564131.1564136

URL : http://aclweb.org/anthology-new/W/W09/W09-1903.pdf

C. Barras, E. Geoffrois, Z. Wu, and M. Liberman, , 2001.

, Transcriber: development and use of a tool for assisting speech corpora production, Speech Communication, vol.33, issue.1, pp.5-22

T. Bazillon, Y. Estève, and D. Luzzati, Transcription manuelle vs assistée de la parole préparé et spontanée, 2008.

J. Bonastre, P. Delacourt, C. Fredouille, T. Merlin, and C. Wellekens, A speaker tracking system based on speaker turn detection for NIST evaluation, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), pp.1177-1180, 2000.
DOI : 10.1109/ICASSP.2000.859175

P. Broux, D. Doukhan, S. Petitrenaud, S. Meignier, C. et al., An active learning method for speaker identity annotation in audio recordings, 1st International Workshop on Multimodal Media Data Analytics (MMDA), In conjuction with the 22nd European Conference on Artificial Intelligence (ECAI), 2016.
URL : https://hal.archives-ouvertes.fr/hal-01451532

M. Budnik, J. Poignant, L. Besacier, and G. Quénot, Automatic propagation of manual annotations for multimodal person identification in TV shows, 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI), pp.1-4, 2014.
DOI : 10.1109/CBMI.2014.6849849

URL : https://hal.archives-ouvertes.fr/hal-01002927

M. Charhad, D. Moraru, S. Ayache, and G. Quénot, Speaker identity indexing in audio-visual documents, Content-Based Multimedia Indexing (CBMI2005), 2005.
URL : https://hal.archives-ouvertes.fr/hal-00953917

R. Dufour, V. Jousse, Y. Estève, F. Béchet, and G. Linarès, Spontaneous speech characterization and detection in large audio database, 2009.
URL : https://hal.archives-ouvertes.fr/hal-01433943

O. Galibert, Methodologies for the evaluation of speaker diarization and automatic speech recognition in the presence of overlapping speech, INTERSPEECH, pp.1131-1134, 2013.

J. Kahn, O. Galibert, L. Quintard, M. Carré, A. Giraudel et al., A presentation of the REPERE challenge, 2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI), pp.1-6, 2012.
DOI : 10.1109/CBMI.2012.6269851

I. A. Mccowan, D. Moore, J. Dines, D. Gatica-perez, M. Flynn et al., On the use of information retrieval measures for speech recognition evaluation, 2004.

S. Meignier and T. Merlin, Lium spkdiarization: an open source toolkit for diarization, CMU SPUD Workshop, 2010.
URL : https://hal.archives-ouvertes.fr/hal-01433518

N. , The rich transcription spring 2003 (RT-03S) evaluation plan, 2003.

R. Ordelman, D. Jong, F. Larson, and M. , Enhanced Multimedia Content Access and Exploitation Using Semantic Speech Retrieval, 2009 IEEE International Conference on Semantic Computing, pp.521-528, 2009.
DOI : 10.1109/ICSC.2009.80

M. Snover, B. Dorr, R. Schwartz, L. Micciulla, and J. Makhoul, A study of translation edit rate with targeted human annotation, Proceedings of association for machine translation in the Americas, 2006.

F. Vallet, J. Uro, J. Andriamakaoly, H. Nabi, M. Derval et al., Speech trax: A bottom to the top approach for speaker tracking and indexing in an archiving context, Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC). European Language Resources Association (ELRA), 2016.

P. Wittenburg, H. Brugman, A. Russel, A. Klassmann, and H. Sloetjes, Elan: a professional framework for multimodality research, Proceedings of LREC, p.5, 2006.

M. E. Wood and E. Lewis, Windmill-the use of a parsing algorithm to produce predictions for disabled persons, PROCEEDINGS-INSTITUTE OF ACOUSTICS, vol.18, pp.315-322, 1996.