DocumentCode
454704
Title
Nuts and Flakes: a Study of Data Characteristics in Speaker Diarization
Author
Mirghafori, Nikki ; Wooters, Chuck
Author_Institution
Int. Comput. Sci. Inst., Berkeley, CA
Volume
1
fYear
2006
fDate
14-19 May 2006
Abstract
Researchers in the speaker diarization community have observed that some audio files show unusually high diarization error rates (DER) (hard to crack "nuts"), and some exhibit hyper-sensitivity to tuning parameters ("flakes"). The goal of this study is to systematically study the features that correlate with such behavior. We calculated over forty features for each of 24 shows from the broadcast news corpus along the dimensions of speaker count, conversation turn, and speaker and show duration. We observed that number of speakers, number of turns, and do-nothing DER (a measure related to the percentage of time the dominant speaker spoke) correlated best with "nuttiness". The do-nothing DER and number of speakers were also the best correlates of "flakiness"
Keywords
speech processing; broadcast news corpus; data characteristics; diarization error rates; flakes; nuts; speaker diarization; Audio recording; Broadcasting; Cellular neural networks; Computer science; Contracts; Density estimation robust algorithm; Error analysis; NIST; Optimal matching; Time measurement;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1660196
Filename
1660196
Link To Document