DocumentCode
3528154
Title
Speaker diarization in meeting audio
Author
Nwe, Tin Lay ; Sun, Hanwu ; Li, Haizhou ; Rahardja, Susanto
Author_Institution
Inst. for Infocomm Res. (I2R), A*STAR, Singapore
fYear
2009
fDate
19-24 April 2009
Firstpage
4073
Lastpage
4076
Abstract
This paper describes speaker diarization system on a NIST Rich Transcription 2007 (RT-07) meeting recognition evaluation data set for the task of multiple distant microphone (MDM). Our implementation includes three components: initial clustering, non-speech removal and cluster purification. Initial clusters are generated using directional of arrival (DOA) information and bootstrap clustering. Multiple GMM modeling for speech/non-speech classification is employed for non-speech removal component. In addition, a novel system fusion strategy using information from receiver operating curve (ROC) is proposed for non-speech removal component. Finally, consensus clustering approach together with iterative GMM clustering method is employed for speaker cluster purification. The system achieves the overall DER of 10.81%.
Keywords
direction-of-arrival estimation; pattern classification; pattern clustering; speaker recognition; GMM modeling; NIST Rich Transcription 2007 meeting recognition evaluation data set; bootstrap clustering; consensus clustering approach; directional of arrival; meeting audio; multiple distant microphone; nonspeech classification; nonspeech removal component; receiver operating curve; speaker cluster purification; speaker diarization system; system fusion strategy; Adaptive filters; Conferences; Direction of arrival estimation; Erbium; Machine learning; Natural languages; Purification; Speech processing; Sun; Tin; Meetings; clustering methods; modeling; pattern classification; speech processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960523
Filename
4960523
Link To Document