DocumentCode :
2451728
Title :
Speaking rate estimation for multi-speakers
Author :
Wu, Yong ; He, Qian-Hua ; Li, Yan-Xiong
Author_Institution :
Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China
fYear :
2012
fDate :
16-18 July 2012
Firstpage :
976
Lastpage :
979
Abstract :
It is important to estimate speaking rates of multispeakers in multi-participants conversational speech, especially speaking rate of dominant participant. This paper proposes an algorithm for estimating speaking rates of multi-speakers. In the proposed algorithm, speaker segmentation and clustering are first performed. As a result, number of speakers and the corresponding speech of each speaker are obtained. Finally, detecting the local maxima of energy envelope of each speaker´s speech and then speaking rate of each speaker is defined as total number of local maxima divided by length of each speaker´s speech. Experimental results show that the proposed algorithm can estimate speaking rates of multi-speakers with satisfactory results, whereas the previous algorithms can only estimate speaking rate of single speaker.
Keywords :
pattern clustering; speaker recognition; dominant participant speaking rate; energy envelope; local maxima detection; multiparticipant conversational speech; multispeaker speaking rate estimation; speaker clustering; speaker segmentation; Algorithm design and analysis; Clustering algorithms; Educational institutions; Estimation; Speech; Speech processing; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Audio, Language and Image Processing (ICALIP), 2012 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0173-2
Type :
conf
DOI :
10.1109/ICALIP.2012.6376756
Filename :
6376756
Link To Document :
بازگشت