Title :
Robust speaking rate estimation using broad phonetic class recognition
Author :
Yuan, Jiahong ; Liberman, Mark
Author_Institution :
Univ. of Pennsylvania, Philadelphia, PA, USA
Abstract :
Robust speaking rate estimation can be useful in automatic speech recognition and speaker identification, and accurate, automatic measures of speaking rate are also relevant for research in linguistics, psychology, and social sciences. In this study we built a broad phonetic class recognizer for speaking rate estimation. We tested the recognizer on a variety of data sets, including laboratory speech, telephone conversations, foreign accented speech, and speech in different languages, and we found that the recognizer's estimates are robust under these sources of variation. We also found that the acoustic models of the broad phonetic classes are more robust than those of the monophones for syllable detection.
Keywords :
estimation theory; natural languages; speaker recognition; speech processing; speech recognition; automatic speech recognition; broad phonetic class recognition; foreign accented speech; laboratory speech; languages; linguistics; monophones; psychology; robust speaking rate estimation; social sciences; speaker identification; telephone conversations; Acoustic signal detection; Automatic speech recognition; Detection algorithms; Frequency; Natural languages; Psychology; Robustness; Speech enhancement; Speech recognition; Testing; Speaking rate estimation; broad phonetic class; robustness; syllable detection
Conference_Title :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
ISSN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495686