DocumentCode
2801237
Title
Robust speaking rate estimation using broad phonetic class recognition
Author
Yuan, Jiahong ; Liberman, Mark
Author_Institution
Univ. of Pennsylvania, Philadelphia, PA, USA
fYear
2010
fDate
14-19 March 2010
Firstpage
4222
Lastpage
4225
Abstract
Robust speaking rate estimation can be useful in automatic speech recognition and speaker identification, and accurate, automatic measures of speaking rate are also relevant for research in linguistics, psychology, and social sciences. In this study we built a broad phonetic class recognizer for speaking rate estimation. We tested the recognizer on a variety of data sets, including laboratory speech, telephone conversations, foreign accented speech, and speech in different languages, and we found that the recognizer´s estimates are robust under these sources of variation. We also found that the acoustic models of the broad phonetic classes are more robust than those of the monophones for syllable detection.
Keywords
estimation theory; natural languages; speaker recognition; speech processing; speech recognition; automatic speech recognition; broad phonetic class recognition; foreign accented speech; laboratory speech; languages; linguistics; monophones; psychology; robust speaking rate estimation; social sciences; speaker identification; telephone conversations; Acoustic signal detection; Automatic speech recognition; Detection algorithms; Frequency; Natural languages; Psychology; Robustness; Speech enhancement; Speech recognition; Testing; Speaking rate estimation; broad phonetic class; robustness; syllable detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5495686
Filename
5495686
Link To Document