DocumentCode
3017491
Title
Speech rate change detection in martingale framework
Author
Yasuda, Hozumi ; Kudo, Motoi
Author_Institution
Grad. Sch. of Inf. Sci. & Technol., Hokkaido Univ., Sapporo, Japan
fYear
2012
fDate
27-29 Nov. 2012
Firstpage
859
Lastpage
864
Abstract
Automatic speech recognition of conversational speech has not reached the level of practical use yet. Possible reasons are 1) multiple speakers speak in turn or sometimes at the same time, and 2) they speak very casually. One of key features to characterize conversational speech is the speaking speed. By knowing the speaking speed it becomes possible to adapt a recognition system to speech at that speed. Therefore, in this paper, we aim to estimate the speech rate in real time in order to choose a recognition model suitable for the speech rate. This paper introduces a probabilistic method for estimating the speech rate as well as the changing points of speech rates. We use a martingale framework in two stages to this goal. For examining the effectiveness of our method, two experiments are conducted. First, phoneme-level speech rate change detection is tried in both of reading and conversational speech. Second, the detection of the changing points of word-level speech rates is tried.
Keywords
speech recognition; stochastic processes; automatic speech recognition; changing point detection; conversational speech; martingale framework; phoneme-level speech rate change detection; probabilistic method; speaking speed; speech rate change detection; speech rate estimation; Accuracy; Estimation; Feature extraction; Intelligent systems; Proposals; Speech; Speech recognition; dialogue; martingale; speech rate; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems Design and Applications (ISDA), 2012 12th International Conference on
Conference_Location
Kochi
ISSN
2164-7143
Print_ISBN
978-1-4673-5117-1
Type
conf
DOI
10.1109/ISDA.2012.6416650
Filename
6416650
Link To Document