• DocumentCode
    3017491
  • Title

    Speech rate change detection in martingale framework

  • Author

    Yasuda, Hozumi ; Kudo, Motoi

  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Hokkaido Univ., Sapporo, Japan
  • fYear
    2012
  • fDate
    27-29 Nov. 2012
  • Firstpage
    859
  • Lastpage
    864
  • Abstract
    Automatic speech recognition of conversational speech has not yet reached the level of practical use. Possible reasons are that 1) multiple speakers speak in turn or sometimes at the same time, and 2) they speak very casually. One of the key features characterizing conversational speech is the speaking speed. Knowing the speaking speed makes it possible to adapt a recognition system to speech at that speed. Therefore, in this paper, we aim to estimate the speech rate in real time in order to choose a recognition model suited to that rate. This paper introduces a probabilistic method for estimating the speech rate as well as the change points of the speech rate. We use a martingale framework in two stages to achieve this goal. To examine the effectiveness of our method, two experiments are conducted. First, phoneme-level speech rate change detection is attempted on both read and conversational speech. Second, detection of the change points of word-level speech rates is attempted.
  • Keywords
    speech recognition; stochastic processes; automatic speech recognition; changing point detection; conversational speech; martingale framework; phoneme-level speech rate change detection; probabilistic method; speaking speed; speech rate change detection; speech rate estimation; Accuracy; Estimation; Feature extraction; Intelligent systems; Proposals; Speech; Speech recognition; dialogue; martingale; speech rate
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Intelligent Systems Design and Applications (ISDA), 2012 12th International Conference on
  • Conference_Location
    Kochi
  • ISSN
    2164-7143
  • Print_ISBN
    978-1-4673-5117-1
  • Type

    conf

  • DOI
    10.1109/ISDA.2012.6416650
  • Filename
    6416650