• DocumentCode
    3109249
  • Title

    Criteria for database and tool design for speech timing analysis with special reference to Mandarin

  • Author

    Yu Jue; Gibbon, D.

  • Author_Institution
    Sch. of Humanities, Zhejiang Univ., Hangzhou, China
  • fYear
    2012
  • fDate
    9-12 Dec. 2012
  • Firstpage
    41
  • Lastpage
    46
  • Abstract
    This position paper investigates some of the problems in modelling speech timing for the design of speech databases and corpus analysis tools for phonetics and speech technology. First we examine a selection of phonetic approaches to speech timing analysis, the so-called 'rhythm metrics', and focus on explaining (1) inconsistencies (varying results for the same language) and (2) the failure to model rhythmic alternation. To overcome these problems we present a new perspective on the phonetic identification of rhythm patterns as a special case of duration modelling, including the additional criterion of alternation. We describe the Rhythm Parser, a tool for identifying hierarchical alternating patterns, and discuss results from applying it.
  • Keywords
    grammars; natural language processing; speech processing; text analysis; Mandarin text; corpus analysis tool design; duration modelling; focus condition; hierarchical rhythm alternation pattern phonetic identification; phonetic approach selection; rhythm metrics; rhythm parser tool; rhythmic alternation criterion; speech database tool design; speech timing analysis; Acceleration; Indexes; Rhythm; Speech; Stress; Timing; bottom-up analysis; peak unit; rhythm metric; speech corpus; speech timing; timing hierarchy;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Speech Database and Assessments (Oriental COCOSDA), 2012 International Conference on
  • Conference_Location
    Macau
  • Print_ISBN
    978-1-4673-2811-1
  • Electronic_ISBN
    978-1-4673-2812-8
  • Type

    conf

  • DOI
    10.1109/ICSDA.2012.6422453
  • Filename
    6422453