DocumentCode
3109249
Title
Criteria for database and tool design for speech timing analysis with special reference to Mandarin
Author
Yu Jue; Gibbon, D.
Author_Institution
Sch. of Humanities, Zhejiang Univ., Hangzhou, China
fYear
2012
fDate
9-12 Dec. 2012
Firstpage
41
Lastpage
46
Abstract
This position paper investigates some of the problems in modelling speech timing for the design of speech databases and corpus analysis tools for phonetics and speech technology. First we examine a selection of phonetic approaches to speech timing analysis, the so-called "rhythm metrics", and focus on explaining (1) inconsistencies (varying results for the same language) and (2) the failure to model rhythmic alternation. To overcome these problems we present a new perspective on the phonetic identification of rhythm patterns as a special case of duration modelling, including the additional criterion of alternation. We describe the Rhythm Parser, a tool for identifying hierarchical alternating patterns, and discuss results from applying it.
Keywords
grammars; natural language processing; speech processing; text analysis; Mandarin text; corpus analysis tool design; duration modelling; focus condition; hierarchical rhythm alternation pattern phonetic identification; phonetic approach selection; rhythm metrics; rhythm parser tool; rhythmic alternation criterion; speech database tool design; speech timing analysis; Acceleration; Indexes; Rhythm; Speech; Stress; Timing; bottom-up analysis; peak unit; rhythm metric; speech corpus; speech timing; timing hierarchy
fLanguage
English
Publisher
IEEE
Conference_Titel
Speech Database and Assessments (Oriental COCOSDA), 2012 International Conference on
Conference_Location
Macau
Print_ISBN
978-1-4673-2811-1
Electronic_ISBN
978-1-4673-2812-8
Type
conf
DOI
10.1109/ICSDA.2012.6422453
Filename
6422453