Title :
A distinctive feature based method for evaluating the phonetic transcription of a non-native speech database
Author :
Zhang, Jinsong ; Wang, Dongning ; Cao, Wen ; Xiong, Ziyu
Author_Institution :
Center of Studies of Chinese as a Second Language, Beijing Language & Culture Univ., Beijing, China
Date :
Nov. 29 2010-Dec. 3 2010
Abstract :
To support studies of second language acquisition and computer-aided pronunciation training, an L2 Chinese speech database of Japanese learners was collected and phonetically annotated twice, by two independent groups of labelers. Because both annotations contain errors, appropriate methods are needed to screen out inconsistencies for rechecking. This paper presents a multi-step procedure for the problem: first, screen out checking candidates based on statistical distributional analyses of the inconsistent transcriptions; then, analyze and merge inconsistent phoneme transcriptions based on phonetic knowledge; finally, use a phonetic-feature-based analyzer to order the inconsistent label pairs for rechecking. The label pairs with the greatest dissimilarity are assigned top priority. Preliminary experimental results showed that the procedure helped generate a meaningful candidate list and priority ordering for rechecking.
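The final ordering step can be illustrated with a minimal sketch. It is not the authors' analyzer: the feature table, the feature names, the normalized Hamming distance, and the function names `dissimilarity` and `recheck_order` are all assumptions made for illustration, since the abstract does not specify the feature set or the distance measure.

```python
from collections import Counter

# Hypothetical, heavily simplified binary features for a few pinyin initials.
# The paper's actual distinctive-feature inventory is not given in the abstract.
FEATURES = {
    "b":  {"aspirated": 0, "retroflex": 0, "continuant": 0, "labial": 1},
    "p":  {"aspirated": 1, "retroflex": 0, "continuant": 0, "labial": 1},
    "z":  {"aspirated": 0, "retroflex": 0, "continuant": 0, "labial": 0},
    "zh": {"aspirated": 0, "retroflex": 1, "continuant": 0, "labial": 0},
    "sh": {"aspirated": 0, "retroflex": 1, "continuant": 1, "labial": 0},
    "f":  {"aspirated": 0, "retroflex": 0, "continuant": 1, "labial": 1},
}


def dissimilarity(a, b):
    """Fraction of features on which two phoneme labels differ (assumed measure)."""
    fa, fb = FEATURES[a], FEATURES[b]
    differing = sum(fa[name] != fb[name] for name in fa)
    return differing / len(fa)


def recheck_order(labels_a, labels_b):
    """Order inconsistent label pairs: most dissimilar first, then most frequent."""
    # Keep only positions where the two annotations disagree; sort each pair
    # so that (x, y) and (y, x) are counted as the same confusion.
    pairs = Counter(
        tuple(sorted(pair)) for pair in zip(labels_a, labels_b) if pair[0] != pair[1]
    )
    return sorted(pairs, key=lambda p: (-dissimilarity(*p), -pairs[p], p))


# Two hypothetical annotations of the same five segments.
group_1 = ["b", "z", "zh", "p", "z"]
group_2 = ["p", "zh", "f", "p", "zh"]
for pair in recheck_order(group_1, group_2):
    print(pair, dissimilarity(*pair))
```

In this sketch, pairs that differ on many features are treated as likely outright labeling errors and come first, while near-identical pairs (differing on a single feature) fall to the end of the rechecking list.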
Keywords :
audio databases; feature extraction; natural language processing; speech processing; statistical analysis; statistical distributions; Japanese learners; L2 Chinese speech database; computer aided pronunciation training; feature based method; nonnative speech database; phoneme transcriptions; phonetic knowledge; phonetic transcription; second language acquisition; statistical distributional analyses; Compounds; Correlation; Databases; Labeling; Reliability; Speech; Tongue; distinctive features; inter-labeler agreement; inter-language database;
Conference_Title :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
DOI :
10.1109/ISCSLP.2010.5684857