Using Articulatory Representations to Detect Segmental Errors in Nonnative Pronunciation

Author

Tepperman, Joseph ; Narayanan, Shrikanth

Author_Institution

Viterbi Sch. of Eng., Southern California Univ., Los Angeles, CA

Volume

16

Issue

1

fYear

2008

Firstpage

8

Lastpage

22

Abstract

Motivated by potential applications in second-language pedagogy, we present a novel approach to using articulatory information to improve automatic detection of typical phone-level errors made by nonnative speakers of English-a difficult task that involves discrimination between close pronunciations. We describe a reformulation of the hidden-articulator Markov model (HAMM) framework that is appropriate for the pronunciation evaluation domain. Model training requires no direct articulatory measurement, but rather involves a constrained and interpolated mapping from phone-level transcriptions to a set of physically and numerically meaningful articulatory representations. Here, we define two new methods of deriving articulatory-based features for classification: one, by concatenating articulatory recognition results over eight streams representative of the vocal tract´s constituents; the other, by calculating multidimensional articulatory confidence scores within these representations based on general linguistic knowledge of articulatory variants. After adding these articulatory features to traditional phone-level confidence scores, our results demonstrate absolute reductions in combined error rates for verification of segment-level pronunciations produced by nonnative speakers in the ISLE corpus by as much as 16%-17% for some target segments, and a 3%-4% absolute improvement overall.

Keywords

hidden Markov models; interpolation; linguistics; natural languages; signal classification; signal representation; speaker recognition; articulatory representation; articulatory-based feature classification; hidden-articulator Markov model; interpolated mapping; linguistic knowledge; nonnative English speaker; nonnative pronunciation; phone-level confidence score; phone-level segmental error detection; phone-level transcription; second-language pedagogy; Error analysis; Multidimensional systems; Natural languages; Speech analysis; Speech recognition; Statistical analysis; Stress; Tongue; Viterbi algorithm; Articulatory features; hidden-articulator Markov model (HAMM); language learning; nonnative speech; pronunciation evaluation; reading assessment;

fLanguage

English

Journal_Title

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher

ieee

ISSN

1558-7916

Type

jour

DOI

10.1109/TASL.2007.909330

Filename

4384605