Design of a speech recognition system based on acoustically derived segmental units

Author

Bacchiani, M. ; Ostendorf, M. ; Sagisaka, Y. ; Paliwal, K.

Author_Institution

ATR Interpreting Telecommun. Res. Labs., Kyoto, Japan

Volume

1

fYear

1996

fDate

7-10 May 1996

Firstpage

443

Abstract

The design of a speech recognition system based on acoustically-derived, segmental units can be divided in three steps: unit design, lexicon building and pronunciation modeling. We formulate an iterative unit design procedure which consistently uses a maximum likelihood (ML) objective in successive application of resegmentation and model re-estimation. The lexicon building allows multi-word entries in the lexicon but restricts the number of these entries in order to avoid a too costly search. Selected multi-word lexical entries are those with high frequency (such as function words) and those which consistently exhibit cross-word phone assimilation. The stochastic pronunciation model represents the likelihood of a particular acoustic segment sequence given the phonetic baseform of a lexical item, where the sequence of baseform phones are treated as a Markov state sequence and each state can emit multiple segments

Keywords

Markov processes; iterative methods; maximum likelihood estimation; sequences; speech recognition; Markov state sequence; acoustic segment sequence; acoustically derived segmental units; baseform phones; cross-word phone assimilation; function words; iterative unit design procedure; lexicon building; maximum likelihood; model re-estimation; multi-word entries; phonetic baseform; pronunciation modeling; resegmentation; speech recognition system; stochastic pronunciation model; Acoustical engineering; Buildings; Cepstral analysis; Degradation; Design engineering; Frequency; Maximum likelihood estimation; Polynomials; Speech recognition; Stochastic processes;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on

Conference_Location

Atlanta, GA

ISSN

1520-6149

Print_ISBN

0-7803-3192-3

Type

conf

DOI

10.1109/ICASSP.1996.541128

Filename

541128