• DocumentCode
    302320
  • Title

    Design of a speech recognition system based on acoustically derived segmental units

  • Author

    Bacchiani, M. ; Ostendorf, M. ; Sagisaka, Y. ; Paliwal, K.

  • Author_Institution
    ATR Interpreting Telecommun. Res. Labs., Kyoto, Japan
  • Volume
    1
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    443
  • Abstract
    The design of a speech recognition system based on acoustically-derived, segmental units can be divided in three steps: unit design, lexicon building and pronunciation modeling. We formulate an iterative unit design procedure which consistently uses a maximum likelihood (ML) objective in successive application of resegmentation and model re-estimation. The lexicon building allows multi-word entries in the lexicon but restricts the number of these entries in order to avoid a too costly search. Selected multi-word lexical entries are those with high frequency (such as function words) and those which consistently exhibit cross-word phone assimilation. The stochastic pronunciation model represents the likelihood of a particular acoustic segment sequence given the phonetic baseform of a lexical item, where the sequence of baseform phones are treated as a Markov state sequence and each state can emit multiple segments
  • Keywords
    Markov processes; iterative methods; maximum likelihood estimation; sequences; speech recognition; Markov state sequence; acoustic segment sequence; acoustically derived segmental units; baseform phones; cross-word phone assimilation; function words; iterative unit design procedure; lexicon building; maximum likelihood; model re-estimation; multi-word entries; phonetic baseform; pronunciation modeling; resegmentation; speech recognition system; stochastic pronunciation model; Acoustical engineering; Buildings; Cepstral analysis; Degradation; Design engineering; Frequency; Maximum likelihood estimation; Polynomials; Speech recognition; Stochastic processes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.541128
  • Filename
    541128