• DocumentCode
    3522347
  • Title

    Phoneme environment clustering for speech recognition

  • Author

    Sagayama, Shigeki

  • Author_Institution
    Human Interface Lab., NTT, Tokyo, Japan
  • fYear
    1989
  • fDate
    23-26 May 1989
  • Firstpage
    397
  • Abstract
    A general principle is proposed to solve problems in context-dependent phoneme segment (or subword unit) based speech recognition, namely, how to choose the set of units and how to estimate the context effect missing in the training data. A phoneme environment clustering algorithm, which automatically selects an optimal set of allophones and estimates the missing context, is presented. This algorithm additionally gives the means to analyze coarticulation effects automatically and quantitatively. The problem is formulated as a clustering technique in phoneme environment space to approximate the mapping function from phoneme environment space to phoneme pattern space by a limited number of centroid patterns based on a distortion measure defined on the phoneme pattern space. The algorithm is tested for phoneme recognition and word recognition, and results are discussed
  • Keywords
    speech recognition; allophones; centroid patterns; coarticulation effects; context-dependent phoneme segment; distortion measure; mapping function; phoneme environment clustering algorithm; phoneme environment space; phoneme pattern space; phoneme recognition; speech recognition; subword unit; word recognition; Algorithm design and analysis; Clustering algorithms; Distortion measurement; Environmental factors; Extraterrestrial measurements; Humans; Laboratories; Speech analysis; Speech recognition; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
  • Conference_Location
    Glasgow
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.1989.266449
  • Filename
    266449