• DocumentCode
    312167
  • Title

    Speech recognition based on acoustically derived segment units

  • Author

    Fukada, Toshiaki ; Bacchiani, Michiel ; Paliwal, Kuldip K. ; Sagisaka, Yoshinori

  • Author_Institution
    ATR Interpreting Telecommun. Res. Labs., Japan
  • Volume
    2
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    1077
  • Abstract
    The paper describes a new method of word model generation based on acoustically derived segment units (henceforth ASUs). An ASU-based approach has the advantages of growing out of human pre-determined phonemes and of consistently generating acoustic units by using the maximum likelihood (ML) criterion. The former advantage is effective when it is difficult to map acoustics to a phone such as with highly co-articulated spontaneous speech. In order to implement an ASU-based modeling approach in a speech recognition system, one must first solve two points: (1) how does one design an inventory of acoustically-derived segmental units and (2) how does one model the pronunciations of lexical entries in terms of the ASUs. As for the second question, the authors propose an ASU-based word model generation method by composing the ASU statistics, that is, their means, variances and durations. The effectiveness of the proposed method is shown through spontaneous word recognition experiments
  • Keywords
    maximum likelihood estimation; speech recognition; statistics; acoustically derived segment units; durations; highly co-articulated spontaneous speech; human pre-determined phonemes; lexical entry pronunciation; maximum likelihood criterion; speech recognition; spontaneous word recognition experiments; statistics; variances; word model generation; Acoustical engineering; Acoustics; Character recognition; Context modeling; Databases; Humans; Speech recognition; Statistical distributions; Stochastic processes; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607792
  • Filename
    607792