• DocumentCode
    284577
  • Title

    Connectionist word-level classification in speech recognition

  • Author

    Haffner, Patrick

  • Author_Institution
    CNET, Lannion, France
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    621
  • Abstract
    MS-TDNN (multistate time delay neural networks), a connectionist architecture with embedded time alignment, was proposed recently. It makes word level classification possible and efficient on speech recognition tasks. Connectionist classification at the word level (rather than the usual maximum likelihood estimation) has not been commonly used in speech recognition, and raises issues related to proper temporal modeling and global discriminant training applied at the word level. The author shows how MS-TDNNs deal with these issues in a simple and efficient way, and achieve state of the art performance on several tasks representative of different problems in speaker-independent speech recognition: telephone digits and connected spelled letters
  • Keywords
    delays; learning (artificial intelligence); neural nets; speech recognition; MS-TDNN; connected spelled letters; connectionist word-level classification; embedded time alignment; global discriminant training; multistate time delay neural networks; speech recognition; telephone digits; temporal modeling; training; Delay effects; Hidden Markov models; Intelligent networks; Maximum likelihood estimation; Neural networks; Power system modeling; Power system reliability; Speech recognition; Telephony; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225832
  • Filename
    225832