• DocumentCode
    2020730
  • Title

    Performance through consistency: connectionist large vocabulary continuous speech recognition

  • Author

    Tebelskis, Joe

  • Author_Institution
    Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    2
  • fYear
    1993
  • fDate
    27-30 April 1993
  • Firstpage
    259
  • Abstract
    It is shown that the performance of a connectionist speech recognition system can be improved by resolving inconsistencies in its design. Specifically, by introducing word level training into a TDNN (time-delay neural network) phoneme classifier (thus defining a multistate time-delay neural network or MS-TDNN), the training and testing criteria become consistent, enhancing the system´s word recognition accuracy. The author applied this MS-TDNN architecture to the task of large vocabulary continuous speech recognition, and found that it outperforms all other systems that have been evaluated on the CMU (Carnegie Mellon University) Conference Registration database. In addition, preliminary results suggest that the MS-TDNN may perform well on the large vocabulary Resource Management database, using a relatively small number of free parameters.<>
  • Keywords
    learning (artificial intelligence); neural nets; performance evaluation; speech recognition equipment; vocabulary; Conference Registration database; Resource Management database; connectionist speech recognition system; consistency; large vocabulary continuous speech recognition; performance; phoneme classifier; testing criteria; time-delay neural network; word level training; word recognition accuracy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
  • Conference_Location
    Minneapolis, MN, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1993.319285
  • Filename
    319285