• DocumentCode
    336745
  • Title

    Telephone speech recognition using neural networks and hidden Markov models

  • Author

    Yuk, DongSuk ; Flanagan, James

  • Author_Institution
    CAIP Center, Rutgers Univ., Piscataway, NJ, USA
  • Volume
    1
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    157
  • Abstract
    The performance of well-trained speech recognizers using high quality full bandwidth speech data is usually degraded when used in real world environments. In particular, telephone speech recognition is extremely difficult due to the limited bandwidth of the transmission channels. In this paper, neural network based adaptation methods are applied to telephone speech recognition and a new unsupervised model adaptation method is proposed. The advantage of the neural network based approach is that the retraining of speech recognizers for telephone speech is avoided. Furthermore, because the multi-layer neural network is able to compute nonlinear functions, it can accommodate for the non-linear mapping between full bandwidth speech and telephone speech. The new unsupervised model adaptation method does not require transcriptions and can be used with the neural networks. Experimental results on TIMIT/NTIMIT corpora show that the performance of the proposed methods is comparable to that of recognizers retained on telephone speech
  • Keywords
    hidden Markov models; neural nets; nonlinear functions; speech intelligibility; speech recognition; telephony; unsupervised learning; TIMIT/NTIMIT corpora; experimental results; full bandwidth speech data; hidden Markov models; high quality full bandwidth speech data; multilayer neural network; nonlinear functions; nonlinear mapping; performance; real world environments; telephone speech recognition; transmission channels; unsupervised model adaptation method; Adaptation model; Bandwidth; Degradation; Hidden Markov models; Neural networks; Noise reduction; Speech enhancement; Speech recognition; Telephony; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.758086
  • Filename
    758086