• DocumentCode
    3783763
  • Title

    Innovative approaches for large vocabulary name recognition

  • Author

    Yuqing Gao;B. Ramabhadran;J. Chen;H. Erdogan;M. Picheny

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    1
  • fYear
    2001
  • fDate
    6/23/1905 12:00:00 AM
  • Firstpage
    53
  • Abstract
    Automatic name dialing is a practical and interesting application of speech recognition on telephony systems. The IBM name recognition system is a large vocabulary, speaker independent system currently in use for reaching IBM employees in the United States. We present some innovative algorithms that improve name recognition accuracy. Unlike transcription tasks, such as the Switchboard task, recognition of names poses a variety of different problems. Several of these problems arise from the fact that foreign names are hard to pronounce for speakers who are not familiar with the names and that there are no standardized methods for pronouncing proper names. Noise robustness is another very important factor as these calls are typically made in noisy environments, such as from a car, cafeteria, airport, etc. and over different kinds of cellular and land-line telephone channels. We have performed a systematic analysis of the speech recognition errors and tackled the issues separately with techniques ranging from weighted speaker clustering, massive adaptation, rapid and unsupervised adaptation methods to pronunciation modeling methods. We find that the decoding accuracy can be improved significantly (28% relative) in this manner.
  • Keywords
    "Vocabulary","Speech recognition","Telephony","Working environment noise","Hidden Markov models","Speech analysis","Clustering algorithms","Loudspeakers","Speech coding","Testing"
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP ´01). 2001 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.940765
  • Filename
    940765