• DocumentCode
    417188
  • Title

    Improving phoneme recognition of telephone quality speech

  • Author

    Huang, Qiang ; Cox, Stephen

  • Author_Institution
    Sch. of Comput. Sci., East Anglia Univ., Norwich, UK
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    There are some speech understanding applications in which training transcriptions are unavailable, and hence the vocabulary is unknown, but the task is to recognise key words and phrases within an utterance rather than to attempt a complete, accurate transcription. An example of such a task is call-routing, when transcriptions of training utterances (which are very expensive to produce) are unavailable. In such cases, phoneme rather than word recognition is appropriate. However, phoneme recognition of spontaneous speech spoken by a large multi-accent population over telephone connections is very inaccurate. To improve accuracy, we describe a technique in which we segment the waveform into subword-like units and use clustering and an iteratively refined language model to correct the errors in the recognised phonemes. The method was shown to work well on telephone quality spontaneous speech, raising the phoneme accuracy from 28.1% after the first iteration to 47.3% after three iterations.
  • Keywords
    iterative methods; natural language interfaces; speech processing; speech recognition; speech-based user interfaces; call-routing; clustering; iterative language model; phoneme recognition; speech recognition; speech understanding applications; spontaneous speech; telephone quality speech; training transcriptions; waveform segmentation; word recognition; Acoustic applications; Decoding; Error correction; Insurance; Loudspeakers; Routing; Speech processing; Speech recognition; Telephony; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326018
  • Filename
    1326018