• DocumentCode
    284614
  • Title

    A segment-based speaker adaptation neural network applied to continuous speech recognition

  • Author

    Fukuzawa, Keiji ; Komori, Yasuhiro ; Sawai, Hidefumi ; Sugiyama, Masahide

  • Author_Institution
    ATR Interpreting Telephony Res. Lab., Kyoto, Japan
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    433
  • Abstract
    The authors describe a speaker adaptation technique using segment-based neural-mapping applied to continuous speech recognition. The adaptation neural network has a time-shifted subconnection architecture to maintain the temporal structure in the acoustic segment and to decrease the amount of speech data for training. The effectiveness of this network has been reported for phoneme recognition. The speaker adaptation network is combined with a TDNN-LR continuous speech recognizer, and is evaluated in word and phrase recognition experiments with several speakers. The results of 500-word recognition experiments show that the recognition rate by segment-based adaptation is 92.2%, 28.8% higher than the rate without adaptation. The results of 278 phrase recognition experiments show that the recognition rate by segment-based adaptation is 57.4%, 27.7% higher than the rate without adaptation
  • Keywords
    neural nets; speech recognition; TDNN-LR continuous speech recognizer; acoustic segment; backpropagation; phoneme recognition; phrase recognition; recognition rate; segment-based neural-mapping; speaker adaptation technique; speech data; temporal structure; time-shifted subconnection architecture; training; word recognition; Feedforward neural networks; Feedforward systems; Laboratories; Loudspeakers; Neural networks; Research and development; Speech analysis; Speech recognition; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225879
  • Filename
    225879