• DocumentCode
    2020715
  • Title

    Multi-speaker/speaker-independent architectures for the multi-state time delay neural network

  • Author

    Hild, Hermann ; Waibel, Alex

  • Author_Institution
    Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    2
  • fYear
    1993
  • fDate
    27-30 April 1993
  • Firstpage
    255
  • Abstract
    The authors present an improved multistate time delay neural network (MS-TDNN) for speaker-independent, connected letter recognition which outperforms an HMM (hidden Markov model) based system (SPHINX) and previous MS-TDNNs. They also explore new network architectures with internal speaker models. Four different architectures characterized by an increasing number of speaker-specific parameters are introduced. The speaker-specific parameters can be adjusted by automatic speaker identification or by speaker adaptation, allowing for tuning-in to a new speaker. Both methods lead to significant improvements over the straightforward speaker-independent architecture. Even unsupervised tuning-in (speech is unlabeled) works well.<>
  • Keywords
    neural nets; performance evaluation; speech recognition equipment; unsupervised learning; automatic speaker identification; connected letter recognition; internal speaker models; multi-state time delay neural network; speaker adaptation; speaker-independent architectures; speaker-specific parameters; unsupervised tuning-in;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
  • Conference_Location
    Minneapolis, MN, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1993.319284
  • Filename
    319284