• DocumentCode
    3229793
  • Title

    Vocabulary learning and environment normalization in vocabulary-independent speech recognition

  • Author

    Hon, Hsiao-Wuen ; Lee, Kai-Fu

  • Author_Institution
    Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    485
  • Abstract
    The authors discuss adaptation issues of vocabulary-independent (VI) systems. Just as with speaker-adaptation in a speaker-independent system, two vocabulary learning algorithms are implemented in order to tailor the VI subword models to the target vocabulary. The first algorithm generates vocabulary-adapted clustering decision trees by focusing on relevant allophones during tree generation and reduces the VI error rate by 9%. The second algorithm, vocabulary-bias training, gives the relevant allophones more prominence by assigning more weight to them during Baum-Welch training of the generalized allophonic models and reduces the VI error rate by 15%. Finally, in order to overcome the degradation caused by the different acoustic environments used for VI training and testing, codebook-dependent cepstral normalization (CDCN) and interpolated SNR-dependent cepstral normalization (ISDCN) originally designed for microphone adaptation are incorporated into the VI system, and both reduce the degradation of VI cross-environment recognition by 50%
  • Keywords
    learning (artificial intelligence); spectral analysis; speech recognition; trees (mathematics); Baum-Welch training; acoustic environments; allophones; codebook-dependent cepstral normalization; cross-environment recognition; degradation; environment normalization; error rate; interpolated SNR-dependent cepstral normalization; subword models; target vocabulary; tree generation; vocabulary learning algorithms; vocabulary-adapted clustering decision trees; vocabulary-bias training; vocabulary-independent speech recognition; Clustering algorithms; Decision trees; Degradation; Error analysis; Management training; Microphones; Resource management; Robustness; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225866
  • Filename
    225866