• DocumentCode
    353504
  • Title

    Discriminative resolution enhancement in acoustic modelling

  • Author

    Duchateau, Jacques ; Demuynck, Kris ; Wambacq, Patrick

  • Author_Institution
    ESAT, Katholieke Univ., Leuven, Heverlee, Belgium
  • Volume
    3
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    1245
  • Abstract
    The accuracy of the acoustic models in large vocabulary recognition systems can be improved by increasing the resolution in the acoustic feature space. This can be obtained by increasing the number of Gaussian densities in the models by splitting of the Gaussians. This paper proposes a novel algorithm for this splitting operation. It is based on the phonetic decision tree used for the state tying in context dependent modelling. The advantage of the method is that it improves the capability of the acoustic models to discriminate between the different tied states. The proposed splitting algorithm was evaluated on the Wall Street Journal recognition task. Comparison with a commonly used splitting algorithm clearly shows that our method can provide smaller (thus faster) acoustic models and results in lower error rates
  • Keywords
    Gaussian processes; decision trees; speech enhancement; speech recognition; Gaussian densities; acoustic feature space; acoustic modelling; context dependent modelling; discriminative resolution enhancement; error rates; large vocabulary recognition systems; phonetic decision tree; state tying; Context modeling; Decision trees; Error analysis; Mutual information; Speech recognition; Tail; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.861801
  • Filename
    861801