• DocumentCode
    3161885
  • Title

    Derivation of eigentriphones by weighted principal component analysis

  • Author

    Ko, Tom ; Mak, Brian

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Hong Kong Univ. of Sci. & Technol., Clear Water Bay, China
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4097
  • Lastpage
    4100
  • Abstract
    Last year we proposed a new acoustic modeling method called eigentriphones in which all triphones are distinct (with no tied states) so that they may be more discriminative. In our method, frequent triphones are used to derive an eigenbasis using PCA, and the infrequent triphones are then “adapted” as a linear combination of the eigenvectors which are also called eigentriphones. Although the eigentriphones method compares favorably with traditional tied-state triphones, the PCA procedure has two limitations: (1) only the frequent triphones are employed, and (2) they are considered “equal” even though some are more robust than the others. In this paper, weighted PCA is proposed to solve both problems so that all triphones-frequent and infrequent triphones-may contribute to the derivation of the eigentriphones, each at a different extent depending on its sample count. Experimental evaluation on the WSJ 5Kvocabulary speech recognition task shows that weighted PCA produces better models than simple PCA, and its performance is fairly independent of the number of eigentriphones once more than 20% of them are used. As a consequence, all triphones may be represented by fewer eigentriphones, resulting in a more compact model.
  • Keywords
    eigenvalues and eigenfunctions; principal component analysis; speech recognition; vocabulary; PCA; WSJ 5K-vocabulary speech recognition task; eigenbasis; eigentriphone derivation method; eigenvectors; frequent triphones; infrequent triphones; linear combination; weighted principal component analysis; Acoustics; Adaptation models; Hidden Markov models; Principal component analysis; Robustness; Speech recognition; Training; Eigentriphones; context-dependent acoustic modeling; eigenvoice adaptation; weighted PCA;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288819
  • Filename
    6288819