• DocumentCode
    2179204
  • Title

    Multilayer perceptron with sparse hidden outputs for phoneme recognition

  • Author

    Sivaram, G.S.V.S. ; Hermansky, Hynek

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5336
  • Lastpage
    5339
  • Abstract
    This paper introduces the sparse multilayer perceptron (SMLP) which learns the transformation from the inputs to the targets as in multilayer perceptron (MLP) while the outputs of one of the internal hidden layers is forced to be sparse. This is achieved by adding a sparse regularization term to the cross-entropy cost and learning the parameters of the network to minimize the joint cost. On the TIMIT phoneme recognition task, the SMLP based system trained using perceptual linear prediction (PLP) features performs better than the conventional MLP based system. Furthermore, their combination yields a phoneme error rate of 21.2%, a relative improvement of 6.2% over the baseline.
  • Keywords
    multilayer perceptrons; speech recognition; SMLP; TIMIT phoneme recognition task; sparse hidden outputs; sparse multilayer perceptron; Acoustics; Cost function; Hidden Markov models; Neurons; Speech; Speech processing; Training; Multilayer perceptron; machine learning; phoneme recognition; sparse features;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947563
  • Filename
    5947563