• DocumentCode
    697783
  • Title

    Robust phoneme classification: Exploiting the adaptability of acoustic waveform models

  • Author

    Ager, Matthew ; Cvetkovic, Zoran ; Sollich, Peter

  • Author_Institution
    Dept. of Math., King´s Coll. London, London, UK
  • fYear
    2009
  • fDate
    24-28 Aug. 2009
  • Firstpage
    530
  • Lastpage
    534
  • Abstract
    The robustness of classification of isolated phoneme segments using generative classifiers is investigated for the acoustic waveform, MFCC and PLP speech representations. Gaussian mixture models with diagonal covariance matrices are used followed by maximum likelihood classification. The performance of noise adapted acoustic waveform models is compared with PLP and MFCC models that were adapted using noisy training set feature standardisation. In the presence of additive noise, acoustic waveforms have significantly lower classification error. Even for the unrealistic case where PLP and MFCC classifiers are trained and tested in exactly matched noise conditions acoustic waveform classifiers still outperform them. In both cases the acoustic waveform classifiers are trained explicitly only on quiet data and then modified by a simple transformation to account for the noise.
  • Keywords
    Gaussian processes; acoustic noise; acoustic signal processing; cepstral analysis; mixture models; prediction theory; signal classification; signal representation; speech processing; Gaussian mixture models; MFCC classifiers; MFCC models; PLP speech representations; acoustic waveform classifiers; acoustic waveform models; additive noise; classification error; diagonal covariance matrices; generative classifiers; isolated phoneme segments; maximum likelihood classification; noisy training set feature standardisation; phoneme classification; Abstracts; Adaptation models; Discrete cosine transforms; Indexes; Mel frequency cepstral coefficient; Robustness; Signal to noise ratio; Acoustic Waveforms; Generative Classification; Phoneme; Robustness; Speech Recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2009 17th European
  • Conference_Location
    Glasgow
  • Print_ISBN
    978-161-7388-76-7
  • Type

    conf

  • Filename
    7077355