Title :
Robust phoneme classification: Exploiting the adaptability of acoustic waveform models
Author :
Ager, Matthew ; Cvetkovic, Zoran ; Sollich, Peter
Author_Institution :
Dept. of Math., King´s Coll. London, London, UK
Abstract :
The robustness of classification of isolated phoneme segments using generative classifiers is investigated for the acoustic waveform, MFCC and PLP speech representations. Gaussian mixture models with diagonal covariance matrices are used followed by maximum likelihood classification. The performance of noise adapted acoustic waveform models is compared with PLP and MFCC models that were adapted using noisy training set feature standardisation. In the presence of additive noise, acoustic waveforms have significantly lower classification error. Even for the unrealistic case where PLP and MFCC classifiers are trained and tested in exactly matched noise conditions acoustic waveform classifiers still outperform them. In both cases the acoustic waveform classifiers are trained explicitly only on quiet data and then modified by a simple transformation to account for the noise.
Keywords :
Gaussian processes; acoustic noise; acoustic signal processing; cepstral analysis; mixture models; prediction theory; signal classification; signal representation; speech processing; Gaussian mixture models; MFCC classifiers; MFCC models; PLP speech representations; acoustic waveform classifiers; acoustic waveform models; additive noise; classification error; diagonal covariance matrices; generative classifiers; isolated phoneme segments; maximum likelihood classification; noisy training set feature standardisation; phoneme classification; Abstracts; Adaptation models; Discrete cosine transforms; Indexes; Mel frequency cepstral coefficient; Robustness; Signal to noise ratio; Acoustic Waveforms; Generative Classification; Phoneme; Robustness; Speech Recognition;
Conference_Titel :
Signal Processing Conference, 2009 17th European
Conference_Location :
Glasgow
Print_ISBN :
978-161-7388-76-7