DocumentCode :
542220
Title :
Robust phoneme discrimination using acoustic waveforms
Author :
Cvetkovic, Zoran ; Beferull-Lozano, Baltasar ; Buja, Andreas
Author_Institution :
AT&T Shannon Laboratory, Florham Park, New Jersey, USA
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
We present a study of the separability of acoustic waveforms of speech at the phoneme level. The analyzed data consist of 64 ms segments of acoustic waveforms of individual phonemes from the TIMIT database, sampled at 16 kHz. For each phoneme, by means of principal component analysis, we identify subspaces that contain a given proportion of the total energy of the available waveforms in the time domain and also in the spectral-magnitude domain. To assess the separation between phonemes in the two domains, we perform pairwise classification of phonemes on clean data and on data immersed in additive white Gaussian noise at signal-to-noise ratios down to 0 dB. While classification based on spectral magnitudes exhibits high sensitivity to additive noise, time-domain classification proves to be very robust.
Keywords :
Nickel; Robustness; Signal to noise ratio;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743718
Filename :
5743718