DocumentCode
542220
Title
Robust phoneme discrimination using acoustic waveforms
Author
Cvetkovic, Zran ; Beferull-Lozano, Baltasar ; Buja, Andreas
Author_Institution
AT&T Shannon Laboratory, Florham Park, New Jersey, USA
Volume
1
fYear
2002
fDate
13-17 May 2002
Abstract
We present a study of separability of acoustic waveforms of speech at phoneme level. The analyzed data consist of 64ms segments of acoustic waveforms of individual phonemes from TIMIT data base, sampled at 16kHz. For each phoneme, by means of principal component analysis, we identify subspaces which contain a given proportion of the total energy of the available waveforms in time-domain, and also in spectral-magnitude domain. In order to assess separation between phonemes in the two domains, we perform pairwise classification of phonemes on clean data and on data immersed in white additive Gaussian noise up to 0dB signal to noise ratio. While the classification based on spectral magnitudes exhibits high sensitivity to additive noise, the time-domain classification proves to be very robust.
Keywords
Nickel; Robustness; Signal to noise ratio;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location
Orlando, FL, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.2002.5743718
Filename
5743718
Link To Document