DocumentCode :
294678
Title :
Recognition of unvoiced stops from their time-frequency representation
Author :
Rangoussi, Maria ; Delopoulos, Anastasios
Author_Institution :
Dept. of Electr. Eng., Nat. Tech. Univ. of Athens, Greece
Volume :
1
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
792
Abstract :
The recognition of the unvoiced stop sounds /k/, /p/ and /t/ in a speech signal is an interesting problem, due to the irregular, aperiodic, nonstationary nature of the corresponding signals. Their spotting is much easier, however, thanks to the characteristic silence interval they include. Classification of these three phonemes is proposed, based on the patterns extracted from their time-frequency representation. This is possible because the different articulation points of /k/, /p/ and /t/ are reflected into distinct patterns of evolution of their spectral contents with time. These patterns can be obtained by suitable time-frequency analysis, and then used for classification. The Wigner distribution of the unvoiced stop signals, appropriately smoothed and subsampled, is proposed as the basic classification pattern. Finally, for the classification step, the learning vector quantization (LVQ) classifier of Kohonen (1988) is employed on a set of unvoiced stop signals extracted from the TIMIT speech database, with encouraging results under context- and speaker-independent testing conditions
Keywords :
Wigner distribution; learning (artificial intelligence); pattern classification; self-organising feature maps; signal representation; signal sampling; smoothing methods; spectral analysis; speech processing; speech recognition; time-frequency analysis; vector quantisation; Kohonen LVQ classifier; TIMIT speech database; Wigner distribution; aperiodic signals; articulation points; classification pattern; context-independent testing; irregular signals; learning vector quantization classifier; nonstationary signals; phonemes classification; silence interval; smoothed signal; speaker-independent testing conditions; spectral evolution patterns; speech signal; subsampled signal; time-frequency analysis; time-frequency representation; unvoiced stop signals; unvoiced stops recognition; Acoustic testing; Computer science; Databases; Detectors; Explosions; Feature extraction; Pattern analysis; Speech recognition; Time frequency analysis; Vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479813
Filename :
479813
Link To Document :
بازگشت