مرکز منطقه ای اطلاع رساني علوم و فناوري - Recognition of unvoiced stops from their time-frequency representation

DocumentCode :

294678

Title :

Recognition of unvoiced stops from their time-frequency representation

Author :

Rangoussi, Maria ; Delopoulos, Anastasios

Author_Institution :

Dept. of Electr. Eng., Nat. Tech. Univ. of Athens, Greece

Volume :

fYear :

1995

fDate :

9-12 May 1995

Firstpage :

792

Abstract :

The recognition of the unvoiced stop sounds /k/, /p/ and /t/ in a speech signal is an interesting problem, due to the irregular, aperiodic, nonstationary nature of the corresponding signals. Their spotting is much easier, however, thanks to the characteristic silence interval they include. Classification of these three phonemes is proposed, based on the patterns extracted from their time-frequency representation. This is possible because the different articulation points of /k/, /p/ and /t/ are reflected into distinct patterns of evolution of their spectral contents with time. These patterns can be obtained by suitable time-frequency analysis, and then used for classification. The Wigner distribution of the unvoiced stop signals, appropriately smoothed and subsampled, is proposed as the basic classification pattern. Finally, for the classification step, the learning vector quantization (LVQ) classifier of Kohonen (1988) is employed on a set of unvoiced stop signals extracted from the TIMIT speech database, with encouraging results under context- and speaker-independent testing conditions

Keywords :

Wigner distribution; learning (artificial intelligence); pattern classification; self-organising feature maps; signal representation; signal sampling; smoothing methods; spectral analysis; speech processing; speech recognition; time-frequency analysis; vector quantisation; Kohonen LVQ classifier; TIMIT speech database; Wigner distribution; aperiodic signals; articulation points; classification pattern; context-independent testing; irregular signals; learning vector quantization classifier; nonstationary signals; phonemes classification; silence interval; smoothed signal; speaker-independent testing conditions; spectral evolution patterns; speech signal; subsampled signal; time-frequency analysis; time-frequency representation; unvoiced stop signals; unvoiced stops recognition; Acoustic testing; Computer science; Databases; Detectors; Explosions; Feature extraction; Pattern analysis; Speech recognition; Time frequency analysis; Vector quantization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location :

Detroit, MI

ISSN :

1520-6149

Print_ISBN :

0-7803-2431-5

Type :

conf

DOI :

10.1109/ICASSP.1995.479813

Filename :

479813

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=294678