Title :
Using temporal information for improving articulatory-acoustic feature classification
Author :
Schuppler, Barbara ; Van Doremalen, Joost ; Scharenborg, Odette ; Cranen, Bert ; Boves, Lou
Author_Institution :
Centre for Language & Speech Technol., Radboud Univ. Nijmegen, Nijmegen, Netherlands
fDate :
Nov. 13 2009-Dec. 17 2009
Abstract :
This paper combines acoustic features with a high temporal and a high frequency resolution to reliably classify articulatory events of short duration, such as bursts in plosives. SVM classification experiments on TIMIT and SV Articulatory showed that articulatory-acoustic features (AFs) based on a combination of MFCCs derived from a long window of 25 ms and a short window of 5 ms that are both shifted with 2.5 ms steps (Both) outperform standard MFCCs derived with a window of 25 ms and a shift of 10 ms (Baseline). Finally, comparison of the TIMIT and SV Articulatory results showed that for classifiers trained on data that allows for asynchronously changing AFs (SV Articulatory) the improvement from Baseline to Both is larger than for classifiers trained on data where AFs change simultaneously with the phone boundaries (TIMIT).
Keywords :
pattern classification; speech; speech processing; SV Articulatory; SVM classification experiments; TIMIT; articulatory acoustic feature classification; asynchronously changing AFs; classifiers; high frequency resolution; phone boundaries; standard MFCCs; temporal information; Acoustic testing; Automatic speech recognition; Frequency; Labeling; Natural languages; Paper technology; Speech analysis; Speech recognition; Support vector machine classification; Support vector machines;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
DOI :
10.1109/ASRU.2009.5373314