DocumentCode
2972014
Title
Using temporal information for improving articulatory-acoustic feature classification
Author
Schuppler, Barbara ; Van Doremalen, Joost ; Scharenborg, Odette ; Cranen, Bert ; Boves, Lou
Author_Institution
Centre for Language & Speech Technol., Radboud Univ. Nijmegen, Nijmegen, Netherlands
fYear
2009
fDate
Nov. 13 2009-Dec. 17 2009
Firstpage
70
Lastpage
75
Abstract
This paper combines acoustic features with a high temporal and a high frequency resolution to reliably classify articulatory events of short duration, such as bursts in plosives. SVM classification experiments on TIMIT and SV Articulatory showed that articulatory-acoustic features (AFs) based on a combination of MFCCs derived from a long window of 25 ms and a short window of 5 ms that are both shifted with 2.5 ms steps (Both) outperform standard MFCCs derived with a window of 25 ms and a shift of 10 ms (Baseline). Finally, comparison of the TIMIT and SV Articulatory results showed that for classifiers trained on data that allows for asynchronously changing AFs (SV Articulatory) the improvement from Baseline to Both is larger than for classifiers trained on data where AFs change simultaneously with the phone boundaries (TIMIT).
Keywords
pattern classification; speech; speech processing; SV Articulatory; SVM classification experiments; TIMIT; articulatory acoustic feature classification; asynchronously changing AFs; classifiers; high frequency resolution; phone boundaries; standard MFCCs; temporal information; Acoustic testing; Automatic speech recognition; Frequency; Labeling; Natural languages; Paper technology; Speech analysis; Speech recognition; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location
Merano
Print_ISBN
978-1-4244-5478-5
Electronic_ISBN
978-1-4244-5479-2
Type
conf
DOI
10.1109/ASRU.2009.5373314
Filename
5373314
Link To Document