Title :
Analysis of acoustic-phonetic variations in fluent speech using TIMIT
Author :
Sun, Don X. ; Deng, Li
Author_Institution :
Dept. of Appl. Math. & Stat., State Univ. of New York, Stony Brook, NY, USA
Abstract :
We propose a hierarchically structured analysis of variance (ANOVA) method to quantify the contributions of various identifiable factors to the overall acoustic variability in the fluent speech data of TIMIT, represented as mel-frequency cepstral coefficients. The analysis shows that the largest share of acoustic variability in the TIMIT data is explained by differences among distinct TIMIT phonetic labels, followed by differences in phonetic context given a fixed phonetic label. The variability among sequential sub-segments within each TIMIT-defined phonetic segment is found to be significantly greater than that attributable to the gender, dialect region, and speaker factors. These results offer insight into the roles that the various components of speech recognizers play in determining final speech recognition performance.
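The idea of splitting total variability across nested factors can be illustrated with a minimal sketch. This is not the authors' procedure: the factor ordering (phonetic label, then context, then sub-segment), the function name nested_anova, and the synthetic data are all assumptions made for illustration, and a single scalar stands in for one cepstral coefficient.

```python
import numpy as np

def nested_anova(values, factors):
    """Split the total sum of squares of `values` across nested factors.

    values  : 1-D array holding one feature (e.g. a single cepstral coefficient).
    factors : list of 1-D label arrays, outermost factor first (assumed ordering).
    Returns (list of sums of squares, one per factor; residual sum of squares).
    """
    values = np.asarray(values, dtype=float)
    n = len(values)
    prev_fit = np.full(n, values.mean())  # level-0 fit: the grand mean
    keys = [()] * n
    sums_of_squares = []
    for labels in factors:
        # Nesting: a group at this level is identified by all labels so far.
        keys = [k + (int(lab),) for k, lab in zip(keys, labels)]
        groups = {}
        for i, k in enumerate(keys):
            groups.setdefault(k, []).append(i)
        fit = np.empty(n)
        for idx in groups.values():
            fit[idx] = values[idx].mean()  # group mean as the fitted value
        # Variability added by this factor beyond the coarser level.
        sums_of_squares.append(float(np.sum((fit - prev_fit) ** 2)))
        prev_fit = fit
    residual = float(np.sum((values - prev_fit) ** 2))
    return sums_of_squares, residual

# Synthetic stand-in data (hypothetical, not TIMIT).
rng = np.random.default_rng(0)
phone = rng.integers(0, 5, size=600)
context = rng.integers(0, 3, size=600)
subseg = rng.integers(0, 3, size=600)
values = 3.0 * phone + 1.0 * context + 0.5 * subseg + rng.normal(size=600)

ss, resid = nested_anova(values, [phone, context, subseg])
total = float(np.sum((values - values.mean()) ** 2))
shares = [s / total for s in ss] + [resid / total]
print(shares)  # fraction of total variability per level, then residual
```

Because each level refines the grouping of the level above it, the per-level sums of squares and the residual add up to the total sum of squares, which is what allows the contributions to be compared as shares of one whole.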
Keywords :
acoustic signal processing; cepstral analysis; speech processing; speech recognition; TIMIT data; acoustic variability; acoustic-phonetic variations analysis; dialect region; fluent speech; gender; hierarchically structured analysis; mel-frequency cepstral coefficients; phonetic context difference; phonetic label; phonetic labels; phonetic segment; sequential sub-segments; speaker factors; variance analysis; Acoustical engineering; Analysis of variance; Databases; Loudspeakers; Mathematics; Speech analysis; Statistics; Sun
Conference_Title :
1995 International Conference on Acoustics, Speech, and Signal Processing (ICASSP-95)
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479399