Title :
Multi-lingual label alignment using acoustic-phonetic features derived by neural-network technique
Author :
Dalsgaard, Paul ; Andersen, Ove ; Barry, William
Author_Institution :
Speech Technol. Centre, Aalborg Univ., Denmark
Abstract :
In previous work on label alignment, encouraging results were obtained using selected acoustic-phonetic features to model the individuals speech phonemes. Selection was based on minimal covariance between features on the one hand, and the inclusion of features underlying critical phonological opposition on the other. In the present work, principal component analysis was applied to give a number of uncorrelated output parameters which maximally exploit the discriminatory power of the features and are derived independently of the phonological functionality. Results of label alignment on three different European languages, Danish, English, and Italian, using different numbers of principal parameters show that the accuracy with ten parameters is at least as good as with 15 manually selected features. The best result is found for British English, which has 78% of its phoneme transition boundaries positioned within ±20 ms of manually placed reference boundaries
Keywords :
acoustic signal processing; neural nets; speech analysis and processing; speech recognition; Danish; English; European languages; Italian; acoustic-phonetic features; features discrimination; multilingual label alignment; neural network; phoneme transition boundaries; principal component analysis; reference boundaries; speech phonemes; speech recognition; uncorrelated output parameters; Acoustic testing; Cepstral analysis; Educational institutions; Europe; Natural languages; Neural networks; Principal component analysis; Spatial databases; Speech analysis; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-7803-0003-3
DOI :
10.1109/ICASSP.1991.150311