DocumentCode :
591915
Title :
Automatic classification of unequal lexical stress patterns using machine learning algorithms
Author :
Shahin, Moustafa A. ; Ahmed, Beena ; Ballard, K.J.
Author_Institution :
Electr. & Comput. Eng. Program, Texas A&M Univ. at Qatar, Doha, Qatar
fYear :
2012
fDate :
2-5 Dec. 2012
Firstpage :
388
Lastpage :
391
Abstract :
Technology based speech therapy systems are severely handicapped due to the absence of accurate prosodic event identification algorithms. This paper introduces an automatic method for the classification of strong-weak (SW) and weak-strong (WS) stress patterns in children speech with American English accent, for use in the assessment of the speech dysprosody. We investigate the ability of two sets of features used to train classifiers to identify the variation in lexical stress between two consecutive syllables. The first set consists of traditional features derived from measurements of pitch, intensity and duration, whereas the second set consists of energies of different filter banks. Three different classifiers were used in the experiments: an Artificial Neural Network (ANN) classifier with a single hidden layer, Support Vector Machine (SVM) classifier with both linear and Gaussian kernels and the Maximum Entropy modeling (MaxEnt). these features. Best results were obtained using an ANN classifier and a combination of the two sets of features. The system correctly classified 94% of the SW stress patterns and 76% of the WS stress patterns.
Keywords :
Gaussian distribution; channel bank filters; learning (artificial intelligence); maximum entropy methods; natural languages; neural nets; pattern classification; signal classification; speech processing; support vector machines; ANN classifier; American English accent; Gaussian kernels; MaxEnt; SVM classifier; SW stress patterns; WS stress patterns; artificial neural network classifier; automatic classification; children speech; classifier training; consecutive syllables; filter banks; linear kernels; machine learning algorithms; maximum entropy modeling; pitch measurement; prosodic event identification algorithms; single hidden layer; speech dysprosody; strong-weak stress patterns; support vector machine classifier; technology-based speech therapy systems; unequal lexical stress patterns; weak-strong stress patterns; Accuracy; Acoustics; Artificial neural networks; Filter banks; Speech; Stress; Support vector machines; automatic assessment; lexical stress; prosody;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2012 IEEE
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4673-5125-6
Electronic_ISBN :
978-1-4673-5124-9
Type :
conf
DOI :
10.1109/SLT.2012.6424255
Filename :
6424255
Link To Document :
بازگشت