DocumentCode :
2361321
Title :
Modeling syllable duration in Indian languages using support vector machines
Author :
Rao, K. Sreenivasa ; Yegnanarayana, B.
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., Madras, India
fYear :
2005
fDate :
4-7 Jan. 2005
Firstpage :
258
Lastpage :
263
Abstract :
In this paper we propose support vector machines (SVM) for predicting the durations of the syllables in Indian languages. In this work SVM regression models are used for modeling the durations of the syllables and SVM classification models are used for categorizing the syllables based on duration. Analysis is performed on broadcast news data in the languages Hindi, Telugu and Tamil, in order to predict the duration of syllables in these languages using SVM regression model. The input to the SVM consists of a set of phonological, positional and contextual features extracted from the text. We also propose two-stage duration models for improving the prediction accuracy. From the studies it was found that about 86% of the syllable durations are predicted within 25% of the actual duration. The performance of the duration models are evaluated using objective measures such as mean absolute error (μ), standard deviation (σ) and correlation coefficient (γ).
Keywords :
natural languages; pattern classification; regression analysis; support vector machines; Indian languages; SVM classification; SVM regression model; correlation coefficient; mean absolute error; standard deviation; support vector machines; syllable duration modeling; Accuracy; Broadcasting; Data mining; Feature extraction; Measurement standards; Natural languages; Performance analysis; Predictive models; Support vector machine classification; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Sensing and Information Processing, 2005. Proceedings of 2005 International Conference on
Print_ISBN :
0-7803-8840-2
Type :
conf
DOI :
10.1109/ICISIP.2005.1529458
Filename :
1529458
Link To Document :
بازگشت