DocumentCode :
3101764
Title :
Speech acts classification of Farsi texts
Author :
Soltani Panah, A. ; Homayounpour, Mohammad Mehdi
Author_Institution :
Comput. Eng. & IT Dept., Amirkabir Univ. of Technol., Tehran, Iran
fYear :
2008
fDate :
27-28 Aug. 2008
Firstpage :
539
Lastpage :
542
Abstract :
The objective of this paper is to design a system for classification of Farsi speech acts. The driving vision for this work is to provide intelligent text to Farsi speech (TTS) systems that are sensitive to the speech acts of the input texts and can pronounce the corresponding intonation correctly. Seven speech acts were considered and 3 classification methods including of (1) Naive Bayes, (2) K-Nearest Neighbors (KNN), and (3) Tree learner were used. The performance of speech act classification was evaluated using these methods including 10-Fold Cross-Validation, 70-30 Random Sampling and Area under ROC. KNN with an accuracy of 72% was shown to be the best classifier for classification of Farsi speech acts. It was observed that the amount of labeled training data has an important role in the classification performance.
Keywords :
natural languages; signal classification; speech processing; trees (mathematics); Farsi texts; K-nearest neighbors; naive Bayes; speech acts classification; text-to-speech systems; tree learner; Accuracy; Biological system modeling; Speech; Speech processing; Taxonomy; Testing; Training; Farsi text; Speech act; Text To Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Telecommunications, 2008. IST 2008. International Symposium on
Conference_Location :
Tehran
Print_ISBN :
978-1-4244-2750-5
Type :
conf
DOI :
10.1109/ISTEL.2008.4651360
Filename :
4651360
Link To Document :
بازگشت