DocumentCode :
421038
Title :
A new classification sentence technique using intension expressions
Author :
Morita, Kazuhiro ; Atlam, Elsayed ; Fuketa, Masao ; Kadoya, Yuki ; Sumitomo, Tom ; Aoe, Jun-Ichi
Author_Institution :
Dept. of Inf. Sci. & Intelligent Syst., Tokushima Univ., Japan
Volume :
1
fYear :
2004
fDate :
28 June-1 July 2004
Firstpage :
98
Abstract :
Although there are many text classification techniques depending on vector spaces, it is difficult to detect the meaning, which are relating to the user´s intension (complaint, encouragement, request, invitation, etc.). The intension discussed in This work is very useful for understanding focus points in conversation. This paper presents a method of determining the speaker´s intention for sentences in conversation. The intension association expressions are introduced, and the formal rule descriptions using these expressions are defined to build intention classification knowledge. A set pattern-matching algorithm is proposed to determine the intension class efficiently. From simulation results for 5,859 conversations, the presented set pattern-matching algorithm is about 44.5 times faster than Aho and Corasick method. Precision and recall of intension classifications are 90% and 95%. Moreover, precision and recall of unnecessary sentences extraction are 96% and 97%.
Keywords :
knowledge acquisition; pattern classification; pattern matching; speech processing; text analysis; intension expressions; intention classification knowledge; pattern-matching algorithm; speaker intention; speech processing; text classification; Computational modeling; Electronic mail; Information science; Intelligent systems; Mobile communication; Mobile computing; Pattern matching; Speech recognition; Telephony; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computers and Communications, 2004. Proceedings. ISCC 2004. Ninth International Symposium on
Print_ISBN :
0-7803-8623-X
Type :
conf
DOI :
10.1109/ISCC.2004.1358388
Filename :
1358388
Link To Document :
بازگشت