DocumentCode :
1994347
Title :
Recovering "lack of words" in text categorization for item banks
Author :
Nuntiyagul, Atorn ; Cercone, Nick ; Naruedomkul, Kanlaya
Author_Institution :
Inst. for Innovation & Dev. of Learning Process, Mahidol Univ., Bangkok, Thailand
Volume :
2
fYear :
2005
fDate :
26-28 July 2005
Firstpage :
31
Abstract :
PKIP, patterned keywords in phrase, is our feature selection approach to text categorization (TC) for item banks. An item bank is a collection of textual data in which each item consists of short sentences and has only a few relevant words for categorization. Traditional TC techniques cannot provide sufficiently accurate results because of a "lack of words" problem. PKIP improves categorization accuracy and recovers from the "lack of words" problem. Our sample item bank is the collection of Thai primary mathematics problems and we use SVM as our classifier. Classification results show that PKIP produces acceptable classification performance.
Keywords :
pattern classification; support vector machines; text analysis; SVM classifier; Thai primary mathematics problem; feature selection; item banks; lack of words recovery; patterned keywords in phrase; short sentences; text categorization; Computer science; Educational institutions; Frequency; Mathematics; Support vector machine classification; Support vector machines; Technological innovation; Text categorization; Web pages; World Wide Web;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Software and Applications Conference, 2005. COMPSAC 2005. 29th Annual International
ISSN :
0730-3157
Print_ISBN :
0-7695-2413-3
Type :
conf
DOI :
10.1109/COMPSAC.2005.128
Filename :
1508076
Link To Document :
بازگشت