Title :
Use syntax phrases of different level to improve BoW
Author :
Li, Ziqiang ; Zhou, Mingtian
Author_Institution :
Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
Abstract :
Phrases of different level in parse tree have different level of semantic abstract, and may function diversely in classification. This paper uses syntax phrases of different level to improve BoW representation. The result shows that level of phrases is valuable to capture the commonness of positive instances, and can better the discernment of positive instances. But it decreases the discernment of negative instances on the other side. And we also find BoW is sufficient as for negative instance recognition if there are enough positive instances.
Keywords :
text analysis; trees (mathematics); BoW representation; parse tree; semantic abstract; syntax phrases; Agricultural engineering; Classification tree analysis; Computer science; Computer science education; Costs; Educational technology; Information retrieval; Morphology; Ontologies; Text categorization; BoW; parse tree; syntax phrase; text classification;
Conference_Titel :
Education Technology and Computer (ICETC), 2010 2nd International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-6367-1
DOI :
10.1109/ICETC.2010.5529364