Title :
Chinese Web Information Retrieval Based on Shallow Parsing
Author :
Chen, Zhi-qun ; Zhou, Qi-li ; Wang, Rong-bo
Author_Institution :
Inst. of Comput. Applic. Technol., Hangzhou Dianzi Univ., Hangzhou, China
Abstract :
To improve retrieval performance, a shallow parsing technique for text was introduced into Chinese Web information retrieval. First, the predicate and the prepositive and succedent nominal components close to the predicate were extracted from each Chinese sentence. Then, a semantic vector of the Chinese text was obtained by converting the predicates and nominal components to concepts. An algorithm was presented for calculating the similarity of semantic vectors, and a Chinese Web information retrieval model was designed. The model evaluates the degree of match between indexed documents and users' interests based on semantic similarity calculation. Users' interests are expressed by submitting representative documents. Experimental results show that precision is noticeably improved compared with a popular Web search engine.
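The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm, which the abstract does not specify. It assumes that each sentence has already been reduced to a (prepositive nominal, predicate, succedent nominal) triple, and the concept table `CONCEPTS`, the slot weights `WEIGHTS`, and the best-match averaging rule are all hypothetical choices made for illustration only.

```python
from typing import Dict, List, Optional, Tuple

# One triple per sentence: (prepositive nominal, predicate, succedent nominal).
# A nominal slot is None when the sentence has no such component.
Triple = Tuple[Optional[str], str, Optional[str]]

# Hypothetical word-to-concept table (assumption); a real system would
# draw on a semantic lexicon rather than a hand-written dictionary.
CONCEPTS: Dict[str, str] = {
    "buy": "ACQUIRE", "purchase": "ACQUIRE",
    "firm": "ORGANIZATION", "company": "ORGANIZATION",
    "phone": "DEVICE", "handset": "DEVICE",
}

# Assumed slot weights: the predicate counts twice as much as each nominal.
WEIGHTS = (0.25, 0.5, 0.25)


def to_concept(word: Optional[str]) -> Optional[str]:
    """Map a word to its concept; unknown words stand for themselves."""
    if word is None:
        return None
    return CONCEPTS.get(word, word)


def semantic_vector(triples: List[Triple]) -> List[Triple]:
    """Convert every component of every triple to a concept."""
    return [(to_concept(s), to_concept(p), to_concept(o)) for s, p, o in triples]


def triple_similarity(a: Triple, b: Triple) -> float:
    """Sum the weights of the slots whose concepts are present and equal."""
    return sum(w for w, x, y in zip(WEIGHTS, a, b) if x is not None and x == y)


def vector_similarity(query: List[Triple], doc: List[Triple]) -> float:
    """Average, over query triples, of the best-matching document triple."""
    if not query or not doc:
        return 0.0
    return sum(max(triple_similarity(q, d) for d in doc) for q in query) / len(query)


# A representative document stands in for the user's interest.
interest = semantic_vector([("company", "buy", "phone")])
indexed = semantic_vector([("firm", "purchase", "handset"), ("user", "like", "phone")])
score = vector_similarity(interest, indexed)
```

Here `score` is highest when every triple of the representative document has a concept-level counterpart in the indexed document, even if the surface words differ, which is the intuition behind matching on concepts rather than keywords.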
Keywords :
Internet; grammars; indexing; information retrieval; natural language processing; search engines; text analysis; Chinese Web information retrieval; Chinese sentence; Chinese text; Web search engine; indexed documents; predicate component; prepositive nominal component; representative documents; retrieval performance; semantic similarity calculating; semantic vector; shallow parsing technique; succedent nominal component; Web information retrieval; semantic retrieval; shallow parsing for Chinese text; similarity calculating;
Conference_Title :
Web Information Systems and Mining (WISM), 2010 International Conference on
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-8438-6
DOI :
10.1109/WISM.2010.133