Title :
Searching Messages Based on Semantic Context
Author :
Huang, Guangjun ; Musilek, Petr ; Sun, Jianguo
Author_Institution :
Coll. of Elec. & Inf. Eng., Henan Univ. of Sci. & Technol., Luoyang
Abstract :
Clients´ queries upon keywords or other informed description do not usually provide complete and unambiguous retrieval of information. Expansion of the queries based on semantic relation and phrase patterns is an effective approach to improve the retrieval. In this paper, a novel approach to queries expansion is presented. In the first step, keywords and phrases in a query are extracted, and the query is classified using Bayesian classifier. The classification defines the domain of user interest which serves as the context around the query. The phrase is then matched against fixed patterns which are automatically extracted from the set of domain documents and serve as context within the query. This is followed by expansion of the remaining keywords that are not in the phrases. This expansion is based on synonyms and hyponyms in the domain ontology, and is controlled by measuring information gain. Finally, the similarity between the expanded query and the document of the domain is computed by combining the weighted phrases and keywords. Experimental results indicate that the proposed approach improves the precision and recall of information retrieval.
Keywords :
Bayes methods; feature extraction; pattern classification; query processing; Bayesian classifier; domain ontology; hyponyms; information retrieval; message searching; queries expansion; semantic context; synonyms; Automatic control; Bayesian methods; Data mining; Information retrieval; Internet; Natural languages; Ontologies; Pattern matching; Search engines; Semantic Web;
Conference_Titel :
Wireless Communications, Networking and Mobile Computing, 2008. WiCOM '08. 4th International Conference on
Conference_Location :
Dalian
Print_ISBN :
978-1-4244-2107-7
Electronic_ISBN :
978-1-4244-2108-4
DOI :
10.1109/WiCom.2008.2543