DocumentCode
478977
Title
Searching Messages Based on Semantic Context
Author
Huang, Guangjun ; Musilek, Petr ; Sun, Jianguo
Author_Institution
Coll. of Elec. & Inf. Eng., Henan Univ. of Sci. & Technol., Luoyang
fYear
2008
fDate
12-14 Oct. 2008
Firstpage
1
Lastpage
4
Abstract
Clients´ queries upon keywords or other informed description do not usually provide complete and unambiguous retrieval of information. Expansion of the queries based on semantic relation and phrase patterns is an effective approach to improve the retrieval. In this paper, a novel approach to queries expansion is presented. In the first step, keywords and phrases in a query are extracted, and the query is classified using Bayesian classifier. The classification defines the domain of user interest which serves as the context around the query. The phrase is then matched against fixed patterns which are automatically extracted from the set of domain documents and serve as context within the query. This is followed by expansion of the remaining keywords that are not in the phrases. This expansion is based on synonyms and hyponyms in the domain ontology, and is controlled by measuring information gain. Finally, the similarity between the expanded query and the document of the domain is computed by combining the weighted phrases and keywords. Experimental results indicate that the proposed approach improves the precision and recall of information retrieval.
Keywords
Bayes methods; feature extraction; pattern classification; query processing; Bayesian classifier; domain ontology; hyponyms; information retrieval; message searching; queries expansion; semantic context; synonyms; Automatic control; Bayesian methods; Data mining; Information retrieval; Internet; Natural languages; Ontologies; Pattern matching; Search engines; Semantic Web;
fLanguage
English
Publisher
ieee
Conference_Titel
Wireless Communications, Networking and Mobile Computing, 2008. WiCOM '08. 4th International Conference on
Conference_Location
Dalian
Print_ISBN
978-1-4244-2107-7
Electronic_ISBN
978-1-4244-2108-4
Type
conf
DOI
10.1109/WiCom.2008.2543
Filename
4680732
Link To Document