Title :
Context-Ontology Driven Focused Crawling of Web Documents
Author :
Pahal, Nisha ; Chauhan, Naresh ; Sharma, A.K.
Author_Institution :
YMCA Inst. of Eng., Faridabad
Abstract :
Most current focused crawling approaches perform syntactic matching, that is, they retrieve documents that contain particular keywords from the user's query. This often leads to poor discovery results, because the keywords in the query can be semantically similar but syntactically different, or vice versa. Moreover, the query matching score is calculated taking into account only the keywords from the user's query. Thus, regardless of the context, the same list of results is returned in response to a particular query. This paper presents an approach for document discovery building on a comprehensive framework for context-ontology driven focused crawling of Web documents.
Keywords :
document handling; ontologies (artificial intelligence); pattern matching; query processing; semantic Web; Web document crawling; context-ontology; document discovery building; document retrieval; syntactic query matching; Crawlers; Information retrieval; Ontologies; Particle separators; Search engines; Uniform resource locators; Web pages; World Wide Web; context; focused crawler; ontology
Conference_Title :
Wireless Communication and Sensor Networks, 2007. WCSN '07. Third International Conference on
Conference_Location :
Allahabad
Print_ISBN :
978-1-4244-1877-0
Electronic_ISBN :
978-1-4244-1878-7
DOI :
10.1109/WCSN.2007.4475761