Title :
An Unsupervised Technical Readability Ranking Model by Building a Conceptual Terrain in LSI
Author :
Jameel, Shoaib ; Qian, Xiaojun
Author_Institution :
Dept. of Syst. Eng. & Eng. Manage., Chinese Univ. of Hong Kong, Hong Kong, China
Abstract :
Searching for domain-specific related information has gained a high popularity in recent years. Naturally, everyone is not at par with each other when it comes to knowledge about the concepts of a domain. A doctor may be well versed in her field of specialization and probably would search for advanced medical documents on the Internet. But she may look for a much simpler material related to Computer Programming. However, current information retrieval (IR) systems just return a mixed set of results based on similarity and popularity of the web pages. Existing methods which have tried to address the issue of matching readers with texts in domain-specific IR either use an ontology or some seed concepts thereby limiting their application in certain domains only. Moreover, readability methods cannot address the issue in domain-specific IR ranking because they fail to give precise prediction when applied on web pages. We address this problem in domain-specific search using a conceptual model where the sequence of the terms in a document is modeled as a connected conceptual terrain. Our model has achieved significant improvement in ranking documents by technical readability.
Keywords :
Internet; Web sites; information retrieval; ontologies (artificial intelligence); Internet; LSI; Web pages; advanced medical documents; computer programming; conceptual terrain; connected conceptual terrain; domain-specific IR ranking; domain-specific related information; information retrieval systems; ontology; seed concepts; unsupervised technical readability ranking model; Biopsy; Computational modeling; Internet; Large scale integration; Semantics; Vectors; Web pages; Cohesion; Domain-specific; LSI; Ranking; Readability;
Conference_Titel :
Semantics, Knowledge and Grids (SKG), 2012 Eighth International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-2561-5
DOI :
10.1109/SKG.2012.20