DocumentCode :
1832098
Title :
Finding Facet Content on Web by Position Inverted Index
Author :
Jin, Canghong ; Hou, Honglun ; Wu, Minghui ; Ying, Jing
Author_Institution :
Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
fYear :
2012
fDate :
25-27 June 2012
Firstpage :
1699
Lastpage :
1703
Abstract :
Entity facet can give the enhancement on search result since it can present web elements by multiple dimensions. Moreover, if web content is sorted by fixed dimension like term frequency, peculiarly data cannot be touched by user easily. Thus, how to extract and manage data facet is a significant work in web search area. Most of exist approaches find facets on web by manually defined annotation or cluster algorithm based on large corpus. These methods are very complex and need heavy resource. On the other hand, since inverted index structure is widely used on web search engine, in this paper, we propose a novel index structure called position index structure based on inverted index. By using this structure, we try to find a better solution to solve the facet extraction and peculiarly data find problems.
Keywords :
information management; information retrieval; pattern clustering; search engines; Web facet content sorting; Web search engine; annotation algorithm; cluster algorithm; data facet extraction; data facet management; position inverted index; Cities and towns; Data mining; Indexes; Search engines; Wind forecasting; facet search; peculiarly data extract; position inverted index;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS), 2012 IEEE 14th International Conference on
Conference_Location :
Liverpool
Print_ISBN :
978-1-4673-2164-8
Type :
conf
DOI :
10.1109/HPCC.2012.253
Filename :
6332387
Link To Document :
بازگشت