DocumentCode
1832098
Title
Finding Facet Content on Web by Position Inverted Index
Author
Jin, Canghong ; Hou, Honglun ; Wu, Minghui ; Ying, Jing
Author_Institution
Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
fYear
2012
fDate
25-27 June 2012
Firstpage
1699
Lastpage
1703
Abstract
Entity facet can give the enhancement on search result since it can present web elements by multiple dimensions. Moreover, if web content is sorted by fixed dimension like term frequency, peculiarly data cannot be touched by user easily. Thus, how to extract and manage data facet is a significant work in web search area. Most of exist approaches find facets on web by manually defined annotation or cluster algorithm based on large corpus. These methods are very complex and need heavy resource. On the other hand, since inverted index structure is widely used on web search engine, in this paper, we propose a novel index structure called position index structure based on inverted index. By using this structure, we try to find a better solution to solve the facet extraction and peculiarly data find problems.
Keywords
information management; information retrieval; pattern clustering; search engines; Web facet content sorting; Web search engine; annotation algorithm; cluster algorithm; data facet extraction; data facet management; position inverted index; Cities and towns; Data mining; Indexes; Search engines; Wind forecasting; facet search; peculiarly data extract; position inverted index;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS), 2012 IEEE 14th International Conference on
Conference_Location
Liverpool
Print_ISBN
978-1-4673-2164-8
Type
conf
DOI
10.1109/HPCC.2012.253
Filename
6332387
Link To Document