• DocumentCode
    1832098
  • Title

    Finding Facet Content on Web by Position Inverted Index

  • Author

    Jin, Canghong ; Hou, Honglun ; Wu, Minghui ; Ying, Jing

  • Author_Institution
    Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
  • fYear
    2012
  • fDate
    25-27 June 2012
  • Firstpage
    1699
  • Lastpage
    1703
  • Abstract
    Entity facets can enhance search results because they present web elements along multiple dimensions. Moreover, if web content is sorted by a fixed dimension such as term frequency, peculiar data cannot be reached easily by the user. Thus, how to extract and manage data facets is a significant problem in the web search area. Most existing approaches find facets on the web through manually defined annotations or clustering algorithms based on a large corpus. These methods are very complex and resource-intensive. On the other hand, since the inverted index structure is widely used in web search engines, in this paper we propose a novel index structure, called the position index structure, based on the inverted index. Using this structure, we try to find a better solution to the problems of facet extraction and peculiar data finding.
  • Keywords
    information management; information retrieval; pattern clustering; search engines; Web facet content sorting; Web search engine; annotation algorithm; cluster algorithm; data facet extraction; data facet management; position inverted index; Cities and towns; Data mining; Indexes; Search engines; Wind forecasting; facet search; peculiarly data extract; position inverted index;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS), 2012 IEEE 14th International Conference on
  • Conference_Location
    Liverpool
  • Print_ISBN
    978-1-4673-2164-8
  • Type

    conf

  • DOI
    10.1109/HPCC.2012.253
  • Filename
    6332387