• DocumentCode
    3123367
  • Title

    Component-based search engine for blogs

  • Author

    Hirokawa, Sachio ; Yin, Chengjiu ; Nakatoh, Tetsuya

  • Author_Institution
    Res. Inst. for Inf. Technol., Kyushu Univ., Fukuoka, Japan
  • fYear
    2011
  • fDate
    27-30 June 2011
  • Firstpage
    1074
  • Lastpage
    1078
  • Abstract
    A wrapper is a program that selectively extracts a necessary part (component) from Web pages. Automatic or semi-automatic wrapper construction is crucial to achieve a fine grained search engine for Web pages. However, this is not an easy task to achieve. This paper proposes a component-based search engine in which the content components gain a high score in the search results. Thus, the required segments for a query can be obtained without using a wrapper.
  • Keywords
    Web sites; information retrieval; search engines; Web page; blog; component-based search engine; semiautomatic wrapper construction; Blogs; Data mining; HTML; Indexes; Noise; Search engines; Web pages; blogs; ranking; search engine; semi-structured document;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems (FUZZ), 2011 IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1098-7584
  • Print_ISBN
    978-1-4244-7315-1
  • Electronic_ISBN
    1098-7584
  • Type

    conf

  • DOI
    10.1109/FUZZY.2011.6007650
  • Filename
    6007650