Title :
Component-based search engine for blogs
Author :
Hirokawa, Sachio ; Yin, Chengjiu ; Nakatoh, Tetsuya
Author_Institution :
Res. Inst. for Inf. Technol., Kyushu Univ., Fukuoka, Japan
Abstract :
A wrapper is a program that selectively extracts the necessary parts (components) of Web pages. Automatic or semi-automatic wrapper construction is crucial to achieving a fine-grained search engine for Web pages, but it is not an easy task. This paper proposes a component-based search engine in which content components receive high scores in the search results. The segments relevant to a query can thus be obtained without using a wrapper.
Keywords :
Web sites; information retrieval; search engines; Web page; blog; component-based search engine; semiautomatic wrapper construction; Blogs; Data mining; HTML; Indexes; Noise; Search engines; Web pages; blogs; ranking; search engine; semi-structured document
Conference_Title :
Fuzzy Systems (FUZZ), 2011 IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-7315-1
ISSN :
1098-7584
DOI :
10.1109/FUZZY.2011.6007650