DocumentCode :
3123367
Title :
Component-based search engine for blogs
Author :
Hirokawa, Sachio ; Yin, Chengjiu ; Nakatoh, Tetsuya
Author_Institution :
Res. Inst. for Inf. Technol., Kyushu Univ., Fukuoka, Japan
fYear :
2011
fDate :
27-30 June 2011
Firstpage :
1074
Lastpage :
1078
Abstract :
A wrapper is a program that selectively extracts a necessary part (component) from Web pages. Automatic or semi-automatic wrapper construction is crucial to achieve a fine grained search engine for Web pages. However, this is not an easy task to achieve. This paper proposes a component-based search engine in which the content components gain a high score in the search results. Thus, the required segments for a query can be obtained without using a wrapper.
Keywords :
Web sites; information retrieval; search engines; Web page; blog; component-based search engine; semiautomatic wrapper construction; Blogs; Data mining; HTML; Indexes; Noise; Search engines; Web pages; blogs; ranking; search engine; semi-structured document;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems (FUZZ), 2011 IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1098-7584
Print_ISBN :
978-1-4244-7315-1
Electronic_ISBN :
1098-7584
Type :
conf
DOI :
10.1109/FUZZY.2011.6007650
Filename :
6007650
Link To Document :
بازگشت