DocumentCode
2893592
Title
Research on Social Network Based on Meta-search Engine
Author
Yang, Shen ; Zi-tao, Liu ; Cheng, Luo ; Ye, Li
Author_Institution
Sch. of Inf. Manage., Wuhan Univ., Wuhan, China
fYear
2009
fDate
18-20 Sept. 2009
Firstpage
179
Lastpage
183
Abstract
In order to solve the problem that we can only collect data from one single data source at some fixed time after mining the keywords in a rather superficial level, and to take full use of the information returned by search engines to construct the social relationship network based on the semantic link of the searched subject, we do the regular research by using the ROST Content Mining System which helps to undergo the process of page monitoring, content analysis and social network mining based on the pages returned from the four search engines (Google, Baidu, Sougou and Youdao). In the mining process, we adopt the cross-page framework adaptive algorithm which helps to solve the instability problem of the HTML framework codes, to extract information from the acquired web pages. Then we extract the cooccurrence set of high-frequency characteristic words to create the tridimensional social network graph by adopting the progressive search algorithm in the meta-search engine to extend the attribute set of the keywords. Finally, we conducted three typical case studies. They are the comparison of the coverage rate between Google and the meta-search engine, the dynamic changes in real-time network based on the meta-search engine and the progressive mining of effective content in meta-search engine, which all showed the advantages of the method in which we proposed the meta-search engine, as we could have more data sources, stronger real-time dynamic monitoring capacity, and deeper progressive searching ability. So we propose this meta-search engine method which can be used in social network study, aiming to develop the quality of the social network based on content mining, observe the hiding relationships in deeper levels and widen the research scope of content mining.
Keywords
data mining; hypermedia markup languages; metacomputing; search engines; social networking (online); Baidu; Google; HTML framework codes instability; Sougou; Web pages; Youdao; content analysis; content mining system; cross page framework adaptive algorithm; keywords attribute set; meta search engine; page monitoring; progressive search algorithm; social network mining; social network research; tridimensional social network graph; adaptive algorithm; content mining; meta-search engine; progressive algorithm; real-time network; social network;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Information Systems and Applications Conference, 2009. WISA 2009. Sixth
Conference_Location
Xuzhou, Jiangsu
Print_ISBN
978-0-7695-3874-7
Type
conf
DOI
10.1109/WISA.2009.21
Filename
5368080
Link To Document