DocumentCode :
2409964
Title :
Study on Microblog Information Acquisition Based on Open API and Multithreading Mechanism
Author :
Mou, Gaidong ; Du, Yuncheng ; Lin, Chunyu ; Lv, Xueqiang
fYear :
2011
fDate :
21-23 Oct. 2011
Firstpage :
200
Lastpage :
203
Abstract :
With their rapid development, microblogs are becoming an information repository. This paper puts forward a new method for acquiring web information that combines automatic proxying with a multithreading mechanism. The method creates a multiplexed channel between the client and the target server: the client drives a number of HTTP proxy servers, which send requests to the target server. Applied to the RESTful open API of a microblog, it raises the collection rate about 6 times compared with a method that complies with the access frequency limit the API declares. Two typical problems are thus resolved to a certain extent: the microblog server limits the open API access frequency per collecting IP address, and the collection rate of microblog information is relatively slow.
Keywords :
Computers; Data mining; Educational institutions; Message systems; Multithreading; Servers; Web pages; API; automatic proxy; information acquisition; multithreading;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computational and Information Sciences (ICCIS), 2011 International Conference on
Conference_Location :
Chengdu, China
Print_ISBN :
978-1-4577-1540-2
Type :
conf
DOI :
10.1109/ICCIS.2011.264
Filename :
6086169
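
The abstract describes a client that drives several HTTP proxy servers from multiple threads. The fan-out idea can be illustrated with the minimal sketch below. It is not the authors' implementation: the function name `collect`, the round-robin assignment of requests to proxies, and the caller-supplied `fetch(url, proxy)` callable are all assumptions made for illustration, and the sketch performs no network access itself.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import cycle
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")


def collect(
    urls: Sequence[str],
    proxies: Sequence[str],
    fetch: Callable[[str, str], T],
) -> List[T]:
    """Spread requests over proxies, using one worker thread per proxy.

    `fetch` is a hypothetical caller-supplied function that requests
    `url` through `proxy` and returns the response in whatever form
    the caller needs.
    """
    if not proxies:
        raise ValueError("at least one proxy is required")

    # Assumption: plain round-robin pairing of each URL with a proxy.
    assignments = list(zip(urls, cycle(proxies)))

    # One thread per proxy, so the proxies can work concurrently.
    with ThreadPoolExecutor(max_workers=len(proxies)) as pool:
        # pool.map returns results in the same order as `urls`.
        return list(pool.map(lambda pair: fetch(*pair), assignments))
```

Passing `fetch` in as a parameter keeps the sketch independent of any particular HTTP library or microblog API, neither of which is named in this record.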