• DocumentCode
    2409964
  • Title

    Study on Microblog Information Acquisition Based on Open API and Multithreading Mechanism

  • Author

    Mou, Gaidong ; Du, Yuncheng ; Lin, Chunyu ; Lv, Xueqiang

  • fYear
    2011
  • fDate
    21-23 Oct. 2011
  • Firstpage
    200
  • Lastpage
    203
  • Abstract
    With its rapid development, microblog is turning out to be an information repository. There is a new acquisition method of web information put forward in this paper, which combines automatic proxy with multithreading mechanism. This method creates a multiplexer channel between the client and the target server, which takes shape the way that the client drives a certain number of http proxy servers to send requests to the target server. It is applied in the access of the restful open API of microblog, and the collection rate grows about 6 times compared with the method complying with the access frequency limit API declared. The two typical problems are thus resolved to a certain extent, that microblog server limits the access frequency of the open API directed against the collection IP address and the collection rate of microblog information is relatively slow.
  • Keywords
    Computers; Data mining; Educational institutions; Message systems; Multithreading; Servers; Web pages; API; automatic proxy; information acquisition; multithreading;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational and Information Sciences (ICCIS), 2011 International Conference on
  • Conference_Location
    Chengdu, China
  • Print_ISBN
    978-1-4577-1540-2
  • Type

    conf

  • DOI
    10.1109/ICCIS.2011.264
  • Filename
    6086169