• DocumentCode
    2202616
  • Title
    Extraction of User Profile Based on the Hadoop Framework
  • Author
    Huang, Lan ; Wang, Xiao-Wei ; Zhai, Yan-Dong ; Yang, Bin
  • Author_Institution
    Coll. of Comput. Sci. & Technol., Jilin Univ., Changchun, China
  • fYear
    2009
  • fDate
    24-26 Sept. 2009
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    With the rapid development of the Internet, the volume of Web information has increased dramatically, and users often feel lost amid the sheer amount of it. Distributed processing of massive data on clusters of many machines, and personalized search services based on user profiles, have become hotspots of research and development. This paper first studies the operating mechanism of Hadoop, a typical distributed processing framework from Apache, then implements the extraction of user profiles from a large volume of Web log data, and verifies its efficiency through a comparison experiment against a single machine.
  • Keywords
    Internet; data mining; information retrieval; Apache distributed processing framework; Hadoop framework; Web information; Web log data; mass data distributed processing; personalized search service; user profile extraction; Data processing; Distributed processing; Fault tolerance; File systems; Java; Logic; Parallel processing; Programming profession; Research and development
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Wireless Communications, Networking and Mobile Computing, 2009. WiCom '09. 5th International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-3692-7
  • Electronic_ISBN
    978-1-4244-3693-4
  • Type
    conf
  • DOI
    10.1109/WICOM.2009.5305856
  • Filename
    5305856