DocumentCode
2202616
Title
Extraction of User Profile Based on the Hadoop Framework
Author
Huang, Lan ; Wang, Xiao-Wei ; Zhai, Yan-Dong ; Yang, Bin
Author_Institution
Coll. of Comput. Sci. & Technol., Jilin Univ., Changchun, China
fYear
2009
fDate
24-26 Sept. 2009
Firstpage
1
Lastpage
6
Abstract
With the rapid development of the Internet, the volume of Web information has increased dramatically, and users often feel lost amid it. Distributed processing of mass data on clusters composed of many machines, and personalized search services based on user profiles, have become hotspots of research and development. This paper first studies the operating mechanism of Hadoop, a typical distributed processing framework from Apache, then implements the extraction of user profiles from a large volume of Web log data, and verifies its efficiency through a comparison experiment against a single machine.
Keywords
Internet; data mining; information retrieval; Apache distributed processing framework; Hadoop framework; Web information; Web log data; mass data distributed processing; personalized search service; user profile extraction; Data processing; Distributed processing; Fault tolerance; File systems; Java; Logic; Parallel processing; Programming profession; Research and development
fLanguage
English
Publisher
IEEE
Conference_Titel
Wireless Communications, Networking and Mobile Computing, 2009. WiCom '09. 5th International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-3692-7
Electronic_ISBN
978-1-4244-3693-4
Type
conf
DOI
10.1109/WICOM.2009.5305856
Filename
5305856