DocumentCode
3773427
Title
High-Speed Classification of Financial Network Public Opinion Based on Hadoop
Author
Guichun Gong
Author_Institution
Res. Inst. of Electron. Sci. &
Volume
1
fYear
2015
Firstpage
89
Lastpage
92
Abstract
Financial network public opinion refers to influential, tendentious views and comments on real-life economic issues that spread through the network. It reflects the economic focus of the corresponding period. Understanding the dynamics of the financial market requires processing and analyzing these opinions. However, the volume of data is vast and its analysis demands a great amount of storage and computation, which exposes the low storage capacity and slow processing speed of a standalone machine. In this paper, the Hadoop distributed platform is applied to address this problem. The paper also presents the application of the MapReduce programming model to key steps of public opinion text processing, including preprocessing, feature selection, and text vectorization. Experiments show that the Hadoop cluster overcomes the low storage capacity and low computational performance of the standalone approach.
Keywords
"Internet","Economics","Crawlers","Text processing","Media","Distributed databases","File systems"
Publisher
ieee
Conference_Titel
Computational Intelligence and Design (ISCID), 2015 8th International Symposium on
Print_ISBN
978-1-4673-9586-1
Type
conf
DOI
10.1109/ISCID.2015.138
Filename
7468905