• DocumentCode
    3773427
  • Title

    High-Speed Classification of Financial Network Public Opinion Based on Hadoop

  • Author

    Guichun Gong

  • Author_Institution
    Res. Inst. of Electron. Sci. &
  • Volume
    1
  • fYear
    2015
  • Firstpage
    89
  • Lastpage
    92
  • Abstract
    Financial network public opinion refers to those influential tendentious views and comments, spread through the network, on economic issues in real life. It reflects the economic focus in the corresponding period. To learn about the dynamics of financial market, processing and analysis of those opinions is needed. However, in the face of vast amounts of data, of which the analysis requires a great mount of storage and computations, standalone exposes the shortcoming of low storage capacity and slow processing speed problems. In this paper, the Hadoop distributed platform is applied to deal with this situation. Moreover, this article has presented the application of MapReduce programming model in key processes of public opinion text processing, including preprocessing, feature selection and vectorization of text. Experiments have showed that the Hadoop cluster well solved the disadvantages of low storage capacity and low-performance in computation.
  • Keywords
    "Internet","Economics","Crawlers","Text processing","Media","Distributed databases","File systems"
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Design (ISCID), 2015 8th International Symposium on
  • Print_ISBN
    978-1-4673-9586-1
  • Type

    conf

  • DOI
    10.1109/ISCID.2015.138
  • Filename
    7468905