• DocumentCode
    735881
  • Title

    Clustering to determine predictive model for news reports analysis and econometric modeling

  • Author

    Mukherjee, Sanjoy Kumar ; Bandyopadhyay, Sivaji

  • Author_Institution
    Comput. Sci. & Eng. Formerly at Narula Inst. of Technol., Kolkata, India
  • fYear
    2015
  • fDate
    9-11 July 2015
  • Firstpage
    302
  • Lastpage
    309
  • Abstract
    A tree model is constructed for the econometric problem domain and for topic modeling of news reports using a clustering approach. Here segments are represented as discretized intervals defined on econometric variables for speeding up the construction of regression tree. This discretization is achieved from variances defined on variables with predictability for that generated for calculating category utility values defined on correlated variables where the discretization method proposed has the aim to satisfy a constraint of minimum entropy distribution of values of the predictor variable among the categories. An algorithm is proposed for tree merging which is used for incrementally incorporating information for new time intervals with the existing model to generate updated tree model for maintaining logical consistency. The tree merging algorithm has been shown to be suitable for applying to news report documents or econometric information. This is accomplished with a proposed Pruning procedure for maintaining logical consistency in the merged tree which is applied together with existing approaches for limiting pruning and access costs for reducing misclassification error.
  • Keywords
    document handling; econometrics; entropy; information resources; pattern clustering; trees (mathematics); clustering; discretization method; econometric modeling; minimum entropy distribution; news report analysis; news report documents; regression tree; tree merging algorithm; tree model; Advertising; Clustering algorithms; Econometrics; Merging; Predictive models; Probabilistic logic; Regression tree analysis; discretization; econometric model; news report; regression tree; tree merging;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Recent Trends in Information Systems (ReTIS), 2015 IEEE 2nd International Conference on
  • Conference_Location
    Kolkata
  • Type

    conf

  • DOI
    10.1109/ReTIS.2015.7232895
  • Filename
    7232895