DocumentCode :
735881
Title :
Clustering to determine predictive model for news reports analysis and econometric modeling
Author :
Mukherjee, Sanjoy Kumar ; Bandyopadhyay, Sivaji
Author_Institution :
Comput. Sci. & Eng. Formerly at Narula Inst. of Technol., Kolkata, India
fYear :
2015
fDate :
9-11 July 2015
Firstpage :
302
Lastpage :
309
Abstract :
A tree model is constructed for the econometric problem domain and for topic modeling of news reports using a clustering approach. Here segments are represented as discretized intervals defined on econometric variables for speeding up the construction of regression tree. This discretization is achieved from variances defined on variables with predictability for that generated for calculating category utility values defined on correlated variables where the discretization method proposed has the aim to satisfy a constraint of minimum entropy distribution of values of the predictor variable among the categories. An algorithm is proposed for tree merging which is used for incrementally incorporating information for new time intervals with the existing model to generate updated tree model for maintaining logical consistency. The tree merging algorithm has been shown to be suitable for applying to news report documents or econometric information. This is accomplished with a proposed Pruning procedure for maintaining logical consistency in the merged tree which is applied together with existing approaches for limiting pruning and access costs for reducing misclassification error.
Keywords :
document handling; econometrics; entropy; information resources; pattern clustering; trees (mathematics); clustering; discretization method; econometric modeling; minimum entropy distribution; news report analysis; news report documents; regression tree; tree merging algorithm; tree model; Advertising; Clustering algorithms; Econometrics; Merging; Predictive models; Probabilistic logic; Regression tree analysis; discretization; econometric model; news report; regression tree; tree merging;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Recent Trends in Information Systems (ReTIS), 2015 IEEE 2nd International Conference on
Conference_Location :
Kolkata
Type :
conf
DOI :
10.1109/ReTIS.2015.7232895
Filename :
7232895
Link To Document :
بازگشت