Title :
A framework to integrate unstructured and structured data for enterprise analytics
Author :
Dey, Lipika ; Verma, Ishan ; Khurdiya, Arpit ; Sameera Bharadwaja, H.
Author_Institution :
TCS Innovation Labs., Tata Consultancy Services Ltd., New Delhi, India
Abstract :
It is well-accepted that when information from structured and unstructured data sources is analyzed together, the potential of gaining meaningful insights increases manifolds. This paper provides a framework for integrating structured and unstructured data in the context of enterprise analytics. Structured data is assumed to be in the form of a time-series that encodes some aspect of enterprise performance over a specified period, like monthly or weekly sales figures or stock prices etc. Unstructured data may be gathered from news sources, internal repositories of consumer feedbacks, blogs and discussion forums or also from social-media like Twitter, Facebook etc. This paper focuses on intelligent methods of linking time-series data points to the unstructured content in an application-specific way such that the linked unstructured text creates a context for interpreting the time-series behavior. The aim is to generate new forms of data that can be employed in future to derive predictive models or perform causal analytics or also help in risk assessment for Enterprises.
Keywords :
business data processing; data analysis; data integration; text analysis; time series; causal analytics; enterprise analytics; enterprise performance; linked unstructured text; predictive models; risk assessment; time-series data points; unstructured-structured data integration; Companies; Context; Correlation; Data mining; Indexing; Semantics; Data Association; Fusion Enabled Decision Support; Information Fusion; Time-series Analysis;
Conference_Titel :
Information Fusion (FUSION), 2013 16th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-605-86311-1-3