DocumentCode
124140
Title
Obtaining Technology Insights from Large and Heterogeneous Document Collections
Author
Dey, Lipika ; Mahajan, Dhruv ; Gupta, H.
Author_Institution
Innovation Labs., Delhi Tata Consultancy Services, New Delhi, India
Volume
1
fYear
2014
fDate
11-14 Aug. 2014
Firstpage
102
Lastpage
109
Abstract
Keeping up with rapid advances in research in various fields of Engineering and Technology is a challenging task. Decision makers including academics, program managers, venture capital investors, industry leaders and funding agencies not only need to be abreast of latest developments but also be able to assess the effect of growth in certain areas on their core business. Though analyst agencies like Gartner, McKinsey etc. Provide such reports for some areas, thought leaders of all organisations still need to amass data from heterogeneous collections like research publications, analyst reports, patent applications, competitor information etc. To help them finalize their own strategies. Text mining and data analytics researchers have been looking at integrating statistics, text analytics and information visualization to aid the process of retrieval and analytics. In this paper, we present our work on automated topical analysis and insight generation from large heterogeneous text collections of publications and patents. While most of the earlier work in this area provides search-based platforms, ours is an integrated platform for search and analysis. We have presented several methods and techniques that help in analysis and better comprehension of search results. We have also presented methods for generating insights about emerging and popular trends in research along with contextual differences between academic research and patenting profiles. We also present novel techniques to present topic evolution that helps users understand how a particular area has evolved over time.
Keywords
data analysis; information retrieval; patents; text analysis; academic research; automated topical analysis; heterogeneous document collections; insight generation; large heterogeneous text collections; patenting profiles; publications; topic evolution; Context; Data mining; Data visualization; Hidden Markov models; Indexing; Market research; Patents; analyzing research trends; mining patent databases; mining publications;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
Conference_Location
Warsaw
Type
conf
DOI
10.1109/WI-IAT.2014.22
Filename
6927531
Link To Document