DocumentCode
3656940
Title
Discover trending domains using fusion of supervised machine learning with natural language processing
Author
Shilpa Lakhanpal;Ajay Gupta;Rajeev Agrawal
Author_Institution
Western Michigan University, Kalamazoo, MI, U.S.A
fYear
2015
fDate
7/1/2015 12:00:00 AM
Firstpage
893
Lastpage
900
Abstract
In this paper, a new technique is presented for mining key domain areas from scientific publications. A domain refers to a particular branch of scientific knowledge and hence largely defines the theme of any scientific research paper. The proposed technique stems from a fusion of knowledge derived from natural language processing and machine learning. Some words or phrases are extracted based on their meaning inferred by the application of preposition disambiguation. These key words or phrases are then classified as domain areas using supervised learning. Various experiments and their analyses yield concrete results validating the efficacy and application of our methodology. The fusion technique therefore extracts an interesting aspect of research from scientific text and hence propounds a hybrid methodology for deriving meaning from underlying text. This approach thus takes a definitive step in advancing text analytics.
Keywords
"Natural language processing","Hidden Markov models","Mathematical model","Feature extraction","Supervised learning","Computers","Data mining"
Publisher
ieee
Conference_Titel
Information Fusion (Fusion), 2015 18th International Conference on
Type
conf
Filename
7266654
Link To Document