DocumentCode
2634902
Title
Hypotheses Generation Pertaining to Ayurveda Using Automated Vocabulary Generation and Transitive Text Mining
Author
Vaka, Harsha Gopal Goud ; Mukhopadhyay, Snehasis
Author_Institution
Dept. of Comput. & Inf. Sci., Indiana Univ.-Purdue Univ. Indianapolis, Indianapolis, IN, USA
fYear
2009
fDate
19-21 Aug. 2009
Firstpage
200
Lastpage
205
Abstract
Automated extraction of knowledge from voluminous documents is a vast research area. Text mining is a promising approach for extracting knowledge from unstructured textual documents. The objective of this paper is to mine documents pertaining to Ayurveda, which are retrieved from PubMed into a databank, and find novel transitive associations among biological objects. This paper discusses the extraction of biological objects from the databank using an Automated Vocabulary Discovery (AVD) algorithm. A text-mining process is described for finding transitive (novel) associations among the extracted biological objects. The text mining algorithm, in addition to identifying novel associations (termed hypotheses), also assigns a numerical significance score to them. The expectation is that those with higher score have greater likelihood of being true than those with lower scores. Experimental results as well as their validation results are presented, demonstrating that the method has the potential to predict novel and interesting true associations.
Keywords
data mining; medical information systems; text analysis; vocabulary; Ayurveda; PubMed; automated knowledge extraction; automated vocabulary discovery algorithm; automated vocabulary generation; biological objects; databank; mine documents; novel associations; transitive associations; transitive text mining; unstructured textual documents; voluminous documents; Abstracts; Computer networks; Data mining; Databases; Diseases; Information retrieval; Information science; Information systems; Text mining; Vocabulary; Ayurveda; automated vocabulary discovery; breadth first search; text mining; transitive closure;
fLanguage
English
Publisher
ieee
Conference_Titel
Network-Based Information Systems, 2009. NBIS '09. International Conference on
Conference_Location
Indianapolis, IN
Print_ISBN
978-1-4244-4746-6
Electronic_ISBN
978-0-7695-3767-2
Type
conf
DOI
10.1109/NBiS.2009.30
Filename
5350040
Link To Document