DocumentCode :
2634902
Title :
Hypotheses Generation Pertaining to Ayurveda Using Automated Vocabulary Generation and Transitive Text Mining
Author :
Vaka, Harsha Gopal Goud ; Mukhopadhyay, Snehasis
Author_Institution :
Dept. of Comput. & Inf. Sci., Indiana Univ.-Purdue Univ. Indianapolis, Indianapolis, IN, USA
fYear :
2009
fDate :
19-21 Aug. 2009
Firstpage :
200
Lastpage :
205
Abstract :
Automated extraction of knowledge from voluminous documents is a vast research area. Text mining is a promising approach for extracting knowledge from unstructured textual documents. The objective of this paper is to mine documents pertaining to Ayurveda, which are retrieved from PubMed into a databank, and find novel transitive associations among biological objects. This paper discusses the extraction of biological objects from the databank using an Automated Vocabulary Discovery (AVD) algorithm. A text-mining process is described for finding transitive (novel) associations among the extracted biological objects. The text mining algorithm, in addition to identifying novel associations (termed hypotheses), also assigns a numerical significance score to them. The expectation is that those with higher score have greater likelihood of being true than those with lower scores. Experimental results as well as their validation results are presented, demonstrating that the method has the potential to predict novel and interesting true associations.
Keywords :
data mining; medical information systems; text analysis; vocabulary; Ayurveda; PubMed; automated knowledge extraction; automated vocabulary discovery algorithm; automated vocabulary generation; biological objects; databank; mine documents; novel associations; transitive associations; transitive text mining; unstructured textual documents; voluminous documents; Abstracts; Computer networks; Data mining; Databases; Diseases; Information retrieval; Information science; Information systems; Text mining; Vocabulary; Ayurveda; automated vocabulary discovery; breadth first search; text mining; transitive closure;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Network-Based Information Systems, 2009. NBIS '09. International Conference on
Conference_Location :
Indianapolis, IN
Print_ISBN :
978-1-4244-4746-6
Electronic_ISBN :
978-0-7695-3767-2
Type :
conf
DOI :
10.1109/NBiS.2009.30
Filename :
5350040
Link To Document :
بازگشت