• DocumentCode
    2634902
  • Title

    Hypotheses Generation Pertaining to Ayurveda Using Automated Vocabulary Generation and Transitive Text Mining

  • Author

    Vaka, Harsha Gopal Goud ; Mukhopadhyay, Snehasis

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Indiana Univ.-Purdue Univ. Indianapolis, Indianapolis, IN, USA
  • fYear
    2009
  • fDate
    19-21 Aug. 2009
  • Firstpage
    200
  • Lastpage
    205
  • Abstract
    Automated extraction of knowledge from voluminous documents is a vast research area. Text mining is a promising approach for extracting knowledge from unstructured textual documents. The objective of this paper is to mine documents pertaining to Ayurveda, which are retrieved from PubMed into a databank, and find novel transitive associations among biological objects. This paper discusses the extraction of biological objects from the databank using an Automated Vocabulary Discovery (AVD) algorithm. A text-mining process is described for finding transitive (novel) associations among the extracted biological objects. The text mining algorithm, in addition to identifying novel associations (termed hypotheses), also assigns a numerical significance score to them. The expectation is that those with higher score have greater likelihood of being true than those with lower scores. Experimental results as well as their validation results are presented, demonstrating that the method has the potential to predict novel and interesting true associations.
  • Keywords
    data mining; medical information systems; text analysis; vocabulary; Ayurveda; PubMed; automated knowledge extraction; automated vocabulary discovery algorithm; automated vocabulary generation; biological objects; databank; mine documents; novel associations; transitive associations; transitive text mining; unstructured textual documents; voluminous documents; Abstracts; Computer networks; Data mining; Databases; Diseases; Information retrieval; Information science; Information systems; Text mining; Vocabulary; Ayurveda; automated vocabulary discovery; breadth first search; text mining; transitive closure;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Network-Based Information Systems, 2009. NBIS '09. International Conference on
  • Conference_Location
    Indianapolis, IN
  • Print_ISBN
    978-1-4244-4746-6
  • Electronic_ISBN
    978-0-7695-3767-2
  • Type

    conf

  • DOI
    10.1109/NBiS.2009.30
  • Filename
    5350040