DocumentCode :
2871500
Title :
The Impact of Directionality in Predications on Text Mining
Author :
Leroy, Gondy ; Fiszman, Marcelo ; Rindflesch, Thomas C.
Author_Institution :
Claremont Graduate Univ., Claremont
fYear :
2008
fDate :
7-10 Jan. 2008
Firstpage :
228
Lastpage :
228
Abstract :
The number of publications in biomedicine is increasing enormously each year. To help researchers digest the information in these documents, text mining tools are being developed that present co-occurrence relations between concepts. Statistical measures are used to mine interesting subsets of relations. We demonstrate how directionality of these relations affects interestingness. Support and confidence, simple data mining statistics, are used as proxies for interestingness metrics. We first built a test bed of 126,404 directional relations extracted from biomedical abstracts, which we represent as graphs containing a central starting concept and 2 rings of associated relations. We manipulated directionality in four ways and randomly selected 100 starting concepts as a test sample for each graph type. Finally, we calculated the number of relations and their support and confidence. Variation in directionality significantly affected the number of relations as well as the support and confidence of the four graph types.
Keywords :
computer graphics; data mining; medical administrative data processing; biomedical abstracts; biomedicine; data mining statistics; text mining predications; text mining tools; Autism; Bioinformatics; Biomedical measurements; Data mining; Data visualization; Databases; Genomics; Statistics; Testing; Text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Hawaii International Conference on System Sciences, Proceedings of the 41st Annual
Conference_Location :
Waikoloa, HI
ISSN :
1530-1605
Type :
conf
DOI :
10.1109/HICSS.2008.443
Filename :
4438932
Link To Document :
بازگشت