DocumentCode :
3520122
Title :
Multiway Clustering for Creating Biomedical Term Sets
Author :
Kandylas, Vasileios ; Ungar, Lyle ; Sandler, Ted ; Jensen, Shane
Author_Institution :
Comput. & Inf. Sci., Univ. of Pennsylvania, Philadelphia, PA
fYear :
2008
fDate :
3-5 Nov. 2008
Firstpage :
449
Lastpage :
452
Abstract :
We present an EM-based clustering method that can be used for constructing or augmenting ontologies such as MeSH. Our algorithm simultaneously clusters verbs and nouns using both verb-noun and noun-noun co-occurrence pairs. This strategy provides greater coverage of words than using either set of pairs alone, since not all words appear in both datasets. We demonstrate it on data extracted from Medline and evaluate the results using MeSH and Wordnet.
Keywords :
bioinformatics; ontologies (artificial intelligence); pattern clustering; statistical analysis; word processing; EM-based clustering method; MeSH; Wordnet; multiway clustering; ontologies; Bioinformatics; Biomedical computing; Clustering algorithms; Clustering methods; Data mining; Information science; Mutual information; Natural language processing; Ontologies; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine, 2008. BIBM '08. IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-0-7695-3452-7
Type :
conf
DOI :
10.1109/BIBM.2008.25
Filename :
4684937
Link To Document :
بازگشت