DocumentCode
2578491
Title
Clustering gene expression data using Shannon´s entropy
Author
Mohanapriya, S. ; Elavarasi, S. Anitha ; Akilandeswari, J.
Author_Institution
Dept. of Comput. Sci. & Eng., Sona Coll. of Technol., Salem, India
fYear
2011
fDate
3-5 June 2011
Firstpage
1116
Lastpage
1120
Abstract
Clustering is a process of grouping a set of physical or abstract objects into classes of similar objects. The purpose of clustering gene expression data is to discover the natural data structures and gain some information regarding data distribution. It can be done with the help of clustering method. Hierarchical clustering groups´ data objects into a tree of clusters. Traditional clustering algorithms uses proximity measures to identify clusters with spherical shapes and is more sensitive in the presence of outliers. The most common proximity measures used are Euclidean distance, Manhattan distance, and Pearson correlation co-efficient. In this paper, Shannon´s entropy is used as a proximity measure. By using this entropy, we can able to capture the local structure of the input dataset regardless of their shapes and it is very less sensitive to outliers. It also helps to reduce the time complexity involved in identifying the gene clusters. The characteristics of the gene clusters which are produced as a result of this algorithm can be identified with the help of Gene Ontology (GO).
Keywords
computational complexity; entropy; genetics; ontologies (artificial intelligence); pattern clustering; tree data structures; Euclidean distance; Manhattan distance; Pearson correlation coefficient; Shannon entropy; cluster tree; data distribution; gene clusters; gene expression data clustering; gene ontology; hierarchical clustering; natural data structures; time complexity; Clustering algorithms; Clustering methods; Data mining; Entropy; Gene expression; Ontologies; Shape; Gene Ontology; Gene expression data; Hierarchical Clustering; Shannon´s entropy;
fLanguage
English
Publisher
ieee
Conference_Titel
Recent Trends in Information Technology (ICRTIT), 2011 International Conference on
Conference_Location
Chennai, Tamil Nadu
Print_ISBN
978-1-4577-0588-5
Type
conf
DOI
10.1109/ICRTIT.2011.5972412
Filename
5972412
Link To Document