Title :
Efficient agglomerative hierarchical clustering for biological sequence analysis
Author :
Thuy-Diem Nguyen;Chee-Keong Kwoh
Author_Institution :
School of Computer Engineering, Nanyang Technological University, Singapore
Abstract :
Cluster analysis is an important data mining technique widely used for pattern recognition and information retrieval. In the literature, over a hundred clustering algorithms have been developed to target input datasets with different characteristics. Among these algorithms, the hierarchical clustering method is particularly useful for analyzing genetic datasets in evolutionary biology studies because of the inherent hierarchical relationships amongst the genetic sequences extracted from related organisms. However, this algorithm is computational expensive in terms of both execution time and particularly memory usage. This paper summarizes our experience in using parallel computing technologies with new algorithms to perform hierarchical sequence clustering in a more effective way without compromising the accuracy of the results.
Conference_Titel :
TENCON 2015 - 2015 IEEE Region 10 Conference
Print_ISBN :
978-1-4799-8639-2
Electronic_ISBN :
2159-3450
DOI :
10.1109/TENCON.2015.7373194