DocumentCode :
2998537
Title :
The Development of Parallel Adaptive Sampling Algorithms for Analyzing Biological Networks
Author :
Dempsey, Kathryn ; Duraisamy, K. ; Bhowmick, Sourav S. ; Ali, Hamza
Author_Institution :
Dept. of Pathology & Microbiol., Univ. of Nebraska Med. Center, Omaha, NE, USA
fYear :
2012
fDate :
21-25 May 2012
Firstpage :
725
Lastpage :
734
Abstract :
The availability of biological data in massive scales continues to represent unlimited opportunities as well as great challenges in bioinformatics research. Developing innovative data mining techniques and efficient parallel computational methods to implement them will be crucial in extracting useful knowledge from this raw unprocessed data, such as in discovering significant cellular subsystems from gene correlation networks. In this paper, we present a scalable combinatorial sampling technique, based on identifying maximum chordal sub graphs, that reduces noise from biological correlation networks, thereby making it possible to find biologically relevant clusters from the filtered network. We show how selecting the appropriate filter is crucial in maintaining the key structures from the original networks and uncovering new ones after removing noisy relationships. We also conduct one of the first comparisons in two important sensitivity criteria - the perturbation due to the vertex numbers of the network and perturbations due to data distribution. We demonstrate that our chordal-graph based filter is effective across many different vertex permutations, as is our parallel implementation of the sampling algorithm.
Keywords :
bioinformatics; data mining; filtering theory; graph theory; knowledge acquisition; parallel algorithms; sampling methods; bioinformatics research; biological data; biological network analysis; cellular subsystems; chordal graph-based filter; data distribution; data mining techniques; gene correlation networks; knowledge extraction; maximum chordal subgraph identification; network vertex numbers; noise reduction; parallel adaptive sampling algorithms; parallel computational methods; scalable combinatorial sampling technique; sensitivity criteria; vertex permutations; Biological system modeling; Clustering algorithms; Correlation; Filtering algorithms; Noise; Program processors; chordal graphs; cluster overlap; correlation networks; edge enrichment; ordering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0974-5
Type :
conf
DOI :
10.1109/IPDPSW.2012.90
Filename :
6270712
Link To Document :
بازگشت