Title :
SiS: Significant subnetworks in massive number of network topologies
Author :
Hasan, Md Mahmudul ; Kavurucu, Yusuf ; Kahveci, Tamer
Author_Institution :
Dept. of Comput. & Inf. Sci. & Eng., Univ. of Florida, Gainesville, FL, USA
Abstract :
Availability of abundant biological network data and its noisy nature necessitates extracting reliable subnetworks hidden in it. One key step towards that goal is to discover the subnetworks that appear frequently in a collection of networks. This paper presents a method, named SiS (Significant Subnetworks), to discover most probable subnetworks (i.e., subnetworks that have the highest chance to exist) in a large collection of biological networks where each node is labeled with the corresponding molecule (such as gene or protein). SiS builds a template network which summarizes the entire set of input networks. It then grows subnetworks that are most probable with the guidance of this template network. Our experiments demonstrate that our method scales to very large datasets and subnetworks easily. On the metabolic networks of the eukaryote organisms, our method runs from a few seconds to a few minutes depending on the subnetwork size. MULE, an existing method for the same problem, takes hours or does not complete for days on the same dataset. Our results also suggest that the most probable subnetworks are often the most frequent ones as well.
Keywords :
cellular biophysics; complex networks; molecular biophysics; SiS; eukaryote organisms; metabolic networks; network topologies; noisy biological network data; significant subnetworks; subnetwork discovery; template network; Network topology; Organisms; Proteins; Space exploration; Upper bound; biological networks; freqaent subnetworks;
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2012 IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-1-4673-2559-2
Electronic_ISBN :
978-1-4673-2558-5
DOI :
10.1109/BIBM.2012.6392653