• DocumentCode
    2998537
  • Title

    The Development of Parallel Adaptive Sampling Algorithms for Analyzing Biological Networks

  • Author

    Dempsey, Kathryn ; Duraisamy, K. ; Bhowmick, Sourav S. ; Ali, Hamza

  • Author_Institution
    Dept. of Pathology & Microbiol., Univ. of Nebraska Med. Center, Omaha, NE, USA
  • fYear
    2012
  • fDate
    21-25 May 2012
  • Firstpage
    725
  • Lastpage
    734
  • Abstract
    The availability of biological data in massive scales continues to represent unlimited opportunities as well as great challenges in bioinformatics research. Developing innovative data mining techniques and efficient parallel computational methods to implement them will be crucial in extracting useful knowledge from this raw unprocessed data, such as in discovering significant cellular subsystems from gene correlation networks. In this paper, we present a scalable combinatorial sampling technique, based on identifying maximum chordal sub graphs, that reduces noise from biological correlation networks, thereby making it possible to find biologically relevant clusters from the filtered network. We show how selecting the appropriate filter is crucial in maintaining the key structures from the original networks and uncovering new ones after removing noisy relationships. We also conduct one of the first comparisons in two important sensitivity criteria - the perturbation due to the vertex numbers of the network and perturbations due to data distribution. We demonstrate that our chordal-graph based filter is effective across many different vertex permutations, as is our parallel implementation of the sampling algorithm.
  • Keywords
    bioinformatics; data mining; filtering theory; graph theory; knowledge acquisition; parallel algorithms; sampling methods; bioinformatics research; biological data; biological network analysis; cellular subsystems; chordal graph-based filter; data distribution; data mining techniques; gene correlation networks; knowledge extraction; maximum chordal subgraph identification; network vertex numbers; noise reduction; parallel adaptive sampling algorithms; parallel computational methods; scalable combinatorial sampling technique; sensitivity criteria; vertex permutations; Biological system modeling; Clustering algorithms; Correlation; Filtering algorithms; Noise; Program processors; chordal graphs; cluster overlap; correlation networks; edge enrichment; ordering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4673-0974-5
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2012.90
  • Filename
    6270712