• DocumentCode
    3254040
  • Title

    Multiple Sources Classification of Gene Position on Chromosomes Using Statistical Significance of Individual Classification Results

  • Author

    Alnemer, Loai M. ; Al-Azzam, Omar ; Chitraranjan, Charith ; Denton, Anne M. ; Bassi, Filippo M. ; Iqbal, Muhammad J. ; Kianian, Shahryar F.

  • Author_Institution
    Dept. of Comput. Sci., North Dakota State Univ., Fargo, ND, USA
  • Volume
    1
  • fYear
    2011
  • fDate
    18-21 Dec. 2011
  • Firstpage
    7
  • Lastpage
    12
  • Abstract
    In data mining applications it is common to have more than one data source available to describe the same record. For example, in biological sciences, the same genes may be characterized through many types of experiments. Which of the data sources proves to be most reliable in predictions may depend on the record in question. For some records pieces of information may be unavailable because an experiment has not yet been done, or certain type of inferences may not be applicable, such as when a gene does not have a homologue in some species. We demonstrate how multi-classifier systems can allow classification in cases where any individual source is scarce or unreliable to provide an accurate prediction model by itself. We propose a method to predict a class label using statistical significance of individual classification results. We show that the proposed approach increases the accuracy of results compared with conventional techniques in a problem related to gene mapping in wheat.
  • Keywords
    bioinformatics; cellular biophysics; crops; data mining; genetics; pattern classification; statistical analysis; biological science; chromosome; class label prediction; classification result statistical significance; data mining; data source; gene mapping; gene position; multiclassifier system; multiple source classification; wheat; Bioinformatics; Biological cells; Databases; Genomics; Prediction algorithms; Proteins; Reliability; Density-based algorithms; classifier-fusion; gene mapping; synteny information;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications and Workshops (ICMLA), 2011 10th International Conference on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    978-1-4577-2134-2
  • Type

    conf

  • DOI
    10.1109/ICMLA.2011.101
  • Filename
    6146933