• DocumentCode
    569305
  • Title

    A Maximum Likelihood Method for Detecting Bad Samples from Illumina BeadChips Data

  • Author

    Ha Anh Tuan Nguyen ; Sy Vinh Le ; Si Quang Le

  • Author_Institution
    Univ. of Eng. & Technol., Hanoi, Vietnam
  • fYear
    2012
  • fDate
    17-19 Aug. 2012
  • Firstpage
    26
  • Lastpage
    33
  • Abstract
    Genotype data provide crucial information to understand effects of genetic variation to human health. Current microarray technologies are able to generate raw genotype data from thousands of samples across million of SNP sites. These raw data are processed by computational methods, called genotype caller, to obtain genotypes. Genotype calls of different callers might not be consistent due to noise of bad samples or SNPs. This requires a manual quality control step conducted by experts to remove bad samples or bad SNP sites. In this paper, we propose a maximum likelihood method to detect bad samples to improve the reliability of the results. Experiments with real data demonstrate the usefulness of our method in the quality control process. Thus, our method has the ability to reduce the number of samples that are requested to manually check by experts.
  • Keywords
    genetics; genomics; maximum likelihood detection; quality control; sampling methods; Illumina BeadChips data; bad SNP sites; bad samples detection; computational methods; genetic variation; genotype caller; genotype data; human health; manual quality control; maximum likelihood method; microarray technologies; quality control process; raw genotype data generation; Clustering algorithms; Educational institutions; Genetics; Maximum likelihood detection; Probability; Process control; Quality control; SNP; bad samples; genotype; quality control;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Knowledge and Systems Engineering (KSE), 2012 Fourth International Conference on
  • Conference_Location
    Danang
  • Print_ISBN
    978-1-4673-2171-6
  • Type

    conf

  • DOI
    10.1109/KSE.2012.24
  • Filename
    6299394