DocumentCode :
569305
Title :
A Maximum Likelihood Method for Detecting Bad Samples from Illumina BeadChips Data
Author :
Ha Anh Tuan Nguyen ; Sy Vinh Le ; Si Quang Le
Author_Institution :
Univ. of Eng. & Technol., Hanoi, Vietnam
fYear :
2012
fDate :
17-19 Aug. 2012
Firstpage :
26
Lastpage :
33
Abstract :
Genotype data provide crucial information to understand effects of genetic variation to human health. Current microarray technologies are able to generate raw genotype data from thousands of samples across million of SNP sites. These raw data are processed by computational methods, called genotype caller, to obtain genotypes. Genotype calls of different callers might not be consistent due to noise of bad samples or SNPs. This requires a manual quality control step conducted by experts to remove bad samples or bad SNP sites. In this paper, we propose a maximum likelihood method to detect bad samples to improve the reliability of the results. Experiments with real data demonstrate the usefulness of our method in the quality control process. Thus, our method has the ability to reduce the number of samples that are requested to manually check by experts.
Keywords :
genetics; genomics; maximum likelihood detection; quality control; sampling methods; Illumina BeadChips data; bad SNP sites; bad samples detection; computational methods; genetic variation; genotype caller; genotype data; human health; manual quality control; maximum likelihood method; microarray technologies; quality control process; raw genotype data generation; Clustering algorithms; Educational institutions; Genetics; Maximum likelihood detection; Probability; Process control; Quality control; SNP; bad samples; genotype; quality control;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Knowledge and Systems Engineering (KSE), 2012 Fourth International Conference on
Conference_Location :
Danang
Print_ISBN :
978-1-4673-2171-6
Type :
conf
DOI :
10.1109/KSE.2012.24
Filename :
6299394
Link To Document :
بازگشت