DocumentCode :
3714625
Title :
Leveraging the k-Nearest Neighbors classification algorithm for Microbial Source Tracking using a bacterial DNA fingerprint library
Author :
Jeffrey D. McGovern;Alexander Dekhtyar;Christopher Kitts;Michael Black;Jennifer VanderKelen;Anya Goodman
Author_Institution :
Department of Computer Science, California Polytechnic State University, San Luis Obispo, 93407, United States
fYear :
2015
Firstpage :
1694
Lastpage :
1701
Abstract :
Fecal contamination in bodies of water is an issue that cities must combat regularly. Often, city governments must restrict access to water sources until the contaminants dissipate. Sourcing the species of the fecal matter helps curb the issue in the future, giving city governments the ability to mitigate the effects before they occur again. Microbial Source Tracking (MST) aims to determine source host species of strains of microbiological lifeforms and library-based MST is one method that can assist in sourcing fecal matter. Recently, the Biology Department in conjunction with the Computer Science Department at California Polytechnic State University San Luis Obispo (Cal Poly) teamed up to build a database called the Cal Poly Library of Pyroprints (CPLOP). Students collect fecal samples, culture and pyrosequence the E. coli in the samples, and insert this data, called pyroprints, into CPLOP. Using two intergenic transcribed spacer regions of DNA, Cal Poly biologists perform studies on strain differentiation. We propose using k-Nearest Neighbors, a straightforward machine learning technique, to classify the host species of a given pyroprint, construct four algorithms to resolve the regions, and investigate classification accuracy.
Keywords :
"Measurement","Rabbits"
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/BIBM.2015.7359930
Filename :
7359930
Link To Document :
بازگشت