Title :
Automatic Detection of HIV Drug Resistance-Associated Mutations
Author :
Cheng, Betty Y. ; Carbonell, Jaime G.
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
Each HIV-1 patient has a diverse population of virus strains in his/her body as the virus quickly replicates and mutates, requiring a combination drug therapy optimized to the patient´s unique viral population. Towards this goal, prediction systems have been developed to deduce the susceptibility of a given HIV genotype to a single drug. Many are rule-based systems or rely on hand-crafted features which are difficult to update for HIV strains and new drugs. We adapted the vector-of-n-grams approach from document classification and chi-square feature selection to automatically generate a feature set that yields comparable performance to the expert-selected and database-derived feature sets without requiring treatment history data. Our automatically-generated feature set also found all the expert-selected mutations and more demonstrating its potential for knowledge discovery. Compared to the previous state-of-the-art with ample expert knowledge, our best fully-automated prediction model for each drug yielded comparable performance at 82.9% classification accuracy and 0.819 coefficient of determination on average. Along with its lack of need for human expertise and potential for knowledge discovery, our automatic feature selection method is a good candidate for the more complex prediction task of combination drug therapy optimization.
Keywords :
data mining; drugs; medical computing; optimisation; patient diagnosis; HIV drug resistance-associated mutations; HIV genotype; automatic detection; drug therapy optimization; knowledge discovery; viral population; virus strains; Accuracy; Artificial neural networks; Classification algorithms; Classification tree analysis; Drugs; Human immunodeficiency virus; Immune system; Classification; Feature selection; Regression;
Conference_Titel :
Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4244-9211-4
DOI :
10.1109/ICMLA.2010.83