Title :
The Quality Preserving Database: A Computational Framework for Encouraging Collaboration, Enhancing Power and Controlling False Discovery
Author :
Aharoni, Ehud ; Neuvirth, Hani ; Rosset, Saharon
Author_Institution :
Machine Learning & Data Mining Group, Haifa University Campus, Haifa, Israel
Abstract :
A common scenario in computational biology, in which a community of researchers conducts multiple statistical tests on one shared database, gives rise to the multiple hypothesis testing problem. Conventional procedures for solving this problem control the probability of false discovery by sacrificing some of the power of the tests. We suggest a scheme for controlling false discovery without any power loss: new samples are added for each use of the database, and the user is charged for the expense. The crux of the scheme is a carefully crafted pricing system that fairly prices different user requests based on their demands while keeping the probability of false discovery bounded. We demonstrate this idea in the context of HIV treatment research, where multiple researchers conduct tests on a repository of HIV samples.
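The trade-off described in the abstract can be illustrated with a minimal sketch. It is not the paper's pricing scheme: it assumes a one-sided z-test with unit variance, uses the Bonferroni correction as a stand-in for the "conventional procedures", and all function names and default values (`delta`, `alpha`, `target_power`) are hypothetical choices made for illustration. The sketch shows that tightening the significance level for m tests lowers power at a fixed sample size, and estimates how many extra samples would restore the original power.

```python
from math import ceil, sqrt
from statistics import NormalDist

_STD = NormalDist()  # standard normal distribution


def power(n, delta, alpha):
    """Power of a one-sided z-test (unit variance) with n samples at level alpha.

    delta is the assumed true effect size (illustrative, not from the paper).
    """
    z_crit = _STD.inv_cdf(1 - alpha)
    return 1 - _STD.cdf(z_crit - delta * sqrt(n))


def samples_needed(delta, alpha, target_power):
    """Smallest n for which the z-test above reaches target_power."""
    z_crit = _STD.inv_cdf(1 - alpha)
    z_pow = _STD.inv_cdf(target_power)
    return ceil(((z_crit + z_pow) / delta) ** 2)


def extra_samples_for_tests(m, delta=0.5, alpha=0.05, target_power=0.8):
    """Extra samples that offset a Bonferroni correction for m tests.

    Bonferroni tests each hypothesis at alpha / m, which keeps the
    family-wise error rate at most alpha but reduces power unless n grows.
    """
    base = samples_needed(delta, alpha, target_power)
    corrected = samples_needed(delta, alpha / m, target_power)
    return corrected - base


if __name__ == "__main__":
    n = samples_needed(0.5, 0.05, 0.8)
    for m in (1, 10, 100):
        print(m, round(power(n, 0.5, 0.05 / m), 3), extra_samples_for_tests(m))
```

Under these assumptions, the extra samples returned by `extra_samples_for_tests` play the role of the cost a user would bear for adding tests; how the actual scheme prices requests is defined in the paper, not here.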
Keywords :
database management systems; medical computing; microorganisms; patient treatment; probability; statistical analysis; HIV treatment research; collaboration; computational biology; false discovery; multiple hypothesis testing problem; multiple statistical tests; power loss; pricing system; quality preserving database; user requests; Bioinformatics; Communities; Databases; Pricing; Testing; Bonferroni method; Family-wise error rate; multiple comparisons; Biomedical Research; Data Interpretation, Statistical
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2010.105