Title :
Integration strategies for toxicity data from an empirical perspective
Author :
Longzhi Yang ; Neagu, Daniel
Author_Institution :
Dept. of Comput. Sci. & Digital Technol., Northumbria Univ., Newcastle upon Tyne, UK
Abstract :
The recent development of information techniques, especially the state-of-the-art “big data” solutions, enables the extracting, gathering, and processing large amount of toxicity information from multiple sources. Facilitated by this technology advance, a framework named integrated testing strategies (ITS) has been proposed in the predictive toxicology domain, in an effort to intelligently jointly use multiple heterogeneous toxicity data records (through data fusion, grouping, interpolation/extrapolation etc.) for toxicity assessment. This will ultimately contribute to accelerating the development cycle of chemical products, reducing animal use, and decreasing development costs. Most of the current study in ITS is based on a group of consensus processes, termed weight of evidence (WoE), which quantitatively integrate all the relevant data instances towards the same endpoint into an integrated decision supported by data quality. Several WoE implementations for the particular case of toxicity data fusion have been presented in the literature, which are collectively studied in this paper. Noting that these uncertainty handling methodologies are usually not simply developed from conventional probability theory due to the unavailability of big datasets, this paper first investigates the mathematical foundations of these approaches. Then, the investigated data integration models are applied to a representative case in the predictive toxicology domain, with the experimental results compared and analysed.
Keywords :
Big Data; chemical engineering computing; chemical industry; data integration; probability; production engineering computing; toxicology; ITS; WoE; animal use; chemical products; consensus processes; data integration models; data quality; empirical perspective; information techniques; integrated decision; integrated testing strategies; integration strategies; predictive toxicology domain; probability theory; state-of-the-art big data solutions; toxicity assessment; toxicity data; weight of evidence; Bayes methods; Data integration; Equations; Mathematical model; Reliability; Uncertainty;
Conference_Titel :
Computational Intelligence (UKCI), 2014 14th UK Workshop on
Conference_Location :
Bradford
DOI :
10.1109/UKCI.2014.6930153