Title of article :
Analysis of nasopharyngeal carcinoma risk factors with Bayesian networks
Author/Authors :
Aussem، نويسنده , , Alex and de Morais، نويسنده , , Sérgio Rodrigues and Corbex، نويسنده , , Marilys، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2012
Abstract :
Objectives
pose a new graphical framework for extracting the relevant dietary, social and environmental risk factors that are associated with an increased risk of nasopharyngeal carcinoma (NPC) on a case–control epidemiologic study that consists of 1289 subjects and 150 risk factors.
s
ramework builds on the use of Bayesian networks (BNs) for representing statistical dependencies between the random variables. We discuss a novel constraint-based procedure, called Hybrid Parents and Children (HPC), that builds recursively a local graph that includes all the relevant features statistically associated to the NPC, without having to find the whole BN first. The local graph is afterwards directed by the domain expert according to his knowledge. It provides a statistical profile of the recruited population, and meanwhile helps identify the risk factors associated to NPC.
s
ive experiments on synthetic data sampled from known BNs show that the HPC outperforms state-of-the-art algorithms that appeared in the recent literature. From a biological perspective, the present study confirms that chemical products, pesticides and domestic fume intake from incomplete combustion of coal and wood are significantly associated with NPC risk. These results suggest that industrial workers are often exposed to noxious chemicals and poisonous substances that are used in the course of manufacturing. This study also supports previous findings that the consumption of a number of preserved food items, like house made proteins and sheep fat, are a major risk factor for NPC.
sion
e valuable data mining tools for the analysis of epidemiologic data. They can explicitly combine both expert knowledge from the field and information inferred from the data. These techniques therefore merit consideration as valuable alternatives to traditional multivariate regression techniques in epidemiologic studies.
Keywords :
feature selection , Epidemiology , Bayesian networks , Nasopharyngeal carcinoma , Machine Learning , Predictive modeling
Journal title :
Artificial Intelligence In Medicine
Journal title :
Artificial Intelligence In Medicine