Author/Authors :
Guo, Rong Northeast Forestry University - Harbin, China , Teng, Zhixia Northeast Forestry University - Harbin, China , Wang, Yiding Northeast Forestry University - Harbin, China , Zhou, Xin Northeast Forestry University - Harbin, China , Xu, Heze Department of Gynecology and Obstetrics - Tongji Hospital - Tongji Medical College - Huazhong University of Science and Technology - Wuhan - Hubei, China , Liu, Dan Northeast Forestry University - Harbin, China
Abstract :
Preeclampsia (PE) is a maternal disease that causes maternal and child death. Treatment and preventive measures are not sound
enough. The problem of PE screening has attracted much attention. The purpose of this study is to screen placental mRNA to
obtain the best PE biomarkers for identifying patients with PE. We use Limma in the R language to screen out the 48
differentially expressed genes with the largest differences and used correlation-based feature selection algorithms to reduce the
dimensionality and avoid attribute redundancy arising from too many mRNA samples participating in the classification. After
reducing the mRNA attributes, the mRNA samples are sorted from large to small according to information gain. In this study, a
classifier model is designed to identify whether samples had PE through mRNA in the placenta. To improve the accuracy of
classification and avoid overfitting, three classifiers, including C4.5, AdaBoost, and multilayer perceptron, are used. We use the
majority voting strategy integrated with the differentially expressed genes and the genes filtered by the best subset method as
comparison methods to train the classifier. The results show that the classification accuracy rate has increased from 79% to
82.2%, and the number of mRNA features has decreased from 48 to 13. This study provides clues for the main PE biomarkers of
mRNA in the placenta and provides ideas for the treatment and screening of PE.
Keywords :
mRNA , Preeclampsia , Biomarkers , PE