Title of article :
Resting-State Functional Network Scale Effects and Statistical Significance-Based Feature Selection in Machine Learning Classification
Author/Authors :
Guo, Hao Taiyuan University of Technology - Taiyuan, China , Li, Yao Taiyuan University of Technology - Taiyuan, China , Mensah, Godfred Kim Taiyuan University of Technology - Taiyuan, China , Xu, Yong Department of Psychiatry - First Hospital of Shanxi Medical University - Taiyuan, China , Chen, Junjie Taiyuan University of Technology - Taiyuan, China , Xiang, Jie Taiyuan University of Technology - Taiyuan, China , Chen, Dongwei Taiyuan University of Technology - Taiyuan, China
Abstract :
In recent years, functional brain network topological features have been widely used as classification features. Previous studies
have found that network node scale differences caused by different network parcellation definitions significantly affect the
structure of the constructed network and its topological properties. However, we still do not know how network scale
differences affect the classification accuracy, performance of classification features, and effectiveness of the feature selection
strategy using P values in terms of the machine learning method. .is study used five scale parcellations, involving 90, 256, 497,
1003, and 1501 nodes. .ree local properties of resting-state functional brain networks were selected (degree, betweenness
centrality, and nodal efficiency), and the support vector machine method was used to construct classifiers to identify patients
with major depressive disorder. We analyzed the impact of the five scales on classification accuracy. In addition, the effectiveness and redundancy of features obtained by the different scale parcellations were compared. Finally, traditional
statistical significance (P value) was verified as a feature selection criterion. .e results showed that the feature effectiveness of
different scales was similar; in other words, parcellation with more regions did not provide more effective discriminative
features. Nevertheless, parcellation with more regions did provide a greater quantity of discriminative features, which led to an
improvement in the accuracy of the classification. However, due to the close distance between brain regions, the redundancy of
parcellation with more regions was also greater. .e traditional P value feature selection strategy is feasible with different scales,
but our analysis showed that the traditional P < 0.05 threshold was too strict for feature selection. .is study provides an
important reference for the selection of network scales when applying topological properties of brain networks to machine
learning methods.
Keywords :
Significance-Based , Resting-State , Machine , Classification
Journal title :
Computational and Mathematical Methods in Medicine