مرکز منطقه ای اطلاع رساني علوم و فناوري - Resting-State Functional Network Scale Effects and Statistical Significance-Based Feature Selection in Machine Learning Classification

Author/Authors :

Guo, Hao Taiyuan University of Technology - Taiyuan, China , Li, Yao Taiyuan University of Technology - Taiyuan, China , Mensah, Godfred Kim Taiyuan University of Technology - Taiyuan, China , Xu, Yong Department of Psychiatry - First Hospital of Shanxi Medical University - Taiyuan, China , Chen, Junjie Taiyuan University of Technology - Taiyuan, China , Xiang, Jie Taiyuan University of Technology - Taiyuan, China , Chen, Dongwei Taiyuan University of Technology - Taiyuan, China

Abstract :

In recent years, functional brain network topological features have been widely used as classification features. Previous studies have found that network node scale differences caused by different network parcellation definitions significantly affect the structure of the constructed network and its topological properties. However, we still do not know how network scale differences affect the classification accuracy, performance of classification features, and effectiveness of the feature selection strategy using P values in terms of the machine learning method. .is study used five scale parcellations, involving 90, 256, 497, 1003, and 1501 nodes. .ree local properties of resting-state functional brain networks were selected (degree, betweenness centrality, and nodal efficiency), and the support vector machine method was used to construct classifiers to identify patients with major depressive disorder. We analyzed the impact of the five scales on classification accuracy. In addition, the effectiveness and redundancy of features obtained by the different scale parcellations were compared. Finally, traditional statistical significance (P value) was verified as a feature selection criterion. .e results showed that the feature effectiveness of different scales was similar; in other words, parcellation with more regions did not provide more effective discriminative features. Nevertheless, parcellation with more regions did provide a greater quantity of discriminative features, which led to an improvement in the accuracy of the classification. However, due to the close distance between brain regions, the redundancy of parcellation with more regions was also greater. .e traditional P value feature selection strategy is feasible with different scales, but our analysis showed that the traditional P < 0.05 threshold was too strict for feature selection. .is study provides an important reference for the selection of network scales when applying topological properties of brain networks to machine learning methods.