Title :
An Ontology-Based Text-Mining Method to Cluster Proposals for Research Project Selection
Author :
Ma, Jian ; Xu, Wei ; Sun, Yong-hong ; Turban, Efraim ; Wang, Shouyang ; Liu, Ou
Author_Institution :
Dept. of Inf. Syst., City Univ. of Hong Kong, Kowloon, China
fDate :
5/1/2012 12:00:00 AM
Abstract :
Research project selection is an important task for government and private research funding agencies. When a large number of research proposals are received, it is common to group them according to their similarities in research disciplines. The grouped proposals are then assigned to the appropriate experts for peer review. Current methods for grouping proposals are based on manual matching of similar research discipline areas and/or keywords. However, the exact research discipline areas of the proposals cannot often be accurately designated by the applicants due to their subjective views and possible misinterpretations. Therefore, rich information in the proposals´ full text can be used effectively. Text-mining methods have been proposed to solve the problem by automatically classifying text documents, mainly in English. However, these methods have limitations when dealing with non-English language texts, e.g., Chinese research proposals. This paper presents a novel ontology-based text-mining approach to cluster research proposals based on their similarities in research areas. The method is efficient and effective for clustering research proposals with both English and Chinese texts. The method also includes an optimization model that considers applicants´ characteristics for balancing proposals by geographical regions. The proposed method is tested and validated based on the selection process at the National Natural Science Foundation of China. The results can also be used to improve the efficiency and effectiveness of research project selection processes in other government and private research funding agencies.
Keywords :
data mining; government data processing; ontologies (artificial intelligence); optimisation; pattern classification; pattern clustering; pattern matching; project management; research and development; text analysis; China; Chinese text; English; National Natural Science Foundation; government agency; grouping proposals; manual matching; nonEnglish language text; ontology-based text mining approach; optimization model; private research funding agency; research project selection process; research proposal clustering; text document classification; Clustering algorithms; Educational institutions; Humans; Ontologies; Proposals; Sun; Vectors; Clustering analysis; decision support systems; ontology; research project selection; text mining;
Journal_Title :
Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on
DOI :
10.1109/TSMCA.2011.2172205