Title :
Protein-protein interaction network constructing based on text mining and reinforcement learning with application to prostate cancer
Author :
Fei Zhu ; Quan Liu ; Xiaofang Zhang ; Bairong Shen
Author_Institution :
Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
Abstract :
As a notoriously lethal human disease, cancer has obtained much concern for a long time. There have accumulated huge amounts of literature and experimental data on cancer-related research. It is impossible for people to deal with these texts manually to discover novel information and knowledge. However, text mining has an advantage of extracting previously unknown and understandable knowledge from large amounts of texts, and forming well-defined knowledge, providing the possibility to fully taking use of the existed texts. With the proceeding of biomedical research, people have gradually realized that complex biological functions and the phenomenon of life are the results of complex interactions among a variety of biological entities, such as protein. Deeply studying protein interaction network is essential to understand life. We, adopting reinforcement learning idea, put forward an algorithm for protein interaction network constructing. With the algorithm, nodes are used to represent proteins and edges denote interactions. During the evolutionary process, a node selects with which nodes in the network it tends to interact. Keep selecting and carrying on iteration, until eventually attaining an optimal network. The network is the result of the dynamic nature of learning behavior. As a malignancy, prostate cancer has been concerned for a long time. We attain biological texts from PubMed and establish a prostate cancer protein interaction networks by the proposed methods. The results show that our proposed method is pretty good. Network topology analysis results also show that the network node degree distribution is scale-free.
Keywords :
cancer; data mining; learning (artificial intelligence); medical computing; molecular biophysics; proteins; text analysis; PubMed; biological texts; dynamic nature; evolutionary process; network topology analysis; prostate cancer; protein-protein interaction network; reinforcement learning; scale-free network node degree distribution; text mining; Learning (artificial intelligence); Prostate cancer; Protein engineering; Proteins; Text mining; prostate cancer; protein interaction network; reinforcement learning; systems biology; text mining;
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2014 IEEE International Conference on
Conference_Location :
Belfast
DOI :
10.1109/BIBM.2014.6999302