Title of article :
Predicting with the quantify intensities of transcription factor-target genes binding using random forest technique
Author/Authors :
K. AL-Mashanji, Ameer University of Babylon, Hilla, Iraq , Z. AL-Rashid, Sura University of Babylon, Hilla, Iraq
Pages :
17
From page :
145
To page :
161
Abstract :
With the rapid development of technology, this development led to the emergence of microarray technology. It has the eect of studying the levels of gene expression in a way that makes it easier for researchers to observe the expression levels of millions of genes at the same time in a single exper- iment. Development also helped in the emergence of powerful tools to identify interactions between target genes and regulatory factors. The main aim of this study is to build models to predicate the relationship (Interaction) between Transcription Factors (TFs) proteins and target genes by selecting the subset of important genes (Relevant genes) from original data set. The proposed methodology comprises into three major stages: the genes selection, merge data sets and the prediction stage. The process of reducing the computational space of gene data has been accomplished by using proposed mutual information method for genes selection based on the data of gene expression. In the predic- tion, the proposed prediction regression techniques are utilized to predict with binding rate between single TF-target gene. It has been compared the eciency of two dierent proposed regression tech- niques including: Linear Regression and Random Forest Regression. Two available data sets have been utilized to achieve the objectives of this study: Gene's expression data of Yeast Cell Cycle data set and Transcription Factors data set. The evaluation of predictions performance has been performed depending on two performance prediction measures (Root Mean Square Error (RMSE) and Mean Absolute Error (MAE) with (10) Folds-Cross Validation.
Keywords :
Microarray Technology , Gene Expression , Genes Selection , Prediction Techniques , Transcription Factors Proteins
Journal title :
International Journal of Nonlinear Analysis and Applications
Serial Year :
2021
Record number :
2701582
Link To Document :
بازگشت