مرکز منطقه ای اطلاع رساني علوم و فناوري - Feature selection and classification of microarray gene expression data of ovarian carcinoma patients using weighted voting support vector machine

Title of article :

Feature selection and classification of microarray gene expression data of ovarian carcinoma patients using weighted voting support vector machine

Author/Authors :

Masoum، M. A. S. نويسنده , , GHAHERI ، S. نويسنده Department of Analytical Chemistry, Faculty of Chemistry, University of Kashan, I. R. Iran ,

Issue Information :

دوفصلنامه با شماره پیاپی 0 سال 2013

Pages :

From page :

163

To page :

175

Abstract :

داده هاي بيان ژني ميكروآرايه اي DNA حجم عظيمي از اطلاعات مربوط به هزاران متغير (ژن) را در اختيار ما قرار مي دهند. تجزيه و تحليل اين اطلاعات مي تواند منجر به تشخيص علت ژنتيكي بيماري ها و اختلاف هاي بين تومور هاي سزطاني شود. در اين مطالعه تلاش شده است تا با كاهش ابعاد اين نوع داده ها به كمك روش هاي آماري ، ژن هاي ارزشمند كه در ايجاد سرطان تخمدان موثرند، به عنوان بيوماركر انتخاب شده وسپس تومور هاي سزطان تخمدان مربوط به دو گروه از بيماران بر اساس داده هاي بيان ژني آنها طبقه بندي ودسته بندي شوند. يك گروه از اين بيماران با درمان هاي رايج معالجه شده وگروه ديگر مداوا نشده وبعد از مدتي فوت شده اند. ما از الگوريتم وزن-راي براي كاهش ابعاد داده ها و انتخاب ژن هاي موثر استفاده كرده ايم. در اين حالت با حذف داده هاي نامطلوب و مزاحم تجزيه وتحليل داده ها آسان تر مي شود. در مرحله بعد از الگوريتم تجزيه تمايزي كمترين مربعات جزيي (PLS-DA ) و روش ماشين هاي بردار پايه (SVM ) براي طبقه بندي داده هاي كاهش يافته استفاده شده است. نتايج نشان مي دهد هر چند الگوريتم PLS-DA تا حدودي موفق به ايجاد تمايز بين دو گروه بيماران مي شود ولي روش SVM طبقه بندي موثرتر و قابل قبول تري ارايه مي كند.

Abstract :

We can reach by DNA microarray gene expression to such wealth of information with thousands of variables (genes). Analysis of this information can show genetic reasons of disease and tumor differences. In this study we try to reduce high-dimensional data by statistical method to select valuable genes with high impact as biomarkers and then classify ovarian tumor based on gene expression data of two patient groups. One group treated by standard therapies and survived, while another group didn’t be cure and die after some times. In the first step we used weighted voting algorithm (WVA) for selecting impressive genes to reduce dimension, therefore eliminate noisy data and make analysis easier and then partial least square – discriminante analysis (PLS-DA) and support vector machine (SVM) methods have been applied for classification of diminished data. Results show that classification by PLS-DA can distinguish two groups somewhat but SVM is more efficient and sufficient classification method.

Journal title :

Iranian Journal of Mathematical Chemistry

Serial Year :

2013

Journal title :

Iranian Journal of Mathematical Chemistry

Record number :

1595125

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=1595125