DocumentCode :
607722
Title :
Impact of characteristics in viral integration hotspots on classification performance
Author :
Gumus, E. ; Kursun, O. ; Sertbas, A.
Author_Institution :
Bilgisayar Muhendisligi Bolumu, Istanbul Univ., Istanbul, Turkey
fYear :
2013
fDate :
24-26 April 2013
Firstpage :
1
Lastpage :
4
Abstract :
Main reason of genetic defects is the disorders in gene regions which are responsible for coding the proteins necessary for normal body functions. By gene therapy, the regions with disorders can be detected and their genetic content can be changed for good. These regions may have special characteristics in terms of nucleotide dispersion which are beyond the known statistical norms of genome. In this study, such a characteristic is defined and its effect on predicting the strand direction of genomic reads (classification) is analyzed. By the analyses, it is observed that Canonical Correlation Analysis (CCA) method outperforms well known Support Vector Machines (SVM) approach considering the discrimination of reads according to their strand directions.
Keywords :
bioinformatics; correlation methods; genetics; genomics; medical computing; medical disorders; pattern classification; proteins; CCA method; canonical correlation analysis; classification performance; disorder region detection; gene region; gene therapy; genetic content; genetic defect; genome; genomic read; normal body function; nucleotide dispersion; protein coding; reads discrimination; strand directions; viral integration hotspot; Bioinformatics; Correlation; Encoding; Genomics; Proteins; Support vector machines; canonical correlation analysis; genome sequencing; viral integration;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications Applications Conference (SIU), 2013 21st
Conference_Location :
Haspolat
Print_ISBN :
978-1-4673-5562-9
Electronic_ISBN :
978-1-4673-5561-2
Type :
conf
DOI :
10.1109/SIU.2013.6531383
Filename :
6531383
Link To Document :
بازگشت