DocumentCode
2323287
Title
On the optimal window shape for genomic signal processing
Author
Gunawan, Teddy Surya
Author_Institution
Dept. of Electr. & Comput. Eng., Int. Islamic Univ. Malaysia, Kuala Lumpur
fYear
2008
fDate
13-15 May 2008
Firstpage
252
Lastpage
255
Abstract
Segments of the DNA molecule called genes are known to carry useful information in their protein-coding regions (exons) and are responsible for protein synthesis. The most popular frequency-domain technique for gene identification uses a sliding-window DFT to locate the well-known period-3 property in DNA sequences. This paper investigates the use of parametric and nonparametric window types with the DFT-based period-3 detection method to identify coding and noncoding regions. The HMR195 dataset was selected for the performance comparison, which uses the receiver operating characteristic (ROC) curve and the area under the ROC curve (AUC) as measures. Results show that, compared with 13 other window types, the Bartlett window (nonparametric) and the Gaussian window with alpha=2.2401 (parametric) provide the optimal window shapes for identifying coding and noncoding regions.
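The sliding-window DFT period-3 measure described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the binary indicator mapping for the four bases, the window length of 351, the Gaussian window formula, and the function names `gaussian_window` and `period3_power` are all assumptions made here. Only the window types (Bartlett, Gaussian) and alpha=2.2401 come from the abstract.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def gaussian_window(length, alpha=2.2401):
    """Gaussian window; alpha is taken from the abstract.

    Assumed definition: w[n] = exp(-0.5 * (alpha * n / ((length - 1) / 2))**2)
    with n centred on zero. The paper may use a different normalisation.
    """
    n = np.arange(length) - (length - 1) / 2.0
    return np.exp(-0.5 * (alpha * n / ((length - 1) / 2.0)) ** 2)


def period3_power(seq, window):
    """Period-3 spectral power at every position of a sliding window.

    Each base is mapped to a 0/1 indicator sequence (an assumed mapping),
    the windowed DFT is evaluated at frequency 1/3 cycles per base, and the
    squared magnitudes are summed over the four bases.
    """
    length = len(window)
    n = np.arange(length)
    # Window multiplied by the DFT basis function at frequency 2*pi/3.
    kernel = window * np.exp(-2j * np.pi * n / 3.0)
    seq = seq.upper()
    power = np.zeros(len(seq) - length + 1)
    for base in "ACGT":
        indicator = np.array([1.0 if c == base else 0.0 for c in seq])
        # One row per window position, so each dot product is one DFT bin.
        frames = sliding_window_view(indicator, length)
        power += np.abs(frames @ kernel) ** 2
    return power


# Assumed window length; 351 is divisible by 3 so the bin falls at k = N/3.
WINDOW_LENGTH = 351
bartlett = np.bartlett(WINDOW_LENGTH)
gaussian = gaussian_window(WINDOW_LENGTH)

periodic = period3_power("ATG" * 200, bartlett)   # strong period-3 content
flat = period3_power("A" * 600, bartlett)         # no period-3 content
```

Thresholding the resulting power curve gives a coding/noncoding decision for each position, and sweeping that threshold is what would produce an ROC curve of the kind the abstract mentions.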
Keywords
biology computing; discrete Fourier transforms; frequency-domain analysis; genetics; molecular biophysics; proteins; sequences; signal processing; DNA molecule; DNA sequence; HMR195 dataset; ROC curve; frequency domain technique; gene identification; genomic signal processing; optimal window shape; protein coding region; protein synthesis; receiver operating characteristic; sliding window DFT technique; Area measurement; Bioinformatics; DNA; Frequency domain analysis; Genomics; Proteins; Sequences; Shape; Signal processing; Signal synthesis;
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer and Communication Engineering, 2008. ICCCE 2008. International Conference on
Conference_Location
Kuala Lumpur
Print_ISBN
978-1-4244-1691-2
Electronic_ISBN
978-1-4244-1692-9
Type
conf
DOI
10.1109/ICCCE.2008.4580606
Filename
4580606