DocumentCode
139698
Title
Cloud-scale genomic signals processing classification analysis for gene expression microarray data
Author
Harvey, Benjamin ; Soo-Yeon Ji
Author_Institution
Bowie State Univ., Bowie, MD, USA
fYear
2014
fDate
26-30 Aug. 2014
Firstpage
1843
Lastpage
1846
Abstract
As microarray data available to scientists continues to increase in size and complexity, it has become overwhelmingly important to find multiple ways to bring inference though analysis of DNA/mRNA sequence data that is useful to scientists. Though there have been many attempts to elucidate the issue of bringing forth biological inference by means of wavelet preprocessing and classification, there has not been a research effort that focuses on a cloud-scale classification analysis of microarray data using Wavelet thresholding in a Cloud environment to identify significantly expressed features. This paper proposes a novel methodology that uses Wavelet based Denoising to initialize a threshold for determination of significantly expressed genes for classification. Additionally, this research was implemented and encompassed within cloud-based distributed processing environment. The utilization of Cloud computing and Wavelet thresholding was used for the classification 14 tumor classes from the Global Cancer Map (GCM). The results proved to be more accurate than using a predefined p-value for differential expression classification. This novel methodology analyzed Wavelet based threshold features of gene expression in a Cloud environment, furthermore classifying the expression of samples by analyzing gene patterns, which inform us of biological processes. Moreover, enabling researchers to face the present and forthcoming challenges that may arise in the analysis of data in functional genomics of large microarray datasets.
Keywords
DNA; RNA; cloud computing; data analysis; genetics; genomics; inference mechanisms; medical signal processing; signal classification; signal denoising; wavelet transforms; Cloud computing; DNA sequence data analysis; GCM; biological inference; biological processes; cloud-based distributed processing environment; cloud-scale genomic signals processing classification analysis; functional genomics; gene expression microarray data; gene pattern analysis; global cancer map; mRNA sequence data analysis; wavelet based denoising; wavelet classification; wavelet preprocessing; wavelet thresholding; Bioinformatics; Gene expression; Genomics; Signal processing; Tumors; Wavelet analysis; Wavelet transforms;
fLanguage
English
Publisher
ieee
Conference_Titel
Engineering in Medicine and Biology Society (EMBC), 2014 36th Annual International Conference of the IEEE
Conference_Location
Chicago, IL
ISSN
1557-170X
Type
conf
DOI
10.1109/EMBC.2014.6943968
Filename
6943968
Link To Document