Title :
Microarray gene expression data association rules mining based on JG-Tree
Author :
Jiang, Xiang-Rong ; Gruenwald, Le
Author_Institution :
Dept. of Chem. & Biochem., Oklahoma Univ., Norman, OK, USA
Abstract :
The main techniques currently employed in analyzing microarray expression data are clustering and classification. In this paper we propose using association rules to mine the association relationships among different genes under the same experimental condition. Such relationships may also exist across many different experiments with various experimental conditions. We propose a new approach for mining microarray data, called the LIS-growth (Large ItemSet growth) tree. Our approach uses a new data structure, the JG-tree (Jiang, Gruenwald), and a new data partition format for gene expression level data. Each data value can be represented by a sign bit, fraction bits, and exponent bits. The bits at the same position across data values can be organized into a JG-tree. A JG-tree is a lossless compression tree. It can be built on the fly, providing a kind of real-time compression for bit strings. Based on these two new data structures, it is possible to mine association rules efficiently from the gene expression database. Our algorithm was tested using real-life datasets from the gene expression database at Stanford University.
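The data partition format described in the abstract can be illustrated with a minimal sketch. It assumes each expression level is stored as an IEEE 754 single-precision float (1 sign bit, 8 exponent bits, 23 fraction bits); the abstract does not state the word width, and the function names and sample values below are hypothetical. The sketch covers only the grouping of bits by position. It does not build a JG-tree, whose construction the abstract does not describe.

```python
import struct

def float_to_bits(value):
    """Return the 32 bits of an IEEE 754 single-precision float, MSB first."""
    # Assumption: single precision; the abstract does not give the width.
    (packed,) = struct.unpack(">I", struct.pack(">f", value))
    return [(packed >> (31 - i)) & 1 for i in range(32)]

def bit_slices(values):
    """Group bits by position: one bit string per bit position, one bit per value."""
    rows = [float_to_bits(v) for v in values]
    return ["".join(str(row[pos]) for row in rows) for pos in range(32)]

# Made-up expression levels, used for illustration only.
levels = [1.5, -0.25, 2.0, 0.75]

slices = bit_slices(levels)
sign_slice = slices[0]        # position 0: sign bit of every value
exponent_slices = slices[1:9] # positions 1-8: exponent bits
fraction_slices = slices[9:]  # positions 9-31: fraction bits

print(sign_slice)
```

Each of the 32 resulting bit strings holds one bit position across all values, which is the kind of input the abstract says is organized into a JG-tree.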
Keywords :
DNA; biology computing; data compression; data mining; database management systems; genetics; pattern classification; pattern clustering; scientific information systems; tree data structures; DNA; JG-tree; LIS-growth tree; Large ItemSet growth tree; compression tree; data classification; data clustering; data partition format; data structure; genes; lossless tree; microarray gene expression data association rule mining; real-time compression; Association rules; Bioinformatics; Computer science; DNA; Data analysis; Data mining; Data structures; Gene expression; Genomics; Itemsets;
Conference_Title :
Database and Expert Systems Applications, 2003. Proceedings. 14th International Workshop on
Print_ISBN :
0-7695-1993-8
DOI :
10.1109/DEXA.2003.1231993