Title :
Effective data mining using neural networks
Author :
Lu, Hongjun ; Setiono, Rudy ; Liu, Huan
Author_Institution :
Dept. of Inf. Syst. & Comput. Sci., Nat. Univ. of Singapore, Singapore
fDate :
12/1/1996 12:00:00 AM
Abstract :
Classification is one of the data mining problems receiving great attention recently in the database community. The paper presents an approach to discover symbolic classification rules using neural networks. Neural networks have not been thought suited for data mining because how the classifications were made is not explicitly stated as symbolic rules that are suitable for verification or interpretation by humans. With the proposed approach, concise symbolic rules with high accuracy can be extracted from a neural network. The network is first trained to achieve the required accuracy rate. Redundant connections of the network are then removed by a network pruning algorithm. The activation values of the hidden units in the network are analyzed, and classification rules are generated using the result of this analysis. The effectiveness of the proposed approach is clearly demonstrated by the experimental results on a set of standard data mining test problems
Keywords :
classification; deductive databases; knowledge acquisition; neural nets; very large databases; accuracy rate; activation values; classification; concise symbolic rules; database community; effective data mining; hidden units; knowledge acquisition; network pruning algorithm; neural networks; redundant connections; rule extraction; standard data mining test problems; symbolic classification rules; verification; Classification tree analysis; Data mining; Databases; Decision trees; Error analysis; Humans; Intelligent networks; Machine learning; Neural networks; Testing;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on