DocumentCode
3469105
Title
Modeling Information Quality Risk in Data Mining
Author
Su, Ying ; Li, Donghong ; Peng, Jie
Author_Institution
Inst. of Sci. & Tech. Inf. of China, Beijing
fYear
2008
fDate
12-14 Oct. 2008
Firstpage
1
Lastpage
4
Abstract
Information quality (IQ) is a critical factor in the success of the data mining (DM). Therefore, it is essential to measure the risk of IQ in a data warehouse to ensure success in implementing DM. This paper presents a methodology to determine two IQ characteristics-accuracy and comprehensiveness-that are of critical importance to decision makers. This methodology can examine how the quality risks of source information affect the quality for information outputs produced using the relational algebra operations selection, projection, and Cubic product. It can be used to determine how quality risks associated with diverse data sources affect the quality of the derived data. The study resulted in the development of a model of a data cube and an algebra to support IQ risk operations on this cube. The model we present is simple and intuitive, and the algebra provides a means to concisely express complex DM queries.
Keywords
data mining; data warehouses; decision making; relational algebra; risk analysis; cubic product; data mining; data warehouse; decision making; information quality risk; relational algebra; source information; Algebra; Companies; Costs; Data mining; Data warehouses; Databases; Delta modulation; Information analysis; Quality management; Risk management;
fLanguage
English
Publisher
ieee
Conference_Titel
Wireless Communications, Networking and Mobile Computing, 2008. WiCOM '08. 4th International Conference on
Conference_Location
Dalian
Print_ISBN
978-1-4244-2107-7
Electronic_ISBN
978-1-4244-2108-4
Type
conf
DOI
10.1109/WiCom.2008.2424
Filename
4680613
Link To Document