DocumentCode
2589692
Title
Discovery of constraints from data for information system reverse engineering
Author
Lim, Wie Ming ; Harrison, John
Author_Institution
Centre for Software Maintenance, Queensland Univ., St. Lucia, Qld., Australia
fYear
1997
fDate
29 Sep-2 Oct 1997
Firstpage
39
Lastpage
48
Abstract
The extraction of functional dependencies is a fundamental activity in the database design recovery process, which is part of an overall information systems reverse engineering effort. Existing algorithms for this task are computationally expensive and appear to be infeasible when applied to large legacy database instances; for example, their performance deteriorates when the number of attributes and/or instances is large, and they cannot tolerate the erroneous data that may occur in deployed commercial systems. The contributions of this paper are as follows. We propose two algorithms for discovering functional dependencies from data. The collective-FD algorithm, which is based on a top-down approach, prevents redundant specialised functional dependencies from being proposed. The attribute-list algorithm, which is based on a bottom-up approach, enables more accurate functional dependency hypotheses to be discovered. In anticipation of noisy data, we propose an effective method to discover possible data errors and compute partial functional dependencies. The result is an error-tolerant functional dependency discovery approach that is more applicable to real-world databases for design recovery.
Keywords
data integrity; information systems; knowledge acquisition; reverse engineering; software performance evaluation; very large databases; attribute-list algorithm; collective-FD algorithm; computational expense; constraint discovery; data errors; database design recovery process; erroneous data; error-tolerant functional dependency discovery; functional dependencies; information system reverse engineering; legacy database; noisy data; performance; top-down approach; Australia; Data mining; Databases; Design engineering; Information systems; Information technology; Maintenance engineering; Process design; Reverse engineering; Software maintenance;
fLanguage
English
Publisher
IEEE
Conference_Titel
Software Engineering Conference, 1997. Proceedings., Australian
Conference_Location
Sydney, NSW
Print_ISBN
0-8186-8081-4
Type
conf
DOI
10.1109/ASWEC.1997.623753
Filename
623753