• DocumentCode
    2589692
  • Title

    Discovery of constraints from data for information system reverse engineering

  • Author

    Lim, Wie Ming ; Harrison, John

  • Author_Institution
    Centre for Software Maintenance, Queensland Univ., St. Lucia, Qld., Australia
  • fYear
    1997
  • fDate
    29 Sep-2 Oct 1997
  • Firstpage
    39
  • Lastpage
    48
  • Abstract
    The extraction of functional dependencies is a fundamental activity in the database design recovery process which is part of an overall information systems reverse engineering effort. Existing algorithms for this task are computationally expensive and appear to be infeasible if applied to large legacy database instances, e.g., their performance deteriorated when number of attributes or/and instances is large and they cannot tolerate erroneous data that may occur in deployed commercial systems. The contributions of this paper are as follows. We propose two algorithms for discovering functional dependencies from data. The collective-FD algorithm, which is based on a top-down approach, eliminates redundant specialised functional dependencies to be proposed. The attribute-list algorithm, which is based on the bottom-up approach, enables more accurate functional dependency hypotheses to be discovered. In anticipating noisy data, we propose an effective method to discover possible data errors and compute partial functional dependencies. The result is an error-tolerant functional dependency discovery approach that is more applicable to real world databases for design recovery
  • Keywords
    data integrity; information systems; knowledge acquisition; reverse engineering; software performance evaluation; very large databases; attribute-list algorithm; collective-FD algorithm; computational expense; constraint discovery; data errors; database design recovery process; erroneous data; error-tolerant functional dependency discovery; functional dependencies; information system reverse engineering; legacy database; noisy data; performance; top-down approach; Australia; Data mining; Databases; Design engineering; Information systems; Information technology; Maintenance engineering; Process design; Reverse engineering; Software maintenance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering Conference, 1997. Proceedings., Australian
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    0-8186-8081-4
  • Type

    conf

  • DOI
    10.1109/ASWEC.1997.623753
  • Filename
    623753