Title :
An Exploratory Study of Macro Co-changes
Author :
Jaafar, Fehmi ; Guéhéneuc, Yann-Gaël ; Hamel, Sylvie ; Antoniol, Giuliano
Author_Institution :
DGIGL, Ecole Polytech. de Montreal, Montreal, QC, Canada
Abstract :
The literature describes several approaches to identify the artefacts of programs that change together to reveal the (hidden) dependencies among these artefacts. These approaches analyse historical data, mined from version control systems, and report co-changing artefacts, which hint at the causes, consequences, and actors of the changes. We introduce the novel concepts of macro co-changes (MCC), i.e., of artefacts that co-change within a large time interval, and of dephase macro co-changes (DMCC), i.e., macro co-changes that always happen with the same shifts in time. We describe typical scenarios of MCC and DMCC and we use the Hamming distance to detect approximate occurrences of MCC and DMCC. We present our approach, Macocha, to identify these concepts in large programs. We apply Macocha and compare it in terms of precision and recall with UML Diff (file stability) and association rules (co-changing files) on four systems: Argo UML, Free BSD, SIP, and XalanC. We also use external information to validate the (approximate) MCC and DMCC found by Macocha. We thus answer two research questions showing the existence and usefulness of theses concepts and explaining scenarios of hidden dependencies among artefacts.
Keywords :
Unified Modeling Language; data mining; program diagnostics; Hamming distance; Macocha; UML Diff; association rules; co-changing artefacts; co-changing files; file stability; historical data analysis; macro co-changes; version control system; Association rules; DH-HEMTs; Hamming distance; Java; Stability analysis; Vectors; Co-changes; bit vectors; dephase macro co-change; macro co-change; stability;
Conference_Titel :
Reverse Engineering (WCRE), 2011 18th Working Conference on
Conference_Location :
Limerick
Print_ISBN :
978-1-4577-1948-6
DOI :
10.1109/WCRE.2011.47