Title :
Supporting Domain Analysis through Mining and Recommending Features from Online Product Listings
Author :
Hariri, Negar ; Castro-Herrera, Carlos ; Mirakhorli, Mehdi ; Cleland-Huang, Jane ; Mobasher, Bamshad
Author_Institution :
Sch. of Comput., DePaul Univ., Chicago, IL, USA
Abstract :
Domain analysis is a labor-intensive task in which related software systems are analyzed to discover their common and variable parts. Many software projects include extensive domain analysis activities, intended to jumpstart the requirements process through identifying potential features. In this paper, we present a recommender system that is designed to reduce the human effort of performing domain analysis. Our approach relies on data mining techniques to discover common features across products as well as relationships among those features. We use a novel incremental diffusive algorithm to extract features from online product descriptions, and then employ association rule mining and the (k)-nearest neighbor machine learning method to make feature recommendations during the domain analysis process. Our feature mining and feature recommendation algorithms are quantitatively evaluated and the results are presented. Also, the performance of the recommender system is illustrated and evaluated within the context of a case study for an enterprise-level collaborative software suite. The results clearly highlight the benefits and limitations of our approach, as well as the necessary preconditions for its success.
Keywords :
Internet; data mining; groupware; learning (artificial intelligence); pattern classification; recommender systems; software engineering; association rule mining; data mining techniques; domain analysis activity; enterprise-level collaborative software suite; feature extraction; feature mining; feature recommendation algorithms; incremental diffusive algorithm; k-nearest neighbor machine learning method; labor-intensive task; online product descriptions; online product listings; recommender system; software projects; software systems; Algorithm design and analysis; Clustering; Clustering algorithms; Data mining; Domain analysis; Electronic mail; Feature extraction; Nearest neighbor search; Recommender systems; Domain analysis; association rule mining; clustering; k-nearest neighbor; recommender systems;
Journal_Title :
Software Engineering, IEEE Transactions on
DOI :
10.1109/TSE.2013.39