Title :
Evaluating music recommendation in a real-world setting: On data splitting and evaluation metrics
Author :
Szu-Yu Chou ; Yi-Hsuan Yang ; Yu-Ching Lin
Author_Institution :
Res. Center for IT innovation, Taipei, Taiwan
fDate :
June 29 2015-July 3 2015
Abstract :
Evaluation is important to assess the performance of a computer system in fulfilling a certain user need. In the context of recommendation, researchers usually evaluate the performance of a recommender system by holding out a random subset of observed ratings and calculating the accuracy of the system in reproducing such ratings. This evaluation strategy, however, does not consider the fact that in a real-world setting we are actually given the observed ratings of the past and have to predict for the future. There might be new songs, which create the cold-start problem, and the users´ musical preference might change over time. Moreover, the user satisfaction of a recommender system may be related to factors other than accuracy. In light of these observations, we propose in this paper a novel evaluation framework that uses various time-based data splitting methods and evaluation metrics to assess the performance of recommender systems. Using millions of listening records collected from a commercial music streaming service, we compare the performance of collaborative filtering (CF) and content-based (CB) models with low-level audio features and semantic audio descriptors. Our evaluation shows that the CB model with semantic descriptors obtains a better trade-off among accuracy, novelty, diversity, freshness and popularity, and can nicely deal with the cold-start problems of new songs.
Keywords :
collaborative filtering; content-based retrieval; music; recommender systems; CB models; CF; cold-start problem; collaborative filtering; commercial music streaming service; computer system; content-based models; data evaluation metrics; low-level audio features; music recommendation evaluation; recommender systems; semantic audio descriptors; semantic descriptors; time-based data splitting methods; user musical preference; user satisfaction; Computational modeling; Density estimation robust algorithm; Encoding; FCC; Mel frequency cepstral coefficient; Silicon carbide; Collaborative filtering; cold-start; content-based recommendation; data splitting; evaluation metrics;
Conference_Titel :
Multimedia and Expo (ICME), 2015 IEEE International Conference on
Conference_Location :
Turin
DOI :
10.1109/ICME.2015.7177456