Title :
Discovering Similarities for Content-Based Recommendation and Browsing in Multimedia Collections
Author :
Lehinevych, Taras ; Kokkinis-Ntrenis, Nikolaos ; Siantikos, Giorgos ; Dogruoz, A. Seza ; Giannakopoulos, Theodoros ; Konstantopoulos, Stasinos
Author_Institution :
Fac. of Comput. Sci., Nat. Univ. of “Kyiv-Mohyla Acad.”, Kiev, Ukraine
Abstract :
The purpose of the research described in this paper is to examine the existence of correlation between low level audio, visual and textual features and movie content similarity. In order to focus on a well defined and controlled case, we have built a small dataset of movie scenes from three sequel movies. In addition, manual annotations have led to a ground-truth similarity matrix between the adopted scenes. Then, three similarity matrices (one for each medium) have been computed based on Gaussian Mixture Models (audio and visual) and Latent Semantic Indexing (text). We have evaluated the automatically extracted similarities along with two simple fusion approaches and results indicate that the low-level features can lead to an accurate representation of the movie content. In addition, the fusion approach seems to outperform the individual modalities, which is a strong indication that individual modules lead to diverse similarities (in terms of content). Finally, we have evaluated the extracted similarities for different groups of human annotators, based on what a human interprets as similar and the results show that different groups of people correlate better with different modalities. This last result is very important and can be either used in (a) a personalized content-based retrieval and recommender system and (b) in a local weighted fusion approach, in future research.
Keywords :
Gaussian processes; audio signal processing; cinematography; content management; content-based retrieval; feature extraction; image fusion; image sequences; indexing; matrix algebra; mixture models; multimedia systems; recommender systems; text analysis; Gaussian mixture models; browsing; content-based recommendation; ground-truth similarity matrix; latent semantic indexing; local weighted fusion approach; low level audio features; manual annotations; movie content representation; movie content similarity; movie scenes; multimedia collections; optical flow; personalized content-based retrieval; recommender system; textual features; three sequel movies; visual features; Collaboration; Feature extraction; Motion pictures; Multimedia communication; Recommender systems; Visualization; audio features; fusion; movie recommendation; multimedia signal analysis; optical flow; recommender systems; similarity; visual features;
Conference_Titel :
Signal-Image Technology and Internet-Based Systems (SITIS), 2014 Tenth International Conference on
DOI :
10.1109/SITIS.2014.98