Go top
Paper information

Building user profiles based on sequences for content and collaborative filtering

P. Sánchez, A. Bellogín

Information Processing & Management Vol. 56, nº. 1, pp. 192 - 211

Summary:

Modeling user profiles is a necessary step for most information filtering systems – such as recommender systems – to provide personalized recommendations. However, most of them work with users or items as vectors, by applying different types of mathematical operations between them and neglecting sequential or content-based information. Hence, in this paper we study how to propose an adaptive mechanism to obtain user sequences using different sources of information, allowing the generation of hybrid recommendations as a seamless, transparent technique from the system viewpoint. As a proof of concept, we develop the Longest Common Subsequence (LCS) algorithm as a similarity metric to compare the user sequences, where, in the process of adapting this algorithm to recommendation, we include different parameters to control the efficiency by reducing the information used in the algorithm (preference filter), to decide when a neighbor is considered useful enough to be included in the process (confidence filter), to identify whether two interactions are equivalent (δ-matching threshold), and to normalize the length of the LCS in a bounded interval (normalization functions). These parameters can be extended to work with any type of sequential algorithm.

We evaluate our approach with several state-of-the-art recommendation algorithms using different evaluation metrics measuring the accuracy, diversity, and novelty of the recommendations, and analyze the impact of the proposed parameters. We have found that our approach offers a competitive performance, outperforming content, collaborative, and hybrid baselines, and producing positive results when either content- or rating-based information is exploited.


Keywords: Hybrid recommender systems; Preference filtering; Content-based filtering; Collaborative filtering; Longest Common Subsequence


JCR Impact Factor and WoS quartile: 4,787 - Q1 (2019); 8,600 - Q1 (2022)

DOI reference: DOI icon https://doi.org/10.1016/j.ipm.2018.10.003

Published on paper: January 2019.

Published on-line: October 2018.



Citation:
P. Sánchez, A. Bellogín, Building user profiles based on sequences for content and collaborative filtering. Information Processing & Management. Vol. 56, nº. 1, pp. 192 - 211, January 2019. [Online: October 2018]


pdf Preview
Request Request the document to be emailed to you.