Search in:

Applied Artificial Intelligence

An International Journal

Volume 34, 2020 - Issue 10

Submit an article Journal homepage

Free access

841

Views

CrossRef citations to date

Altmetric

Listen

Research Article

Effective Context-Aware Recommendations Based on Context Weighting Using Genetic Algorithm and Alleviating Data Sparsity

Sonal LindaSchool of Computer and Systems Sciences, Jawaharlal Nehru University, New Delhi, IndiaCorrespondence[email protected]

Sonajharia MinzSchool of Computer and Systems Sciences, Jawaharlal Nehru University, New Delhi, India

K.K. BharadwajSchool of Computer and Systems Sciences, Jawaharlal Nehru University, New Delhi, India

Pages 730-753 | Published online: 16 Jun 2020

Cite this article
https://doi.org/10.1080/08839514.2020.1775011
CrossMark

In this article

ABSTRACT
Introduction
Related Work
Proposed RCGA-based CARS Framework
Experiments and Results
Conclusions and Future Directions
Footnotes
References

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
View PDF PDF View EPUB EPUB

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

Context-aware collaborative filtering (CACF) is an effective approach for adapting recommendations under users’ specific contextual situations and aims to improve predictive accuracy for Context-aware recommender systems (CARSs). Incorporating context in recommender systems (RSs) considering the equal importance to all contextual dimensions is not appropriate for seeking an intelligent and useful recommendation. In this paper, we propose a Real-coded Genetic Algorithm (RCGA) based CARS framework that exploits contextual pre-filtering and contextual modeling paradigms into CACF with appropriate context feature weights for enhancing accuracy as well as the diversity of the recommendation list. Further to alleviate the data sparsity, an effective missing value prediction (EMVP) algorithm is applied into proposed framework. The accuracy based on RCGA is compared with other two schemes: Support Vector Machine (SVM) and Particle Swarm Optimization (PSO), and RCGA has shown better results. Experimental results based on real-world datasets have clearly established the effectiveness of our proposed CARS schemes.

Introduction

At the time of exceptional growth in the importance of search technologies, many researchers and practitioners focus on how interactive systems can encourage and support the users’ behavior changing with encapsulated context. Context-aware recommender systems (CARSs) offer a new perspective of multi-dimensionality that makes the recommendation more relevant and intelligent (Adomavicius and Tuzhilin Citation2005).

CARSs differ from traditional RSs because they not only use ratings given by users for items but also exploit both the knowledge of the contextual situations under which the ratings were acquired and the target user asking for a recommendation. More specific definition of context can be as any circumstances or conditions which affect something or someone. The significance of contextual information has been identified in many disciplines including information retrieval, e-commerce personalization, ubiquitous and mobile context-aware systems, databases, data mining, marketing, and management (Adomavicius et al. Citation2011). However, the aggregation of multiple contexts gathers potentially useful information that would result in the increased predictive power of CARSs. An ideal CARS should be capable of labeling each user action with an appropriate context and effectively tailor the system output to the user in that given context.

Various approaches based on context-aware collaborative filtering (CACF) have been developed to utilize the strength of traditional collaborative filtering (CF) techniques to enhance recommendation ability of CARSs to pursue different tasks other than a product/item recommendation (Shi, Larson, and Hanjalic Citation2014; Verbert et al. Citation2012). In spite of that, the danger of sparsity arises if contextual information is applied too strictly. However, data sparsity usually complicates the process of item recommendation under neighborhood-based CF domain (Shams and Haratizadeh Citation2017). The performance of CF-based RSs is mostly evaluated through the predictive accuracy of the recommendations. Relying on the accuracy of recommendations alone may not be enough to find the most relevant items for a user and hence the diversity can be another important quality measure/criterion in the recommendation process (Adomavicius and Kwon Citation2008). Though the highly desirable feature of diversity is contrasting to accuracy, many researchers have compared several filtering techniques for CARSs in terms of accuracy and diversity (Panniello, Tuzhilin, and Gorgoglione Citation2014). The aim of our work is to identify an influential set of context features with appropriate weights effectively learned for each individual and handling sparse data that preserves both the accuracy and diversity.

The main contributions of this paper are summarized as follows:

We designed a context-aware recommendation scheme CACF, that is based on contextual pre-filtering and contextual modeling paradigms with a novel approach of context weighting using RCGA.
Our proposed scheme is compared with other well-Known schemes: Support Vector Machine (SVM) and Particle Swarm Optimization (PSO) for parameter optimization.
Further, an effective missing value prediction algorithm (EMVP) (Ma, King, and Lyu Citation2007) is incorporated into the proposed schemes to handle the sparsity problem.
Finally, we generated appropriate context feature weights using RCGA for the proposed CARS which would balance both accuracy and diversity of Top-N recommended list of items for each user.

The rest of the paper is organized as follows. Section 2 provides an overview of the state-of-the-art in the areas relevant for this research work. Section 3 presents the proposed RCGA-based CARS framework and demonstrates how the missing value prediction approach is assembled to alleviate the problem of sparsity and also presents $F_{β} - m e a s u r e$ for optimizing the tradeoff between accuracy and diversity. The experimental evaluation of aforementioned schemes is discussed in Section 4. Finally, Section 5 concludes with a discussion of the findings of our work and an outlook on future research needs and opportunities.

Related Work

With the increasing popularity of mobile apps, there is a need for CARS frameworks and models, appropriate for people who have a wide range of marketplaces where they can seek a lot of resources. A methodology is built for Context-aware mobile recommender systems, whereby users are asked to judge whether a contextual factor (e.g. Weather) influences the rating given under a certain contextual condition (e.g. The weather is cloudy) based on recommendation domain (e.g. Movie) (Campos et al. Citation2013). Furthermore, for providing location-based services like travel and tourism, various paradigms in Location-aware recommender system (LARS) have been proposed that use location-based ratings and real-world GPS datasets to produce personalized recommendation (Liu et al. Citation2013; Sarwat et al. Citation2014). Similarly, the long tail Context-aware music recommender systems (CAMRSs) can automatically play suitable music considering various users’ contextual information, such as weather, emotional state, running pace, location, time, social media activities, and low-level activities in real time that could save our time and effort (Wang et al. Citation2014).

The different application domains exhibited by different recommender algorithms show that recommendation process is not a one-size-fits-all problem. We need to have a deep understanding of choosing a recommender algorithm embedded with CARS that depends on specific domains. Most of the approaches are based on CF which strongly depends on the availability of meaningful user ratings on a large scale that leads to the problem of sparsity. Therefore, two models have been proposed: Differential Context Relaxation (DCR) and Differential Context Weighting (DCW) to deal with the problem of the sparsity of contexts (Zheng, Burke, and Mobasher Citation2013).

Techniques for Learning Context Feature Weights

In the following subsections, we briefly introduce some optimization techniques such as, SVM, PSO, and RCGA for learning context feature weights.

Support Vector Machine (SVM)

The Support Vector Machine (SVM) is a statistical learning theory, based on data mining method developed by Vapnik and the principle of Structural Risk Minimization is implemented by constructing an optimal separating hyperplane (Min and Han Citation2005; Min, Lee, and Han Citation2006). A linear SVM classifier is a hyperplane that separates all items into two classes, such as, like and dislike for each active user $u_{a}$ . Suppose $n$ training samples have pairs $(x_{1}, y_{1}), (x_{2}, y_{2}), (x_{3}, y_{3}), \dots, (x_{n}, y_{n})$ where $x_{i} = (x_{i 1}, x_{i 2}, \dots, x_{i m})$ are a set of input context features, and $x_{i} \in R^{n}$ and $y_{i} \in \{- 1, 1\}$ are corresponding outputs, 1 for like and −1 for dislike class. The task of linear SVM is to learn feature weights by mapping all pairs $(x_{i}, y_{i})$ into separating hyperplanes $u_{j} = \vec{w} \cdot \vec{x} \pm b$ , where $\vec{w}$ is the vector of context feature weights and $\vec{x}$ is the vector of input context features. The target is to maximize the distance of the hyperplanes to the nearest of the like and dislike classes. Maximizing the margin can be expressed as an optimization problem: $m i n \frac{1}{2} {\vec{w}}^{2}$ subject to $y_{i} (\vec{w} \cdot \vec{x} \pm b) \geq 1$ , $\forall i$ where $x_{i}$ is the $i^{t h}$ training samples.

Particle Swarm Optimization (PSO)

Inspired by the collective behavior of birds and fishes, and the concept of evolutionary algorithm, Particle Swarm Optimization (PSO) algorithm was developed by Kennedy and Eberhart (Citation1995). It is an evolutionary computation technique based on swarm intelligence. For solving complex optimization problems, it is easy to implement and computationally less expensive in terms of both speed and memory requirements (Osuna-Enciso et al. Citation2016). PSO starts with a population of random solutions, and each individual solution is named as “particle” which represents a potential solution. Each particle is treated as a point in $n$ -dimensional space. The $i^{t h}$ particle is represented as $z_{i} = (z_{i 1}, z_{i 2}, \dots, z_{i m})$ . The best previous position of any particle is recorded and represented as $p_{i} = (p_{i 1}, p_{i 2}, \dots, p_{i n})$ . The index of globally best particle’s position is represented as $p_{g}$ and the velocity (i.e. rate of change of a particle’s position) is represented as $v_{i} = (v_{i 1}, v_{i 2}, \dots, v_{i n})$ . The updated velocity $v_{i n}^{k + 1}$ and position $z_{i n}^{k + 1}$ of the $i^{t h}$ particle at the $k^{t h}$ iteration are:

(1)

v_{i n}^{k + 1} = σ \cdot v_{m}^{k} + l_{1} \cdot r a n d_{1} \cdot (p_{i n} - z_{i n}^{k}) + l_{2} \cdot r a n d_{2} \cdot (p_{g n} - z_{i n}^{k})

(1)

(2)

z_{i n}^{k + 1} = z_{i n}^{k} + v_{i n}^{k + 1}

(2)

where $l_{1}$ and $l_{2}$ are learning factors, $σ$ is inertia weight for balancing the global and local search, $r a n d_{1}$ and $r a n d_{2}$ are random values in the range between $0$ and $1$ .

Real-Coded Genetic Algorithm (RCGA)

Genetic Algorithm (GA) (Goldberg Citation1989) has received considerable attention toward handling any kind of objective function and any kind of constraints, i.e. linear or non-linear, defined on discrete, continuous or mixed search spaces. Most studies on feature weighting have used GA as the main heuristic method for determining weight vector (Noori Citation2015). The basic building blocks of binary GAs are genes and chromosomes. Chromosomes are evaluated by running GA for the respective parameter configuration and each GA uses a small-sized population of chromosomes to alleviate the problem of slow convergence, without losing potential solution (Wahde Citation2008). The conventional binary GA encodes the gene as a binary bit and chromosome as a string of binary bits. Whereas RCGA encodes the parameters in continuous domain representing genes as floating-point number and chromosome as a vector of floating-point numbers. In RCGA scheme, a chromosome length becomes much shorter than binary coding scheme (Blanco, Delgado, and Pegalaja Citation2001).

Proposed RCGA-based CARS Framework

The goal of the proposed framework is to enhance the capability of CARSs by improving the predictive accuracy. It combines a proposed novel aspect of context weighting using RCGA with features of state-of-the-art CARSs framework. The purpose of Context-Aware Collaborative Filtering (CACF) algorithm is to recommend a list of new items with current context for a particular active user based on his past experiences and like-minded users having experience in a similar context. The real coding approach seems particularly natural when tackling the problem of optimizing parameters with the variables in continuous domains (Herrera, Lozano, and Verdegay Citation1998). Therefore, we use RCGA to learn the individual user’s preferences for quality recommendations. The contextual pre-filtering and contextual modeling-based CACF algorithm is proposed for the CARS framework. Further to establish the superiority of our proposed RCGA, we have compared it with other two schemes, SVM and PSO.

The RCGA-based CARS framework exploits the idea of context weighting scheme into CACF algorithm, signifies the contribution of each context feature is weighted, where weighting vector $W = (w_{1}, w_{2}, \dots, w_{k})$ consists of real values lies between $\{0, 1\}$ and the sum of weights equal to $1$ . More specifically, there is a list of $n$ users $U = (u_{1}, u_{2}, \dots, u_{n})$ , a list of $m$ items $I = (i_{1}, i_{2}, \dots, i_{m})$ , and a list of $k$ context features $C = (c_{1}, c_{2}, \dots, c_{k})$ . Each user u has a list of items experienced in certain contexts. We have given a target context $C_{a}$ for an active user $u_{a}$ , we need to assess how much to weight a rating $r_{u_{a}, i, C_{j}}$ issued in some different context $C_{j}$ , subject to a weighting vector $W$ . Context weighting using RCGA can be considered as a novel approach for CARSs which requires an optimal set of context feature weights for each user.

Data Collection

To anticipate the relevance of an item for a user in a certain context, it needs to be filled with the value of each context feature through users’ past history which helps in providing personalized recommendations. The acquired contextual information is either static or dynamic in nature, but we picked up only the static nature of the context. Such information is explicitly captured, i.e. input given by users.

Neighbor Generation

One critical step of CACF is to compute the similarity between users interact with items in similar context features and identify the users with similar inclinations, which is useful for the generation of the relevant neighborhood set. The original CF-based recommendation scenario is most familiar, mature, and widely implemented to filter out the undesired list of users. The CACF follows the same scenario of CF which leverages the pervasive contextual information such that a user’s preference is not only predicted from opinions of similar users but also from feedback of other users in a context similar to that the user currently is in.

Similarity Computation of Context Features

The notion of context similarity computation is to give a higher importance to ratings of items when the computed context similarity is high. Selecting top neighbors $N_{u_{a}}, W, μ_{1}$ of the active user $u_{a}$ for target item $i$ under context feature vector $C_{a}$ satisfying threshold value $μ_{1}$ using weighted Jaccard metric $J (C_{a}, C_{j}, W)$ , we need to assess how much to weight a rating $r_{u_{a}, i, C_{j}}$ issued in some different context vector $C_{j} = (c_{1}, c_{2}, \dots, c_{k})$ subject to a weighting vector $W$ . The context similarity computation metric $J (C_{a}, C_{j}, W)$ and top neighbors $N_{u_{a}, W, μ_{1}}$ are defined as:

(3)

J (C_{a}, C_{j}, W) = \frac{\sum_{c_{f} \in C_{a} \cap C_{j}} W_{c_{f}}}{\sum_{c_{f} \in C_{a} \cup C_{j}} W_{c_{f}}}

(3)

(4)

N_{u_{a}, W, μ_{1}} = \{u : {max}_{r_{u, i}} (J (C_{a}, C_{j}, W) > μ_{1})\}

(4)

Computation of Users’ Similarity

The traditional similarity measure matrices gauge efficiently that how closely the opinions of a user’s pair match, taking into account only the ratings made by such pair. Although they rely only on computing the degree of agreement based on the set of items co-rated by the users, Pearson Correlation Coefficient (PCC) is the most popular among them (Chen Citation2005; Anand Citation2011). The modified CACF technique computed to measure users’ similarity using EquationEquations (5)(5) $T_{μ_{2}} = \{(I_{a}, C_{a}, C_{j}) : \exists r_{u_{a}, i, C_{a}}, r_{u_{j}, i, C_{j}} \cdot J (C_{a}, C_{j}, W) > μ_{2}\}$ (5) and (6) respectively.

(5)

T_{μ_{2}} = \{(I_{a}, C_{a}, C_{j}) : \exists r_{u_{a}, i, C_{a}}, r_{u_{j}, i, C_{j}} \cdot J (C_{a}, C_{j}, W) > μ_{2}\}

(5)

(6)

\begin{aligned} s i m_{W} (u_{a}, u_{j}, W, μ_{2}) \\ = \frac{\sum_{(i, C_{a}, C_{j}) \in T_{μ_{2}}} (r_{u_{a}, i, C_{a}} - {\overset{ˉ}{r}}_{u_{a}}) (r_{u_{j}, i, C_{j}} - {\overset{ˉ}{r}}_{u_{j}}) (J (C_{a}, C_{j}, W))}{\sqrt{\sum_{(i, C_{a}, C_{j}) \in T_{μ_{2}} (i, C_{a}, C_{j}) \in T_{μ_{2}}} {(r_{u_{a}, i, C_{a}} - {\overset{ˉ}{r}}_{u_{a}})}^{2} {(r_{u_{j}, i, C_{j}} - {\overset{ˉ}{r}}_{u_{j}})}^{2} {(J (C_{a}, C_{j}, W))}^{2}}} \end{aligned}

(6)

where $μ_{2}$ is second similarity threshold, $T_{μ_{2}}$ is the set of all collected items $I_{a}$ , and pair of context feature vectors $C_{a}$ and $C_{j}$ is used for users $u_{a}$ and $u_{j}$ respectively, such that each has rated $i$ in that context with $J (C_{a}, C_{j}, W) > μ_{2}$ . We follow two stages in the recommendation process leading to two paradigms contextual pre-filtering and contextual modeling. Our assumption here, the given ratings of more similar contexts are more reliable for further predictions. However, there is a limit to this effect that context features with low similarity may add irrelevant ratings to the predictions. So, we use two similarity thresholds $μ_{1}$ and $μ_{2}$ to filter ratings for each stage.

Learning Context Feature Weights Using RCGA

The similarity is computed in terms of individual context feature similarity using weighted Jaccard measure and users’ similarity using modified PCC, treats all context features equally important, and considers all weights, i.e. $w_{i}^{' s}$ are equal. This may not truly reflect the contribution of each context feature toward the similarity where users put different weights to different features (Agrawal and Bharadwaj Citation2013). To overcome such limitation, we adopt RCGA approach, which is one of the most effective and appropriate techniques for optimization problem.

Chromosome Representation

In our RCGA-based approach, each chromosome is represented as a set of weights $\{w_{1}, w_{2}, \dots, w_{10}\}$ , where each weight has two variables which indicate the maximum and minimum limits for weights in the range of valid values. Our approach shows how the weights defining users’ priorities can be evolved by the RCGA to learn the personal preferences of users and provide tailored suggestions. These weights are generated offline to every context feature for each user and determined in such a way that the sum of all weights is equal to $1$ , i.e. $\sum w_{i} = 1$ . Initially, we considered all weights are equally distributed, indicating that user $u$ is giving equal priorities to all features. The potential to the problem of evolving context feature weights $W (u_{a})$ for the active user $u_{a}$ is represented as a set of weights $W_{c f} = \{w_{1}, w_{2}, \dots, w_{10}\}$ , where $W_{c f}$ is the weight is associated with each context feature $c_{f} \in (c_{1}, c_{2}, \dots, c_{10})$ whose chromosome is a sequence of floating point numbers.

Crossover and Mutation Operators

Genetic operators are used in GA to maintain genetic diversity. In general, the gathered information resulting from GA is done by the selection mechanism is referred as exploitation, while exploration is searching for new regions within search space by Genetic operators. Although a number of crossover and mutation operators are suggested and applied for RCGA (Agrawal and Bharadwaj Citation2013). We employed the mostly used operators arithmetic crossover and uniform mutation for RCGA-based CARS. Crossover creates two new child chromosomes by allowing two parent chromosomes to exchange meaningful information, while mutation is used to maintain the genetic diversity of the population by introducing a completely new member into the parent chromosome. illustrates with an example of how RCGA operators work.

Figure 1. RCGA operators (a) Arithmetic crossover with a random value $r = 0.251$ and (b) Uniform mutation with selected gene $i = 10$ replaces with $(α_{10}, β_{10}) = 0.4854$ .

Fitness Function

Finding an appropriate fitness function is a challenging task for GA applications (Goldberg Citation1989). Each individual candidate solution in the population is assessed for its quality score known as the fitness score. A fitness function is an objective function that prescribes the optimality of a solution (chromosome) in a GA so that a particular chromosome may be ranked against all the other chromosomes. By applying GA operators, optimal chromosomes are allowed to breed and mix their datasets producing a new generation that will hopefully be better. An ideal fitness function correlates closely with the algorithm’s goal, and yet may be computed quickly. For each chromosome in the population, CACF is applied and Mean Absolute Error (MAE) is computed using EquationEquation (7)(7) $f i t n e s s (u_{a}) = \frac{1}{|S_{a}^{T E}|} \sum_{j = 1}^{|S_{a}^{T E}|} |r_{a, j} - P r_{a, j}|$ (7) as the average difference between actual rating and predicted rating for all users in the training set which is also used as a fitness score for that set of weights.

(7)

f i t n e s s (u_{a}) = \frac{1}{|S_{a}^{T E}|} \sum_{j = 1}^{|S_{a}^{T E}|} |r_{a, j} - P r_{a, j}|

(7)

Each user $u$ contains $10$ genes corresponding to weights for each context feature, which are evolved by an elitist approach. When the weight for any context feature is zero, that feature is ignored, which enables feature selection to be adaptive to each user’s preference. Such weights are used in EquationEquations (3-6)(3) $J (C_{a}, C_{j}, W) = \frac{\sum_{c_{f} \in C_{a} \cap C_{j}} W_{c_{f}}}{\sum_{c_{f} \in C_{a} \cup C_{j}} W_{c_{f}}}$ (3) to generate neighborhood set on the basis of users’ similarity and context similarity. The chromosome search space for RCGA is defined in range $[0, 1]$ , i.e. initially we assume that each weight lies between $0$ and $1$ . Finally, the chromosome that gives the minimum MAE is chosen (such as minimization problem) which is suitable for our work.

Recommendation

To make recommendation accurate for an active user $u_{a}$ and a neighborhood set matching with a similar profile of the active user, it is necessary to find items experienced by users in the neighborhood set that the active user has not experienced before. After neighbors’ selection, the next step is to utilize similarity values for the computation of predicted ratings. The predicted rating $P_{u_{a}, i, W}$ of an item $i$ for an active user $u_{a}$ is obtained by using EquationEquation (8)(8) $P_{u_{a}, i, W} = {\overset{ˉ}{r}}_{u_{a}, C_{a}} + \frac{\sum_{u \in N_{u_{a}, W, μ_{1}}} ((r_{u_{j}, i, C_{j}} - {\overset{ˉ}{r}}_{u_{j}, C_{a}}) \times s i m_{W} (u_{a}, u_{j}, W, μ_{2}))}{\sum_{u \in N_{u_{a}, W, μ_{1}}} s i m_{W} (u_{a}, u_{j}, W, μ_{2})}$ (8) .

(8)

P_{u_{a}, i, W} = {\overset{ˉ}{r}}_{u_{a}, C_{a}} + \frac{\sum_{u \in N_{u_{a}, W, μ_{1}}} ((r_{u_{j}, i, C_{j}} - {\overset{ˉ}{r}}_{u_{j}, C_{a}}) \times s i m_{W} (u_{a}, u_{j}, W, μ_{2}))}{\sum_{u \in N_{u_{a}, W, μ_{1}}} s i m_{W} (u_{a}, u_{j}, W, μ_{2})}

(8)

The main steps of the proposed RCGA-based CARS using a hybrid CACF algorithm to perform recommendation task are summarized as follows and also depicted in :

Figure 2. Schematic view of the RCGA-based CARS.

Step 1. Input data

In this step, users’ profiles, items’ profiles, and value of context features with ratings are collected in the form of $U s e r \times I t e m \times C o n t e x t s (c_{1}, c_{2}, \dots, c_{k}) \to R a t i n g s$ , where $c_{1}, c_{2}, \dots ., c_{k}$ are context features.

Step 2. Learning context feature weights using RCGA

Context feature weights for each individual are learned by applying RCGA and finding the best fitness score using EquationEquation (7)(7) $f i t n e s s (u_{a}) = \frac{1}{|S_{a}^{T E}|} \sum_{j = 1}^{|S_{a}^{T E}|} |r_{a, j} - P r_{a, j}|$ (7) . Let $\{w_{1}, w_{2}, \dots, w_{k}\}$ be the context feature weights for an active user.

Step 3. Neighborhood set generation

Compute the context similarity $J (C_{a}, C_{j}, W)$ for an active user with context feature weights using EquationEquation (3)(3) $J (C_{a}, C_{j}, W) = \frac{\sum_{c_{f} \in C_{a} \cap C_{j}} W_{c_{f}}}{\sum_{c_{f} \in C_{a} \cup C_{j}} W_{c_{f}}}$ (3) , and then generate the neighborhood set $N_{u_{a}, W, μ_{1}}$ using EquationEquation (4)(4) $N_{u_{a}, W, μ_{1}} = \{u : {max}_{r_{u, i}} (J (C_{a}, C_{j}, W) > μ_{1})\}$ (4) .
Compute the similarity $s i m_{w} (u_{a}, u_{j}, W, μ_{2})$ between all pairs of users from the generated neighborhood set in Step 3(a) with context feature weights by using EquationEquations (5)(5) $T_{μ_{2}} = \{(I_{a}, C_{a}, C_{j}) : \exists r_{u_{a}, i, C_{a}}, r_{u_{j}, i, C_{j}} \cdot J (C_{a}, C_{j}, W) > μ_{2}\}$ (5) and (6), and then generate final neighborhood set.

Step 4. Prediction computation and recommendation

Compute predicted rating $P_{u_{a}, i, W}$ based on adapted Resnick’s prediction formula by using EquationEquation (8)(8) $P_{u_{a}, i, W} = {\overset{ˉ}{r}}_{u_{a}, C_{a}} + \frac{\sum_{u \in N_{u_{a}, W, μ_{1}}} ((r_{u_{j}, i, C_{j}} - {\overset{ˉ}{r}}_{u_{j}, C_{a}}) \times s i m_{W} (u_{a}, u_{j}, W, μ_{2}))}{\sum_{u \in N_{u_{a}, W, μ_{1}}} s i m_{W} (u_{a}, u_{j}, W, μ_{2})}$ (8) and finally recommend top- $N$ list of items for $u_{a}$ .

Alleviating the Problem of Sparsity

In this section, we consider the framework for the effective missing value prediction (EMVP) follows the basic outline suggested by Ma, King, and Lyu (Citation2007). It exploits both the ratings associated with context features and feature-based similarity among the similar breeds to predict the strength of users’ preferences according to the situation not yet provided. As illustrated in , the rating scale $1 - 5$ represents the users’ fondness of or preference toward the item within some contextual situation, and shaded blocks represent items rated for a given context feature vector in which rating is not available yet. Our approach utilizes the information linked up a user’s choices or preferences with the context in which the user rated an item to predict the shaded block (missing value) if possible. Otherwise, set it to zero, as seen in . The adapted EMVP algorithm consists of two components: Neighborhood set selection and Missing value prediction.

Figure 3. The (m x n) user-item subset matrix of movie dataset: (a) before missing value prediction. (b) after missing value prediction.

Neighborhood Set Selection

To predict missing value, both users’ similarity and items’ similarity are equally important. PCC is used to measure users’ similarity and items’ similarity. It involves context feature similarity using weighted Jaccard similarity measure which determines how relevant the ratings are under a given context for the active user when prediction occurs. For predicting every missing value $r_{u_{a}, i}$ , a set of similar users $S (u_{j})$ toward user $u_{a}$ can be generated according to:

(9)

S (u_{j}) = \{u_{a} : (s i m^{'} (u_{a}, u_{j}) > φ, u_{a} \neq u_{j})\}

(9)

Meanwhile, for predicting every missing value $r_{u_{a}, i}$ , a set of similar items $S (i_{j})$ toward item $i_{a}$ can be generated according to

(10)

S (i_{j}) = \{i_{a} : (s i m^{'} (i_{a}, i_{j}) > ω, i_{a} \neq i_{j})\}

(10)

where $φ$ is a threshold parameter for users’ similarity and $ω$ is a threshold parameter for items’ similarity. If the value of similarity computed between chosen neighbor and target user computed by using EquationEquations (9)(9) $S (u_{j}) = \{u_{a} : (s i m^{'} (u_{a}, u_{j}) > φ, u_{a} \neq u_{j})\}$ (9) and (Equation10(10) $S (i_{j}) = \{i_{a} : (s i m^{'} (i_{a}, i_{j}) > ω, i_{a} \neq i_{j})\}$ (10) ) exceeds the threshold, then the neighbor is selected as a similar user or similar item, otherwise neglected.

Missing Value Prediction

The missing value prediction algorithm systematically combines contextual information into both user-based CF and item-based CF approaches to take advantage of both user correlations and item correlations in the $u s e r - i t e m - c o n t e x t$ matrix which make the prediction more accurate. It will predict the missing value only if it will bring positive influence for the recommendation of active users instead of predicting every missing value of the matrix. If $S (u_{j}) = ϕ Λ S (i_{j}) = ϕ$ then the predicted missing value $P (r_{u_{a}, i_{a}, C_{a}})$ is set to zero, whereas, in other cases, the missing values are predicted using EquationEquations (11(11) $\begin{aligned} P (r_{u_{a}, i_{a}, C_{a}}) = α^{'} \times ({\overset{ˉ}{u}}_{a} + \frac{\sum_{u_{a} ϵ S (u_{j})} s i m^{'} (u_{a}, u_{j}) (r_{u_{a}, i} - {\overset{ˉ}{u}}_{a}) J (C_{a}, C_{j}, W)}{\sum_{u_{a} ϵ S (u_{j})} s i m^{'} (u_{a}, u_{j}) J (C_{a}, C_{j}, W)}) \\ + (1 - α^{'}) \times ({\overset{ˉ}{i}}_{a} + \frac{\sum_{i_{a} \in S (i_{j})} s i m^{'} (i_{a}, i_{j}) (r_{u_{a}, i} - {\overset{ˉ}{i}}_{a}) J (C_{a}, C_{j}, W)}{\sum_{i_{a} \in S (i_{j})} s i m^{'} (i_{a}, i_{j}) J (C_{a}, C_{j}, W)}) \end{aligned}$ (11) –Equation13(13) $P (r_{u_{a}, i_{a}, C_{a}}) = ({\overset{ˉ}{i}}_{a} + \frac{\sum_{i_{a} \in S (i_{j})} s i m^{'} (i_{a}, i_{j}) (r_{u_{a}, i} - {\overset{ˉ}{i}}_{a}) J (C_{a}, C_{j}, W)}{\sum_{i_{a} \in S (i_{j})} s i m^{'} (i_{a}, i_{j}) J (C_{a}, C_{j}, W)})$ (13) ).

If $S (u_{j}) \neq ϕ Λ S (i_{j}) \neq ϕ$ , the prediction of missing value $P (r_{u_{a}, i_{a}, C_{a}})$ is defined as:

(11)

\begin{aligned} P (r_{u_{a}, i_{a}, C_{a}}) = α^{'} \times ({\overset{ˉ}{u}}_{a} + \frac{\sum_{u_{a} ϵ S (u_{j})} s i m^{'} (u_{a}, u_{j}) (r_{u_{a}, i} - {\overset{ˉ}{u}}_{a}) J (C_{a}, C_{j}, W)}{\sum_{u_{a} ϵ S (u_{j})} s i m^{'} (u_{a}, u_{j}) J (C_{a}, C_{j}, W)}) \\ + (1 - α^{'}) \times ({\overset{ˉ}{i}}_{a} + \frac{\sum_{i_{a} \in S (i_{j})} s i m^{'} (i_{a}, i_{j}) (r_{u_{a}, i} - {\overset{ˉ}{i}}_{a}) J (C_{a}, C_{j}, W)}{\sum_{i_{a} \in S (i_{j})} s i m^{'} (i_{a}, i_{j}) J (C_{a}, C_{j}, W)}) \end{aligned}

(11)

It needs to be considered those special cases when either do not get similar users’ set or similar items’ set, then it would fully utilize the information that makes predictions of missing values accurate as possible by follows.

If $S (u_{j}) \neq ϕ Λ S (i_{j}) = ϕ$ , the prediction of missing value $P (r_{u_{a}, i_{a}, C_{a}})$ is defined as

(12)

P (r_{u_{a}, i_{a}, C_{a}}) = ({\overset{ˉ}{u}}_{a} + \frac{\sum_{u_{a} ϵ S (u_{j})} s i m^{'} (u_{a}, u_{j}) (r_{u_{a}, i} - {\overset{ˉ}{u}}_{a}) J (C_{a}, C_{j}, W)}{\sum_{u_{a} ϵ S (u_{j})} s i m^{'} (u_{a}, u_{j}) J (C_{a}, C_{j}, W)})

(12)

If $S (u_{j}) = ϕ Λ S (i_{j}) \neq ϕ$ , the prediction of missing value $P (r_{u_{a}, i_{a}, C_{a}})$ is defined as

(13)

P (r_{u_{a}, i_{a}, C_{a}}) = ({\overset{ˉ}{i}}_{a} + \frac{\sum_{i_{a} \in S (i_{j})} s i m^{'} (i_{a}, i_{j}) (r_{u_{a}, i} - {\overset{ˉ}{i}}_{a}) J (C_{a}, C_{j}, W)}{\sum_{i_{a} \in S (i_{j})} s i m^{'} (i_{a}, i_{j}) J (C_{a}, C_{j}, W)})

(13)

Optimizing the Tradeoff between Accuracy and Diversity Using RCGA

The set of recommended item list should maintain a certain level of diversity with compromising certain level of accuracy, since the increasing diversity level can help commercial sites to promote long tail items sell. In order to suggest highly idiosyncratic and personalized products, one should consider offering not only accurate, but also diverse recommendations to fulfill users’ satisfaction. Our primary goal is to improve users’ satisfaction by maximizing the relevance of the suggested items to the target user. In this paper, we focus on aggregate diversity computation with measure dissimilarity between the pair of recommendation items on the basis of assigned weights to each context feature for users. For dissimilarity measure, we use the weighted Euclidean distance as given below.

(14)

d i s t (i_{a}, i_{j}, W) = \sqrt{\sum_{j = 1}^{n} W {(C_{i_{a}} - C_{i_{j}})}^{2}}

(14)

The dissimilarity $d i s t (i_{a}, i_{j}, W)$ between pair of items $i_{a}$ and $i_{j}$ is calculated by the given value of context features $C_{i_{a}}$ for item $i_{a}$ and $C_{i_{j}}$ for item $i_{j}$ respectively with learned weights $W$ assigned for active user $u_{a}$ The following steps are used to compute aggregate diversity using S-TDE (Premchaiswadi et al. Citation2013) for top- $N$ recommendations, the items must be ranked in descending order based on their predicted ratings before suggesting to active user $u_{a}$ . For evaluation of such ranking, the number of actual relevant items $|L_{a} \cap L_{p}|$ in the recommended list of a user is calculated. Where $L_{a}$ and $L_{p}$ are the target list of all items and predicted relevant items, respectively. Due to each item in a recommendation list can effect on the total diversity value of recommendations differently, Total diversity Effect (TDE) (Premchaiswadi et al. Citation2013) of items in top- $N$ recommendations rely on the dissimilarities between each pair of items in the list. So, TDE is calculated with the help of EquationEquation (15)(15) $T D E = \sum_{j = 1 \dots |L_{p}|} d i s t (i_{a}, i_{j}, W); a \neq j, C_{i_{a}} \neq C_{i_{j}}$ (15) .

(15)

T D E = \sum_{j = 1 \dots |L_{p}|} d i s t (i_{a}, i_{j}, W); a \neq j, C_{i_{a}} \neq C_{i_{j}}

(15)

The total diversity $d i v (L_{p})$ of the recommendation list $L_{p}$ is defined as the average distance between all possible pairs of items in the list is calculated using EquationEquation (16)(16) $d i v (L_{p}) = \frac{T D E}{|L_{p}| \cdot (|L_{p}| - 1) / 2}$ (16) .

(16)

d i v (L_{p}) = \frac{T D E}{|L_{p}| \cdot (|L_{p}| - 1) / 2}

(16)

Then, we measure recommendation diversity as the total number of distinct items that are being recommended across all users. We aggregate all diversity of the recommendation list $T L$ for all users to get average diversity $a v g d i v (T L)$ .

(17)

a v g d i v (T L) = \frac{\sum_{L_{p} \in T L} d i v (L_{p})}{|T L|}

(17)

Finally, we propose a set of context feature weights learned by RCGA for each individual applying in $F_{β} - m e a s u r e$ formula, which satisfy both accuracy and diversity in an optimum level. The $F_{β} - m e a s u r e$ summarized both accuracy and diversity in terms of threshold $β$ to establish a balanced relationship between the accuracy $A$ and diversity $D$ .

(18)

F_{β} - m e a s u r e (A, D) = (1 + β^{2}) \frac{A \times D}{A + D}

(18)

Experiments and Results

This section illustrates the results of the computational experiments performed to evaluate and analyze the effectiveness of the proposed RCGA-based CARS. The following subsections describe the set of experiments that we have conducted to examine the effectiveness of our new context weighting scheme. Particularly, we address the following three issues. First, we analyze the performance of the proposed CARS framework in terms of accuracy by considering two cases (a) CARS with equal context feature weights and (b) CARS with learned context feature weights. Second, in order to handle the sparsity problem we use learned context feature weights and employing EMVP algorithm. Finally, we analyze the tradeoff between accuracy and diversity of the CARS for the two real-world datasets discussed below.

Experimental Settings

To carry out experiments, we take two real-world datasets to evaluate our proposed model. In the area of movie, the movie dataset extracted from LDOS-CoMoDa dataset.Footnote¹ The description of $10$ opted context features is presented in . The Restaurant-Customer dataset extracted from UCI Machine Learning RepositoryFootnote² is also used for our experiments. Users added and rated new and existing restaurants filled with $10$ opted context features described in . We performed experiments with gathering 10 random splits into training and active users. For each random split, three different sets of sample users (3, 5, and 10) were chosen randomly as active users, and remaining users were treated as training users for proposed RCGA-based CARS. Such random splits are intended for the execution of ten-fold cross-validation, where all experiments are repeated for $s p l i t 1, s p l i t 2, \dots, s p l i t 10$ . The set of training users is used to find a set of neighbors for the active user, while the set of active users is used to test the performance of the system. To perform a context feature weights learning process for each dataset, we use different chromosome structures with $10$ weights represented in and .

Table 1. Context features in Movie Dataset.

Download CSV Display Table

Table 2. Context features in Restaurant-Customer dataset.

Download CSV Display Table

Figure 4. The structure of chromosome in Movie dataset.

Figure 5. The structure of chromosome in Restaurant-Customer dataset.

In order to demonstrate relative performances of the following schemes:

Context-aware collaborative filtering with equal weights (CE).
Context-aware collaborative filtering with learned weights using SVM (CW-SVM).
Context-aware collaborative filtering with learned weights using PSO (CW-PSO).
Context-aware collaborative filtering with learned weights using RCGA (CW-RCGA).
Proposed RCGA-based CARS framework (CW-ERCGA).
The proposed framework is further enhanced using learned weights with $F_{β} - m e a s u r e$ (CW-ERCGA-F).

We have conducted three experiments:

Experiment 1: Variation of Context feature weights depending on user-to-user using SVM, PSO, and RCGA.
Experiment 2: Resolving the problem of sparsity using EMVP algorithm with the RCGA.
Experiment 3: Learned context feature weights using RCGA for controlling the tradeoff between accuracy and diversity.

Results and Discussion

In this section, we present the results of the experimental evaluation with two different real-world datasets to show a generic approach in RCGA-based CARS framework.

Variation of Context Feature Weights Depending on User-to-user Using SVM, PSO, and RCGA

In the first experiment, we calculated the weights of each context feature for each individual user by CACF scheme using some of the parameters optimizing techniques SVM, PSO, and RCGA. We took $10$ sample sets of active users to test the importance of context features for each user and compared with other traditional CE scheme. The CE scheme computes the MAE using equal weights for each context feature contributing in similarity computation for each user $u_{i} : w_{i} = \frac{1}{10}$ and the other schemes CW-SVM uses SVM and CW-PSO uses PSO (see ) for learning context feature weights. The proposed scheme CW-RCGA uses an elitist GA for evolving the context feature weights for each user separately based on parameter values as shown in . RCGA process begins with roulette wheel selection for the next generation, followed by the real value assignment for each of the 10 genes in the range [0, 1] and the normalized weights are such that $\sum_{i = 1} w_{i} = 1$ .

Table 3. PSO parameter values used in Experiment.

Download CSV Display Table

Table 4. GA parameter values used in Experiment.

Download CSV Display Table

RCGA learns context feature weights using the actual ratings in the training set for the active user and computes the fitness score using EquationEquation (7)(7) $f i t n e s s (u_{a}) = \frac{1}{|S_{a}^{T E}|} \sum_{j = 1}^{|S_{a}^{T E}|} |r_{a, j} - P r_{a, j}|$ (7) . This process is repeated until the fitness score does not improve for $10$ consecutive generations. Therefore, the number of generations will vary with the fitness score of weights for each user as depicted in . It is proved by that each user has a different proportion of affinity toward various context features and CW-RCGA outperforms other schemes CW-PSO, CW-SVM, and CE (see ).

Table 5. Comparison of MAE using various CARS approaches CE, CW-SVM, CW-PSO and CW-RCGA in terms of accuracy using two datasets.

Download CSV Display Table

Figure 6. Variations in number of generations with fitness value for (a) Movie dataset (b) Restaurant-Customer dataset.

Figure 7. Comparison of evolved context feature weights for users in two datasets: (a) User 1 and User 2 in Movie dataset and (b) User 6 and User 7 in Restaurant-Customer dataset.

Resolving the Problem of Sparsity Using EMVP Algorithm with the RCGA

In the second experiment, we have tried to resolve the problem of high sparsity of contextual information in the rating matrix due to the fact that most users have not rated items under a similar context which is the main cause of low accuracy in the recommendation process. The EMVP algorithm works with the option of not to predict the missing value if it does not meet the predefined criteria, but it prevents from a bad prediction on missing value also. Our next idea is to propagate contextual information from one user to another in order to reduce the sparsity of contextual information.

Ideally, the configurable parameters should be set in a training set which indicate how we learned best suited CW-ERCGA’s parameters for desirable MAE. We assessed the impact of the various parameters on the predictive performance of CW-RCGA. For this purpose, we randomly choose different sets of parameters and picked the best one that gives a desirable MAE score over the testing set of active users. Naturally, the minimum MAE is attained for the best configuration of CW-ERCGA. It is evident from that the different parameters have a substantially different influence on the predictive performance of CW-ERCGA. The parameters $φ$ and $ω$ determine how many missing values that need to be predicted. If it sets too high, most of the missing values cannot be predicted, and if it sets too low, every user/item will obtain too many neighbor users/items which would cause the inaccuracy as well as an increase in the computation cost. Accordingly, to simplify our model we set $α^{'} = 0.5$ , which balances the information from users and items and takes advantage of both types of CF (item-based CF and user-based CF). We plotted the results obtained by CW-RCGA against the results from CW-ERCGA over different sets of active users (see ). The results clearly demonstrate that CW-ERCGA outperforms CW-RCGA.

Figure 8. Comparison of evolved context feature weights for users in two datasets: (a) User 1 and User 2 in Movie dataset and (b) User 6 and User 7 in Restaurant-Customer dataset.

Figure 9. Comparison of MAEs between CW-RCGA and CW-ERCGA using two datasets: (a) Movie dataset (b) Restaurant-Customer dataset.

Learned Contextual Weights Using RCGA for Controlling the Tradeoff between Accuracy and Diversity

In the third experiment, we focus on the quality of top- $N$ recommendation of items in such order that the diversity is improved, while recommendation accuracy still mentioned. The effects of the top- $N$ list on both diversity and accuracy are conversely changing direction, while the diversity tends to increase with the values of top- $N$ increase (see ). We design the experiment to establish the balance between diversity and accuracy using the $F_{β} - m e a s u r e$ approach and set the parameter value $β$ according to the preference of users on various application domains, such as, $β = 1$ for Movie dataset and $β = 0.5$ for the Restaurant-Customer dataset. Finally, we use RCGA for learning those weights to identify appropriate context features for each user that matches their criteria and give effective recommendation solutions. The results across all the evaluation triplets of accuracy, diversity, and $F_{β} - m e a s u r e$ together with the learned context feature weights are depicted in .

Figure 10. Top-N item recommendations with diversity for five samples of active users in two different datasets: (a) Movie dataset and (b) Restaurant-Customer dataset.

Figure 11. Effect of learned context feature weights on recommendation accuracy and diversity using RCGA in two CARS schemes CW-ERCGA and CW-ERCGA-F.

Conclusions and Future Directions

We have presented a real-coded genetic algorithm (RCGA) based CARS framework, where context features are weighted according to individual user’s preferences and choices. By using RCGA each user’s priority for each contextual feature is captured and that has significantly enhanced the performance in terms of both accuracy and diversity. The major issue in this approach is time complexity; however, this difficulty is resolved by performing offline learning of weights and the best set of weights are then stored on the user’s local machine, in a separate weight matrix which can be used for online recommendations (Al-Shamri and Bharadwaj Citation2008). Our work aims to address the sparsity problem in CARS by utilizing missing value prediction and optimize the tradeoff between accuracy and diversity using $F_{β} - m e a s u r e$ . We analyzed the effectiveness of different CARS schemes and compare their performance with the proposed scheme. Experimental results based on two real-world datasets show that the proposed context weighting scheme leads to a significant improvement as compared to other schemes.

One of the important directions in the future work would be toward enhancing the capability of the proposed CF-based CARS through hybridization with Reclusive method (Kant and Bharadwaj Citation2013). We would also consider the exploitation of spatio-temporal contextual information into mobile applications and incorporation of trust-distrust propagation mechanism to further enhance the recommendation accuracy of the proposed scheme (Anand and Bharadwaj Citation2013; Cao et al. Citation2008; Park, Park, and Cho Citation2015). As a further research, we would also like to extend the proposed framework of CARS considering multicriteria contextual information (Yi-Chung Citation2014) and the dynamic nature of context feature (Hee and Keith Citation2004) with different feature values for leveraging its recommendation capability to generate recommendations for both individual users as well as groups (Contreras, Maria, and Jordi Citation2015).

Notes

1. https://www.lucami.org/index.php/research/ldos-comoda-dataset/.

2. https://archive.ics.uci.edu/ml/datasets/.

References

Adomavicius, G., and Y. Kwon. 2008. Overcoming accuracy-diversity tradeoff in recommender systems: A variance-based approach. Proceedings of the 18th Workshop on Information Technology and Systems. Paris, France.
Google Scholar
Adomavicius, G., B. Mobasher, F. Ricci, and A. Tuzhilin. 2011. Context-aware recommender systems. AI Magazine 32 (3):67–80. doi:10.1609/aimag.v32i3.2364.
Web of Science ®Google Scholar
Adomavicius, G., and A. Tuzhilin. 2005. Toward the next generation of recommender systems: A survey of the state-of the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering 17 (6):734–49.
Web of Science ®Google Scholar
Agrawal, V., and K. K. Bharadwaj. 2013. A collaborative filtering framework for friends recommendation in social networks based on interaction intensity and adaptive user similarity. Social Network Analysis and Mining 3 (3):359–79. doi:10.1007/s13278-012-0083-7.
Google Scholar
Al-Shamri, M. Y. H., and K. K. Bharadwaj. 2008. Fuzzy-genetic approach to recommender system based on a novel hybrid user model. Expert Systems with Applications 35 (3):1386–99. doi:10.1016/j.eswa.2007.08.016.
Web of Science ®Google Scholar
Anand, D., and K. K. Bharadwaj. 2011. Utilizing various sparsity measures for enhancing accuracy of collaborative recommender systems based on local and global similarities. Expert System with Applications 38 (5):5101–09. doi:10.1016/j.eswa.2010.09.141.
Web of Science ®Google Scholar
Anand, D., and K. K. Bharadwaj. 2013. Pruning trust-distrust network via reliability and risk estimates for quality recommendations. Social Network Analysis and Mining 3 (1):65–84. doi:10.1007/s13278-012-0049-9.
Google Scholar
Blanco, A., M. Delgado, and M. C. Pegalaja. 2001. A real-coded genetic algorithm for training recurrent neural networks. Neural Networks 14 (1):93–105. doi:10.1016/S0893-6080(00)00081-2.
PubMed Web of Science ®Google Scholar
Campos, P. G., I. Fernandez-Tobias, I. Cantador, and F. Diez. 2013. Context-aware movie recommendations: An empirical comparison of pre-filtering, post-filtering and contextual modeling approaches. Proceedings of the 14th International Conference on E-commerce and Web Technologies, Lecture Notes in Bioinformatics, 137–49. Prague, Czech Republic.
Google Scholar
Cao, Y., R. Klamma, M. Hou, and M. Jarke. 2008. Follow me, follow you - spatiotemporal community context modeling and adaptation for mobile information systems. Proceedings of the 9th International Conference on Mobile Data Management, 108–15. Beijing, China: IEEE Computer Society.
Google Scholar
Chen, A. 2005. Context-aware collaborative filtering system: Predicting the user’s preference in the ubiquitous computing environment. Proceedings of the First International Conference on Location- and Context-Awareness, 244–53. Oberpfaffenhofen, Germany: Lecture Notes in Computer Science.
Google Scholar
Contreras, D., S. Maria, and P. Jordi. 2015. A web-based environment to support online and collaborative group recommendation scenarios. Applied Artificial Intelligence 29 (5):480–99. doi:10.1080/08839514.2015.1026661.
Web of Science ®Google Scholar
Goldberg, D. 1989. Genetic algorithms in search, optimization, and machine learning. Boston, MA: Addison-Wesley.
Google Scholar
Hee, E. B., and C. Keith. 2004. Utilizing context history to provide dynamic adaptations. Applied Artificial Intelligence 18 (6):533–48. doi:10.1080/08839510490462894.
Web of Science ®Google Scholar
Herrera, F., M. Lozano, and J. L. Verdegay. 1998. Tackling real- coded genetic algorithms: Operators and tools for behavioural analysis. Artificial Intelligence Review 12 (4):265–319. doi:10.1023/A:1006504901164.
Web of Science ®Google Scholar
Kant, V., and K. K. Bharadwaj. 2013. Integrating collaborative and reclusive methods for effective recommendations: A fuzzy bayesian approach. International Journal of Intelligent Systems 28 (11):1099–123. doi:10.1002/int.21619.
Web of Science ®Google Scholar
Kennedy, J., and R. C. Eberhart. 1995. Particle swarm optimization. Proceedings of the IEEE International Conference on Neural Networks, 1942–48. Perth, WA, Australia.
Google Scholar
Liu, Q., H. Ma, E. Chen, and H. Xiong. 2013. A survey of context-aware mobile recommendations. International Journal of Information Technology & Decision Making 12 (1):139–72. doi:10.1142/S0219622013500077.
Web of Science ®Google Scholar
Ma, H., I. King, and M. R. Lyu. 2007. Effective missing data prediction for collaborative filtering. Proceedings of the 30th annual international ACM SIGIR Conference on Research and Development in Information Retrieval, 39–46. Amsterdam, Netherlands.
Google Scholar
Min, S. H., and I. Han. 2005. Recommender systems using support vector machines. Proceedings of the 5th International Conference on Web Engineering, pp.387–93. Sydney, Australia: Lecture notes in Computer Science.
Google Scholar
Min, S. H., J. Lee, and I. Han. 2006. Hybrid genetic algorithms and support vector machines for bankruptcy prediction. Expert Systems with Applications 31 (3):652–600. doi:10.1016/j.eswa.2005.09.070.
Web of Science ®Google Scholar
Noori, B. 2015. Developing a CBR system for marketing mix planning and weighting method selection using fuzzy AHP. Applied Artificial Intelligence 29 (1):1–32. doi:10.1080/08839514.2014.962282.
Web of Science ®Google Scholar
Osuna-Enciso, V., E. Cuevas, D. Oliva, H. Sossa, and M. Pérez-Cisneros. 2016. A bio-inspired evolutionary algorithm: Allostatic optimisation. International Journal of Bio-Inspired Computation 8 (3):154–69. doi:10.1504/IJBIC.2016.076633.
Web of Science ®Google Scholar
Panniello, U., A. Tuzhilin, and M. Gorgoglione. 2014. Comparing context-aware recommender systems in terms of accuracy and diversity’. User Modeling and User-Adapted Interaction 24 (1–2):35–65. doi:10.1007/s11257-012-9135-y.
Web of Science ®Google Scholar
Park, H. S., M. H. Park, and S. B. Cho. 2015. Mobile information recommendation using multi-criteria decision making with bayesian network. International Journal of Information Technology & Decision Making 14 (2):317–38. doi:10.1142/S0219622015500017.
Web of Science ®Google Scholar
Premchaiswadi, W., P. Poompuang, N. Jongswat, and N. Premchaiswadi. 2013. Enhancing diversity-accuracy technique on user-based top-N recommendation algorithms. Proceedings of the IEEE 37th Annual Computer Software and Applications Conference Workshop, 403–08. Japan.
Google Scholar
Sarwat, M., J. J. Levandoski, A. Eldawy, and M. F. Mokbel. 2014. LARS*: An efficient and scalable Location-aware recommender system. IEEE Transaction on Knowledge and Data Engineering 26 (6):1384–99. doi:10.1109/TKDE.2013.29.
Web of Science ®Google Scholar
Shams, B., and S. Haratizadeh. 2017. Graph-based collaborative ranking. Expert Systems With Applications 67:59–70. doi:10.1016/j.eswa.2016.09.013.
Web of Science ®Google Scholar
Shi, Y., M. Larson, and A. Hanjalic. 2014. Collaborative filtering beyond the user-item matrix: A survey of the state of the art and future challenges. ACM Computing Surveys (CSUR) 47 (1):3. doi:10.1145/2556270.
Web of Science ®Google Scholar
Verbert, K., N. Manouselis, X. Ochoa, M. Wolpers, H. Drachsler, I. Bosnic, and E. Duval. 2012. Context-aware recommender systems for learning: A survey and future challenges. IEEE Transactions on Learning Technologies 5 (4):318–35. doi:10.1109/TLT.2012.11.
Web of Science ®Google Scholar
Wahde, M. 2008. Biologically inspired optimization methods: An introduction. Sweden: WIT press.
Google Scholar
Wang, M., T. Kawamura, Y. Sei, H. Nakagawa, Y. Tahara, and A. Ohsuga.2014. Context-aware music recommendation with serendipity using semantic relations. Proceedings of 3rd Joint International Conference, 17–32. Seoul, South Korea: Lecture Notes in Computer Science.
Google Scholar
Yi-Chung, H. 2014. A multicriteria collaborative filtering approach using the indifference relation and its application to initiator recommendation for group-buying. Applied Artificial Intelligence 28 (10):992–1008. doi:10.1080/08839514.2014.962279.
Web of Science ®Google Scholar
Zheng, Y., R. Burke, and B. Mobasher. 2013. Recommendation with differential context weighting. Proceedings of the 21st Conference on User Modeleling, Adaptation and Personalization, Lecture Notes in Computer Science, 7899, 152–64. Rome, Italy.
Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Download PDF

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Effective Context-Aware Recommendations Based on Context Weighting Using Genetic Algorithm and Alleviating Data Sparsity

ABSTRACT

Introduction

Related Work

Techniques for Learning Context Feature Weights

Support Vector Machine (SVM)

Particle Swarm Optimization (PSO)

Real-Coded Genetic Algorithm (RCGA)

Proposed RCGA-based CARS Framework

Data Collection

Neighbor Generation

Similarity Computation of Context Features

Computation of Users’ Similarity

Learning Context Feature Weights Using RCGA

Chromosome Representation

Crossover and Mutation Operators

Fitness Function

Recommendation

Alleviating the Problem of Sparsity

Neighborhood Set Selection

Missing Value Prediction

Optimizing the Tradeoff between Accuracy and Diversity Using RCGA

Experiments and Results

Experimental Settings

Table 1. Context features in Movie Dataset.

Table 2. Context features in Restaurant-Customer dataset.

Results and Discussion

Variation of Context Feature Weights Depending on User-to-user Using SVM, PSO, and RCGA

Table 3. PSO parameter values used in Experiment.

Table 4. GA parameter values used in Experiment.

Table 5. Comparison of MAE using various CARS approaches CE, CW-SVM, CW-PSO and CW-RCGA in terms of accuracy using two datasets.

Resolving the Problem of Sparsity Using EMVP Algorithm with the RCGA

Learned Contextual Weights Using RCGA for Controlling the Tradeoff between Accuracy and Diversity

Conclusions and Future Directions

References

Information for

Open access

Opportunities

Help and information

Effective Context-Aware Recommendations Based on Context Weighting Using Genetic Algorithm and Alleviating Data Sparsity

ABSTRACT

Introduction

Related Work

Techniques for Learning Context Feature Weights

Support Vector Machine (SVM)

Particle Swarm Optimization (PSO)

Real-Coded Genetic Algorithm (RCGA)

Proposed RCGA-based CARS Framework

Data Collection

Neighbor Generation

Similarity Computation of Context Features

Computation of Users’ Similarity

Learning Context Feature Weights Using RCGA

Chromosome Representation

Crossover and Mutation Operators

Fitness Function

Recommendation

Alleviating the Problem of Sparsity

Neighborhood Set Selection

Missing Value Prediction

Optimizing the Tradeoff between Accuracy and Diversity Using RCGA

Experiments and Results

Experimental Settings

Table 1. Context features in Movie Dataset.

Table 2. Context features in Restaurant-Customer dataset.

Results and Discussion

Variation of Context Feature Weights Depending on User-to-user Using SVM, PSO, and RCGA

Table 3. PSO parameter values used in Experiment.

Table 4. GA parameter values used in Experiment.

Table 5. Comparison of MAE using various CARS approaches CE, CW-SVM, CW-PSO and CW-RCGA in terms of accuracy using two datasets.

Resolving the Problem of Sparsity Using EMVP Algorithm with the RCGA

Learned Contextual Weights Using RCGA for Controlling the Tradeoff between Accuracy and Diversity

Conclusions and Future Directions

Notes

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date