• No results found

Recommendation System for Grocery Store Considering Data Sparsity

N/A
N/A
Protected

Academic year: 2021

Share "Recommendation System for Grocery Store Considering Data Sparsity"

Copied!
8
0
0

Loading.... (view fulltext now)

Full text

(1)

Procedia Computer Science 60 ( 2015 ) 1406 – 1413

1877-0509 © 2015 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Peer-review under responsibility of KES International doi: 10.1016/j.procs.2015.08.216

ScienceDirect

19th International Conference on Knowledge Based and Intelligent Information and Engineering

Systems

Recommendation system for grocery store considering data sparsity

Natsuki Sano

a,∗

, Natsumi Machino

b

, Katsutoshi Yada

c

, Tomomichi Suzuki

a

aTokyo University of Science, 2641 Yamazaki, Noda 278-8510, Japan bRecruit Communications Co., Ltd. 1-13-1 Kachidoki, Chuo-ward 104-0054, Japan

cKansai University, 3-3-35 Yamate-cho, Suita 564-8680, Japan

Abstract

In grocery stores, large-scale transaction data with identification, such as point of sales (POS) data, is being accumulated as a result of the introduction of frequent shopper programs. We propose two recommendation systems based on transaction data of a grocery store. In recommending product items in grocery stores, data sparsity is a problem. This is because individual customers only purchase very few of the total number of product items a store sells. We evaluate various recommendation methods including SVD-type recommendation based on real POS data and summarize methods suitable for the proposed recommendation systems.

c

 2015 The Authors. Published by Elsevier B.V. Peer-review under responsibility of KES International.

Keywords: Recommendation;grocery store;data sparsity;singlur value decomposition;nonlinear principal component analysis

1. Introduction

In grocery stores, large-scale transaction data with identification, such as point of sales (POS) data, is being accu-mulated as a result of the introduction of frequent shopper programs (FSPs). The accuaccu-mulated POS data have been used to examine customer shopping behavior, especially by professionals in the marketing field1,2.

Although the recommendations based on this data are often adopted in e-commerce shopping stores3, they are rarely introduced in face-to-face selling, such as in brick-and-mortar grocery stores. Therefore, introducing a system based on these recommendations to grocery stores could induce customers to visit the store to make a purchase.

We propose two recommended systems based on stored POS data, and these are shown in Figure 1. The first system gathers the e-mail address during the registration procedure and directly determines recommended products based on the stored POS data and sends reminders with discount information to customers by e-mail as shown in Figure 1 (a).

When constructing this system for grocery stores, the sparsity of evaluation values presents a problem. Evaluation values are constructed based on customers’ purchase frequency of product items and is very sparse. This is because individual customers only purchase very few of the total number of product items a store sells.

We alleviated the problem of data sparsity by proposing a system in which recommended product items are deter-mined by a two-step procedure as shown in Figure 1 (b). First, the system determines recommended product categories

Corresponding author. Tel.:+81-4-7124-1501-3841 ; fax: +81-4-7122-4566.

E-mail address: [email protected]

© 2015 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

(2)

(a) Direct recommendation of product item

(b) Two-step recommendation of product item

Fig. 1. Recommendation system for grocery stores

enabling the store manager to determine recommended product items according to the circumstances of the particular grocery store; for example, specific product items may have to be cleared out from inventory or could be purchased at a lower price than usual from the manufacturer. Discount information relating to the recommended product items are sent to remind customers by e-mail similar to the direct system.

The problem of data sparsity is often addressed technically by adopting singular value decomposition (SVD) in the recommended system3,4,5. In our work we evaluate various recommended methods, including SVD-type rec-ommendations, based on real POS data. A representative recommendation method is (i) user-based collaborative filtering (CF). A representative SVD-type recommendation method is (ii) recommendation based on evaluation values reconstructed by SVD, which factorizes the original evaluation matrix, thereby reducing it to low rank matrices and reconstructing the evaluation matrix. We also evaluate another type of SVD recommendation, (iii) recommendation by item-user similarity by SVD and (iv) recommendation by using a combination of CF and SVD. In addition, we evaluate (v) recommendation by nonlinear principal component analysis (NL-PCA)6, also known as sandglass-type neural networks (SNN).

The results of numerical experiments show that recommendation by SVD reconstruction is suitable for the recom-mendation of product items via the direct recomrecom-mendation method, whereas CF is suitable for the recomrecom-mendation of product categories via the two-step recommendation method.

The remainder of this article is organized as follows. Section 2 describes the POS data obtained from a real grocery store and the approach followed to construct a matrix of evaluation values. Section 3 explains the recommendation methods that were examined in the numerical experiment using real data. Section 4 presents the comparison results for product item and product category recommendations. Section 5 presents a summary of the study and offers conclusions and subjects for future studies.

2. Data Description

We employ POS data with identification gathered during May and June of 2009 in the Kanto area of Japan. The POS data reflect customers’ purchase behavior, namely, who purchases what item, when, how many, and how much. In addition, the POS data contain information about the product category, that is, a large classification of the purchased product items. Examples of product items and categories are shown in Table 1.

We estimated the user’s evaluation value of a product item (category) by using a scale from 0 to 5 by aggregating the number of purchases (i.e., the purchase frequency) of the specific product item (category) for each user. If the purchase frequency of items exceeded 5, then the evaluation value was considered as 5. The format of the matrix consisting of the evaluation valuesX is listed in Table 2.

Each row and column corresponds to the respective product item and user. An element of X, xi j denotes the evaluation value of product item i by customer j. n and p denote the number of product items and users, respectively. When the evaluation value matrix for product item recommendation was constructed, the values that were used for n and p were 8674 and 6997, respectively.

(3)

Table 1. Examples of product items and categories

Product item Product category

Takanashi low-fat milk (1L) Milk CGC 100% apple juice (1L) Juice Morinaga pudding (85g×3) Dessert Nippon ham margherita pizza (1whole) Pizza Yamazaki double soft (6 slices) Bread Meiji bulgar plain yogurt (450g) Yogurt

Yukijirushi butter (200g) Butter & Margarine

Morinaga pino Ice cream

QBB baby cheese (4 pieces 60g) Cheese Iodine egg hikari (6 eggs) Egg

Shimadaya udon Noodle

CGC soybean tofu kinu (350g) Tofu

... ...

Table 2. Construction of evaluation matrix

(a) Matrix of evaluation value of product item (category)

customer 1 customer 2 . . . customer j . . . customer p product item (product category) 1 x11 x12 . . . x1 j . . . x1p

product item (product category) 2 x21 x22 . . . x2 j . . . x2p

..

. ... ... ... ... ... ...

product item (product category) i xi1 xi2 . . . xi j . . . xip

..

. ... ... ... ... ... ...

product item (product category) n xn1 xn2 . . . xn j . . . xnp

An evaluation value for product item recommendation is very sparse, since the number of product items a customer purchases is very few among all the products on offer n. This sparsity of evaluation values could result in product item recommendation performing poorly. We alleviated the sparsity problem by constructing a matrix of evaluation values for the product categories. For this evaluation the values of n and p were 228 and 6997, respectively.

The sparsity of the evaluation value for product items was compared with that for product categories by defining a measure of non-sparsity as the ratio of the number of non-zero elements in the matrix of evaluation values among all the elements, n× p. The results of the determination of the sparsity measure are shown in Table 3.

Table 3. Comparison between sparsity of evaluation values of product items and categories

Product item Product category

0.032% 8.75%

These results show that although the non-sparsity of product items has the value of 0.032, that of the product categories has the value of 8.75; therefore, the non-sparsity of the evaluation value matrix of product categories is increased 273.44 times compared to that of product items.

3. Recommendation methods for comparison

In this section we present a comparison between the five recommended methods. The predictive recommendation values for each method is computed for item i by user j, ri j, i= 1, . . . , n, j = 1, . . . , p. For an active user a, the values of riai= 1, . . . , n are sorted in decreasing order and the top m and unpurchased items are recommended to active user

(4)

3.1. User-based collaborative filtering

Collaborative filtering is the most popular recommendation method and we employ user-based collaborative filter-ing, which computes a recommendation value based on similarity between users as follows:

ri j= ¯xj+ N l=1sl jxi j p l=1|sl j| , (1)

where ¯xjdenotes j-th user’s mean ¯xj= (nl=1xl j)/n, x∗i j= xi j− ¯xjdenotes user j’s evaluation of item i that subtracts user j’s mean ¯xj, and sl jdenotes the similarity between user l and user j by using the Pearson correlation

sl j=

(xl− ¯xl)(xj− ¯xj) ||xl− ¯xl|| · ||xj− ¯xj||,

(2) where xj = (x1 j, x2 j, . . . , xn j) denotes j-th users evaluation vector and ¯xj = ( ¯xj, ¯xj, . . . , ¯xj) denotes n-length repli-cates of j-th user’s mean. Note that N denotes the number of neighborhoods and significant to the recommendation performance.

3.2. Recommendation by SVD-reconstructed evaluation values

ˆ

Xk, the least-square approximation ofX, is obtained by using the SVD technique. ˆ

Xk= UkDkVk, (3) whereUkandVkdenotes n× k and p × k orthogonal matrix, Dkdenotes k× k diagnol matrix. ˆxi j, the i- j element of ˆX is available for the predictive recommendation value of the i-th item for the j-th user, ri j. The dimension of the latent class, k is a significant parameter for evaluating recommendation performance.

3.3. Recommendation by item-user similarity by SVD

Similarity between the i-th item and the j-th user by using SVD is expressed as follows:

suseri j ,item= (ui− ¯ui)

(v j− ¯vj) ||ui− ¯ui|| · ||vj− ¯vj||,

(4)

i = 1, . . . , n, j = 1, . . . , p, where ui denotes the i-th vector ofUk andvj denotes the j-th vector ofVk. Further, ¯

ui = (¯ui, ¯ui, . . . , ¯ui) denotes replicates of the mean of the i-th item ¯ui= (lk=1uil)/k and ¯vj = (¯vj, ¯vj, . . . , ¯vj) denotes replicates of the j-th user’s mean ¯vj= (kl=1vjl)/k. suseri j ,itemis also available for the predictive recommendation value of the i-th item of the j-th user, ri j.

3.4. Recommendation by combination of CF and SVD

Although ordinal user-based collaborative filtering recommends prospective items based on similarities between users in Equation (2), we combine collaborative filtering and latent similarity between users by using SVD, such as in

suseri j ,user= (vi− ¯vi)

(v j− ¯vj) ||vi− ¯vi|| · ||vj− ¯vj||

. (5)

Equation (5) is substituted to sl jin equation (1) and the predictive recommendation value of the i-th item for the j-th user, ri jis calculated.

(5)

Fig. 2. Topology of sandglass-type neural networks

3.5. Recommendation by non-linear PCA

We apply non-linear PCA6(NL-PCA) to recommendation systems by using sandglass-type neural networks (SNN). Vozalis7applied non-linear PCA to recommendation by combining this method with collaborative filtering. We used the output values of SNN directly as predictive recommendation value. A topology of SNN is shown in Figure 2.

The input signal in the input layer and instruction signal in the output layer are the same and the input-output relation is learned through the back-propagation algorithm in NL-PCA. The number of units in the middle layer, u is smaller than the number of units in the input and output layer. Information from the input layer is compressed to reduce the dimension of the middle layer and reconstructed in the output layer. This means that NL-PCA uses a nonlinear reconstruction to execute the input signal.

In the proposed recommendation system, the input and instruction signals are evaluation values of the product items or categories and the number of units in the input and output layer is n. The weights between units are learned, such that the sum of the square error in the output layer is minimized. After learning, the evaluation values are once again input into the input units, and the output values in the output units are available for the predictive recommendation value of the i-th item for the j-th user, ri j.

4. Numerical experiments based on real data

4.1. Performance measure

We sampled 2% of the elements from the evaluation matrixX, making up missing values intentionally, and com-plimenting them with 0 values. In other words, we assumed that actually purchased products are not purchased and performed estimations by using the recommendation methods in Section 3. The sets consisting of the sampled items, hit items determined by the recommendation method, and the recommended items, were termed testset, hitset, and recset respectively. By using these sets, the recommendation methods are compared through precision, recall, and

F-value as follows:

recall = |testset||hitset| precision= |recset||hitset|

F-value = 2×recall×precisionrecall+precision

We conducted the above procedures 10 times and evaluated the recommendation methods by using the average value of recall, precision, and F-value.

(6)

Table 4. Performance comparison of recommendation method for product items with m= 5

Recommendation method optimal parameter precision recall F-value

User-based CF N= 30 0.0030 0.0271 0.0054

SVD reconstruction k= 17 0.0040 0.0358 0.0072

SVD similarity k= 4 0.0033 0.0296 0.0059

CF & SVD k= 17, N = 390 0.0030 0.0273 0.0055

NL-PCA – – – –

4.2. Performance evaluation of item recommendation

We evaluated the recommendation algorithms for product item recommendation in terms of recall, precision, and F-value with the number of recommended items for active users, m = 5. The parameters of the recommendation methods were optimized by maximizing the F-value.

Table 4 shows the results of the optimized parameters and the value of the performance measure. The F-value of recommendation by SVD reconstruction shows the highest value (0.0072), followed by recommendation by SVD similarity (0.0059), recommendation by CF & SVD (0.0055), and CF (0.0054). The horizontal bar in the row of NL-PCA shows that the algorithm could not be executed, as the number of input units, i.e., the number of product items, n is 8574, which exceeds the admissibility condition of the number of input units in thennet package of R.

4.3. Performance evaluation of category recommendation

For recommendation of the product category, we conducted an evaluation for each recommendation method. Table 5(a) lists the results of the optimized parameters and value of the performance measure with m = 3. The F-value of user-based CF shows the highest value (0.0429), followed by recommendation by SVD reconstruction (0.0407), recommendation by CF & SVD (0.0388), NL-PCA (0.0387), and SVD similarity (0.0036).

Table 5(b) lists the results of optimized parameters and values of the performance measure with m = 4. F-value of recommend by CF shows the highest value (0.0398), followed by recommend by SVD reconstruction (0.0375), recommend by CF & SVD (0.0363), NL-PCA (0.0357), and SVD similarity (0.0039).

Table 5(c) lists the results of the optimized parameters and values of the performance measure with m = 5. The F-value of recommendation by CF shows the highest value (0.0377), followed by recommendation by SVD recon-struction (0.0362), NL-PCA (0.0348), recommendation by CF & SVD (0.0347), and SVD similarity (0.0037).

Therefore, user-based CF is superior to the other recommendation methods for product categories for all values of

m in all the cases. On the contrary, the recommend by SVD similarity is inferior to the other methods.

4.4. Performance comparison of item recommendation with category recommendation

We compared the best performance of product item recommendation with that of product category recommendation with m= 5. The F-value of recommendation by SVD reconstruction for product item recommendation is 0.0072. On the other hand, the F-value of recommendation by user-based collaborative filtering is 0.0377. We can observe that the F-value of the best recommendation method for product category recommendation is increased 5.24 times compared to the product item method.

5. Conclusion

When recommendation systems are applied to grocery stores, the sparsity of evaluation values can be a problem. This paper proposes two recommendation systems (a) direct recommendation and (b) two-step recommendation of product items based on stored POS data considering the sparsity of data. Two-step recommendation is composed of product category recommendation by using a recommendation algorithm and product item recommendation by heuris-tic decision of store manager. To seek the appropriate recommendation algorithm for the proposed algorithm, five

(7)

Table 5. Performance comparison of recommend method for product category

(a) m= 3

Recomend method optimal parameter precision recall F-value User-based CF N= 220 0.0245 0.176 0.0429 SVD reconstruction k= 4 0.0233 0.166 0.0407 SVD similarity k= 6 0.0021 0.0147 0.0036 CF & SVD k= 4, N = 470 0.0222 0.159 0.0388 NL-PCA u= 2 0.0219 0.165 0.0387 (b) m= 4

Recomend method optimal parameter precision recall F-value

User-based CF N= 90 0.0220 0.210 0.0398 Approximation by SVD k= 2 0.0208 0.198 0.0375 Similarity by SVD k= 6 0.0021 0.0204 0.0039 CF & SVD k= 4, N = 420 0.0202 0.192 0.0363 NL-PCA u= 3 0.0196 0.197 0.0357 (c) m= 5

Recomend method optimal parameter precision recall F-value

User-based CF N= 280 0.0205 0.245 0.0377

Approximation by SVD k= 2 0.0197 0.234 0.0362 Similarity by SVD k= 6 0.00201 0.0241 0.0037 CF & SVD k= 4, N = 320 0.0189 0.225 0.0347

NL-PCA u= 2 0.0179 0.225 0.0348

recommendation algorithms are compared based on real POS data. The results show that recommendation according to the reconstructed evaluation value by SVD is appropriate for direct recommendation and user-based collaborative filtering is appropriate for two step recommendation. Decision of specific procedure of product item recommend by store manager in two-step recommend is a subject for future study.

Acknowledgments

This work was supported by JSPS KAKENHI Grant Number 25750131 and the Strategic Project to Support the Formation of Research Bases at Private Universities: Matching Fund Subsidy from MEXT (Ministry of Education, Culture, Sports, Science and Technology).

(8)

References

1. Guadagni, P.M., Little, J.D.C.. A logit model of brand choice, calibrated on scanner data. Marketing Science 1983;2:203–238.

2. Sano, N., Yada, K.. The influence of sales areas and bargain sales on customer behavior in a grocery store. Neural Computing and Applications 2015;26(2):355–361.

3. Sarwar, B.M., Karypis, G., Konstan, J.A., Riedl, J.T.. Application of dimensionality reduction in recommender system –a case study. In:

Proceedings of the ACM WebKDD Workshop. 2000, .

4. Sarwar, B.M., Karypis, G., Konstan, J.A., Riedl, J.T.. Analysis of recommendation algorithms for ecommerce. In: Proceedings of the 2nd ACM conference on Electronic commerce. 2000, p. 158–167.

5. Goldberg, K., Roeder, T., Gupta, D., Perkins, C.. Eigentaste : A constant time collaborative filtering algorithm. Information Retrieval

Journal 2001;4:133–151.

6. Scholz, M., Fraunholz, M.,, , Selbig, J.. Nonlinear principal component analysis: Neural network models and applications. In: Principal

Manifolds for Data Visualization and Dimension Reduction; vol. 58 of Lecture Notes in Computational Science and enginee. Heidelberg:

Springer; 2008, p. 44–67.

7. Vozalis, M.G., Markos, A., Margaritis, K.G.. Collaborative filtering through svd-based and hierarchical nonlinear pca. In: Proceedings of

References

Related documents

19% serve a county. Fourteen per cent of the centers provide service for adjoining states in addition to the states in which they are located; usually these adjoining states have

diagnosis of heart disease in children. : The role of the pulmonary vascular. bed in congenital heart disease.. natal structural changes in intrapulmon- ary arteries and arterioles.

Standardization of herbal raw drugs include passport data of raw plant drugs, botanical authentification, microscopic & molecular examination, identification of

Field experiments were conducted at Ebonyi State University Research Farm during 2009 and 2010 farming seasons to evaluate the effect of intercropping maize with

Al-Hazemi (2000) suggested that vocabulary is more vulnerable to attrition than grammar in advanced L2 learners who had acquired the language in a natural setting and similar

Chrysanthemum [8,9], ii) the antioxidant enzyme activities and lipid peroxidation of Dendrobium candidum [10], iii) the stimulation on the secondary structure of plasma

This essay asserts that to effectively degrade and ultimately destroy the Islamic State of Iraq and Syria (ISIS), and to topple the Bashar al-Assad’s regime, the international

The total coliform count from this study range between 25cfu/100ml in Joju and too numerous to count (TNTC) in Oju-Ore, Sango, Okede and Ijamido HH water samples as