CKComCF: Canopy–K-means Clustering based Combined Collaborative Filtering
- Alternative Title
- 캐노피-k-평균 클러스터링 기반 결합 협업 필터링
- Abstract
- In recent times, big data is revolutionizing every aspect of human lives. It has attracted more attention in academia and industry. To quickly obtain information, information filtering systems are essential. In the e-commerce industry, the recommendation system (RS) to predict the preferences of people has been prevalent over the few years. Collaborative filtering (CF) is one of the most conventional algorithms of RS. However, CF suffers from data sparsity and scalability issues. Thus, we propose Canopy–K-means Clustering-based Combined Collaborative Filtering (CKComCF) to solve the challenge of data sparsity and scalability. In particular, the prediction outcomes of user-based CF (UbCF) and item-based CF (IbCF) are integrated using a weighting approach, which is based on the root-mean-square error (RMSE) minimization. Experiment results based on two real-life datasets of MovieLens and Netflix Prize demonstrate that the proposed RMSE-minimization method outperforms the traditional CF methods, improving the accuracy by 64.24% (UbCF with MovieLens) and 13.72% (IbCF with Netflix Prize). The proposed CKComCF model outperforms the existing improved CF method, reducing the calculation time by 41.84% (MovieLens) and 64.77% (Netflix Prize).
- Author(s)
- KUAN SAO I
- Issued Date
- 2021
- Awarded Date
- 2021. 2
- Type
- Dissertation
- Keyword
- Recommendation System Combined Collaborative Filtering RMSE-minimization Canopy Clustering K-means Clustering
- Publisher
- 부경대학교
- URI
- https://repository.pknu.ac.kr:8443/handle/2021.oak/2135
http://pknu.dcollection.net/common/orgView/200000372836
- Affiliation
- Pukyong National university, Graduate school
- Department
- 대학원 IT융합응용공학과
- Advisor
- Ha-Joo Song
- Table Of Contents
- 1. Introduction 1
2. Related works 3
2.1. Traditional Collaborative Filtering 3
2.2. Improved Collaborative Filtering 5
3. Proposed Model 7
3.1. ComCF weighting 7
3.1.1 Comparison of CF PCC and library PCC 8
3.2. RMSE-minimization for ComCF 8
3.2.1 RMSE vs MAPE 9
3.3. Canopy–K-means based Combined Collaborative Filtering (CKComCF) 10
3.3.1 Canopy and K-means Clustering process 11
3.4 Other method to improve CF 11
4. Experiment 13
4.1. Dataset 13
4.2. Evaluation 15
4.3. Result 15
4.3.1. Effectiveness of RMSE-minimization for ComCF 16
4.3.2. Effectiveness of CKComCF 17
5. Conclusion 25
- Degree
- Master
-
Appears in Collections:
- 대학원 > IT융합응용공학과
- Authorize & License
-
- Files in This Item:
-
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.