标签扩展的协同过滤推荐算法

计算机工程与科学 ›› 2021, Vol. 43 ›› Issue (10): 1826-1832.

标签扩展的协同过滤推荐算法

陈海龙，闫五岳，孙海娇，程苗

(哈尔滨理工大学计算机科学与技术学院,黑龙江哈尔滨 150080)

收稿日期:2020-05-07 修回日期:2020-08-24 接受日期:2021-10-25 出版日期:2021-10-25 发布日期:2021-10-22
作者简介:陈海龙 (1975),男,黑龙江宁安人，博士，教授，CCF会员(A1589M),研究方向为推荐算法和数据挖掘。
基金资助:
国家自然科学基金(61772160);哈尔滨市科技创新人才研究专项资金(青年后备人才A类，2017RAQXJ045)

A collaborative filtering recommendation algorithm based on tag extension

CHEN Hai-long,YAN Wu-yue,SUN Hai-jiao,CHENG Miao#br#

#br#

（College of Computer Science and Technology,Harbin University of Science and Technology,Harbin 150080,China）

Received:2020-05-07 Revised:2020-08-24 Accepted:2021-10-25 Online:2021-10-25 Published:2021-10-22
About author:CHEN Hai-long ,born in 1975,PhD，professor,CCF member(A1589M),his research interests include recommendation algorithm，and data mining.

摘要/Abstract

摘要： 大多数利用标签与用户和项目之间关系的推荐算法，都要面临用户个体不同所导致的标签稀疏问题，不同的用户为项目所标注的标签会有所不同。针对由于用户标注标签的随意性而导致的用户标签和项目标签矩阵稀疏问题，提出了一种标签扩展的协同过滤推荐算法。该算法根据用户标注标签的行为计算基于标签的标签相似度，根据用户标注的标签语义计算基于标签语义的标签相似度，从用户行为和标签语义2个方面评估标签的相似度，并利用标签相似度来扩展每个项目标签，降低由项目与标签的关联关系产生的矩阵稀疏度。在MovieLens数据集上的实验结果表明，所提算法在精度上有所提高。

关键词: 协同过滤, 标签稀疏, 标签语义, 标签扩展

Abstract: Most recommendation algorithms that use the relationship between tags and users and items have to face the problem of sparse tags caused by different individual users. Different users will have different tags for the items. Aiming at the problem of sparse user-tag and item-tag matrix due to the randomness of user labeling, a collaborative filtering recommendation algorithm based on tag extension is proposed. The label similarity based on the label is calculated according to the user's labeling behavior, and the label similarity based on the label semantics is calculated according to the semantics of the label marked by the user. The similarity of tags is evaluated in terms of user behavior and label semantics, and the tag similarity is used to expand each item-tag to reduce the sparseness of the matrix generated by the association relationship between items and tags. Experimental results show that running the algorithm on the dataset MovieLens improves the accuracy.

Key words: collaborative filtering, tag sparse, tag semantics, tag extension

陈海龙, 闫五岳, 孙海娇, 程苗. 标签扩展的协同过滤推荐算法[J]. 计算机工程与科学, 2021, 43(10): 1826-1832.

CHEN Hai-long, YAN Wu-yue, SUN Hai-jiao, CHENG Miao. A collaborative filtering recommendation algorithm based on tag extension[J]. Computer Engineering & Science, 2021, 43(10): 1826-1832.

[1]	李清风, 金柳, 马慧芳, 张若一. 双视图对比学习引导的多行为推荐方法[J]. 计算机工程与科学, 2024, 46(04): 707-715.
[2]	蔡雨岐, 郭卫斌. 基于多级语义信息融合编码的序列标注方法[J]. 计算机工程与科学, 2022, 44(12): 2266-2272.
[3]	阎红灿, 王子茹, 李伟芳, 谷建涛. 伴随时间的模糊聚类协同过滤推荐算法[J]. 计算机工程与科学, 2021, 43(11): 2084-2090.
[4]	张瑞典，钱晓东. 用余弦相似度修正评分的协同过滤推荐算法[J]. 计算机工程与科学, 2020, 42(06): 1096-1105.
[5]	袁泉1,2,3,成振华1,2,江洋1,2. 基于知识图谱和协同过滤的电影推荐算法研究[J]. 计算机工程与科学, 2020, 42(04): 714-721.
[6]	宋月亭, 吴晟. 基于相似度优化和流形学习的协同过滤算法改进研究[J]. 计算机工程与科学, 2020, 42(02): 351-357.
[7]	刘辉, ., 曾斌, 刘子恺. 融合邻居选择策略和信任关系的兴趣点推荐[J]. 计算机工程与科学, 2020, 42(02): 365-372.
[8]	刘辉1,2,3，万程峰1,2，吴晓浩1,2. 基于增量协同过滤和潜在语义分析的混合推荐算法[J]. 计算机工程与科学, 2019, 41(11): 2033-2039.
[9]	黄乐乐1，马慧芳1,2，李宁3，余丽1. 基于二分图划分联合聚类的协同过滤推荐算法[J]. 计算机工程与科学, 2019, 41(11): 2040-2047.
[10]	刘国丽，白晓霞，廉孟杰，张斌. 基于专家信任的协同过滤推荐算法改进研究[J]. 计算机工程与科学, 2019, 41(10): 1846-1853.
[11]	吴浩1，王晓晨1，曾诚1,2，何鹏1,2. 基于异质用户网络嵌入的服务推荐方法研究[J]. 计算机工程与科学, 2019, 41(07): 1244-1250.
[12]	李艳娟，牛梦婷，李林辉. 基于蜂群K-means聚类模型的协同过滤推荐算法[J]. 计算机工程与科学, 2019, 41(06): 1101-1109.
[13]	王艳茹1,马慧芳1,2,刘海姣1,魏家辉1. 基于多标签语义关联关系的微博用户兴趣建模方法[J]. 计算机工程与科学, 2018, 40(11): 2067-2073.
[14]	王磊，瞿佳明. 基于协同过滤和Slope One算法的Web服务可靠性预测[J]. 计算机工程与科学, 2018, 40(08): 1390-1397.
[15]	刘井平，李平. 一种模糊认知的协同过滤算法[J]. 计算机工程与科学, 2018, 40(05): 898-905.