J4 ›› 2014, Vol. 36 ›› Issue (4): 765-771.
• 论文 • Previous Articles Next Articles
QIAN Xuezhong,WU Zhiyuan
Received:
Revised:
Online:
Published:
Abstract:
To mine user’s interests accurately,probabilistic latent semantic analysis (PLSA) model is firstly used to project webpage-word matrix vector into probabilistic latent semantic vector space. A method of “auto-selected similarity threshold” is proposed to get web pages similarity threshold. At last, combined with divisiory algorithms and hierarchical agglomerative clustering,a hierarchical agglomerative kmedoids clustering algorithm is proposed to realize cluster user’s interests. The experimental results show that, compared with the traditional divisiory algorithms, the hierarchical agglomerative kmedoids algorithm has a better clustering effect. Furthermore, user’s interest clustering technique can improve the efficiency of personalized recommendation and search in user’ personalized service fields.
Key words: probabilistic latent semantic analysis;autoselected similarity threshold;user’s interest points;hierarchical agglomerative kmedoids;personalized service
QIAN Xuezhong,WU Zhiyuan. User’s interest clustering based on webpage probabilistic latent semantic information [J]. J4, 2014, 36(4): 765-771.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2014/V36/I4/765