• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2013, Vol. 35 ›› Issue (7): 156-163.

• 论文 • Previous Articles     Next Articles

A novel clustering algorithm for uncertain tree          

YAN Yiming,GUO Xin   

  1. (School of Software Service Outsourcing,Jishou University,Zhangjiajie 427000,China)
  • Received:2012-08-10 Revised:2012-11-02 Online:2013-07-25 Published:2013-07-25

Abstract:

Uncertain tree clustering is an important problem in data mining domain. In this paper, a new uncertain tree clustering algorithm is proposed. The algorithm effectively resolves the clustering problems for uncertain data. In order to improve accurate measurement on the similarities among trees, the method of semantic similarity and structural similarity are presented. A dynamic clustering process is designed in which selfadaptive threshold be applied so as to greatly reduce the jamming impact on the result accuracy. This process can cluster subtrees of similar structure within similar groups , minimizing the similarity of subtree groups. Both simulation and real experiments show that the algorithm is effective and efficient and the clustering result is accurate.

Key words: data mining;ordered tree;frequent subtree;similarity;uncertain tree;clustering