• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2007, Vol. 29 ›› Issue (10): 17-19.

• 论文 • 上一篇    下一篇

一种基于人工免疫网络的文本聚类算法

童健华 谭洪舟   

  • 出版日期:2007-10-01 发布日期:2010-06-02

  • Online:2007-10-01 Published:2010-06-02

摘要:

本文构造了一种能准确描述文本之间相似性(亲和力)的新方法,并在此基础上提出了一种基于人工免疫网络的文本聚类算法。仿真结果表明,与传统的文本聚类算法相比,新算法不仅能自动发现新类,而且具有聚类精度更高、数据压缩比更大、与输入初始配置无关、可增量处理的优势。

关键词: 亲和力计算 人工免疫网络 文本聚类

Abstract:

A new method which can accurately compute the affinity between documents is presented in this paper. By using the method a document clustering algorithm based on artificial immune networks is proposed. Simulation results show that the new algorithm can not only locate new clusters automatically, but h as the advantage of being independent of the input initialization and incremental clustering ability as well, where it has better clustering quality and  higher data compression rate than some current document clustering algorithms.

Key words: affinity computing, artificial immune network, document clustering