[1]Salton G, Wong A, Yang C S. A vector space model for automatic indexing[J]. Communications of the ACM, 1975, 18(11):613620.
[2]Zhang Xiaodan, Zhou Xiaohua, Hu Xiaohua. Semantic smoothing for modelbased document clustering[C]∥Proc of the 6th International Conference on Data Mining, 2006:11931198.
[3]Zhou Xiaohua, Zhang Xiaodan, Hu Xiaohua. Semantic smoothing of document models for agglomerative clustering[C]∥Proc of the 20th International Joint Conference on Artifical Intelligence, 2007:29222927.
[4]Liu Hua. Research of text classification based on key phrases[J]. Journal of Chinese Information Processing, 2007,21(4):3441.(in Chinese)
[5]Shi Qingwei, Zhao Zheng, Chao Ke. Hierarchical clustering of Chinese web pages based on suffix tree[J]. Joumal of Liaoning Technical University, 2006, 25(6):890892.(in Chinese)
[6]Du Hongbin, Xia Kewen, Liu Nanping. An improved text clustering algorithm of generalized suffix tree[J]. Information and Control, 2009, 38(3):331336. (in Chinese)
[7]Wang Junze,Mo Yijun,Huang Benxiong,et al. Web search results clustering based on a novel suffix tree structure[J]. Autonomic and Trusted Computing, 2008, 5060(23):540554.
[8]Zhao Jun, Jin Qianli, Xu Bo. Semantic computation for text retrieval[J]. Chinese Journal of Computers, 2005, 28(12):20682078. (in Chinese)
[9]Jing Liping, Zhou Lixin, Ng Michael K, et al. Ontologybased distance measure for text clustering[C]∥Proc of the Text Mining Workshop, SIAM International Conference on Data Mining, 2006:1.
[10]Xie Hongwei, Yan Xiaolin, Yu Xueli. Research on web page clustering based on ontology[J]. Computer Science, 2008, 35(9):153155. (in Chinese)
[11]Zhu Huifeng, Zuo Wanli, He Fengling. A novel text clustering method based on ontology[J]. Journal of Jilin University(Science Edtion), 2010, 48(2):277283. (in Chinese)
[12]Ponte J M, Bruce C W. A language modeling approach to information retrieval[C]∥Proc of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1998:275281.
[13]Chang Peng, Feng Nan. A cooccurrence based vector space model for document indexing[J]. Journal of Chinese Information Processing, 2012, 26(1):5157.(in Chinese)
[14]Cao Tian, Zhou Li, Zhang Guoxuan. Text similarity computing based on word cooccurrence[J]. Computer Engineering & Science, 2008, 29(3):5253.(in Chinese)
[15]Wu Guangyuan, He Pilian, Cao Guihong. Vector space model based on word cooccurrence and its application in text classification[J]. Computer Applications, 2003, 23(23):138140.(in Chinese)
附中文参考文献:
[4]刘华. 基于关键短语的文本分类研究[J]. 中文信息学报, 2007, 21(4):3441.
[5]史庆伟,赵政,朝柯. 一种基于后缀树的中文网页层次聚类方法[J]. 辽宁工程技术大学学报, 2006, 25(6):890892.
[6]杜红斌,夏克文,刘南平. 一种改进的基于广义后缀树的文本聚类算法[J]. 信息与控制, 2009, 38(3):331336.
[8]赵军,金千里,徐波. 面向文本检索的语义计算[J]. 计算机学报, 2005, 28(12):20682078.
[10]谢红薇,颜小林,余雪丽. 基于本体的WEB页面聚类研究[J]. 计算机科学, 2008, 35(9):153155.
[11]朱会峰,左万利,赫枫龄. 一种基于本体的文本聚类方法[J]. 吉林大学学报(自然科学版), 2010, 48(2):277283.
[13]常鹏,冯楠. 基于词共现的文档表示模型[J]. 中文信息学报, 2012, 26(1):5157.
[14]曹恬,周丽,张国煊. 一种基于词共现的文本相似度计算[J]. 计算机工程与科学, 2008, 29(3):5253.
[15]吴光远,何丕廉,曹桂宏. 基于向量空间模型的词共现研究及其在文本分类中的引用[J]. 计算机应用, 2003, 23(23):138140. |