• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2008, Vol. 30 ›› Issue (2): 64-66.

• 论文 • Previous Articles     Next Articles

  

  • Online:2008-02-01 Published:2010-05-19

Abstract:

In order to get the specific field term dictionary from large-scale unlabelled texts,we usually use manual methods to filter terms after getting the terms from the machine of term-extraction. But this needs more manpower and material resources. This paper proposes a new way to automatically filter the specific terms from term texts based on the CBC(cluster by committee) clustering method. Meanwhile, it can recognize new field terms by enlarging the field corpus. Finally it evaluates the results of this experiment, and shows the better effect of the method in filtering terms.

Key words: CBC(cluster by committee), term filtering, corpus, term extracting