J4 ›› 2008, Vol. 30 ›› Issue (2): 64-66.
• Articles •
Abstract:
To build a domain-specific term dictionary from large-scale unlabelled text, the candidate terms produced by an automatic term extractor are usually filtered by hand, which costs considerable manpower and material resources. This paper proposes a method that automatically filters domain-specific terms from the extracted candidates, based on the CBC (Clustering By Committee) clustering algorithm. The method can also recognize new domain terms when the domain corpus is enlarged. Finally, the paper evaluates the experimental results, which show that the method is effective at filtering terms.
Key words: CBC (Clustering By Committee), term filtering, corpus, term extraction
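The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea of committee-based term filtering, not the paper's implementation. It assumes that each candidate term is represented by a bag of neighbouring words, that a small committee of known domain terms is already available (full CBC discovers committees itself by clustering), and that a candidate is kept when its cosine similarity to the committee centroid reaches a threshold. The function names, the window size, the threshold, and the toy corpus are all hypothetical.

```python
import math
from collections import Counter


def context_vector(term, sentences, window=2):
    """Count the words that occur within `window` tokens of `term`."""
    vec = Counter()
    for tokens in sentences:
        for i, tok in enumerate(tokens):
            if tok != term:
                continue
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vec[tokens[j]] += 1
    return vec


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def centroid(vectors):
    """Sum the member vectors; cosine ignores scale, so no averaging is needed."""
    total = Counter()
    for vec in vectors:
        total.update(vec)
    return total


def filter_terms(candidates, committee, sentences, threshold=0.5):
    """Keep candidates whose contexts resemble those of the committee terms."""
    center = centroid(context_vector(t, sentences) for t in committee)
    return [
        term
        for term in candidates
        if cosine(context_vector(term, sentences), center) >= threshold
    ]


# Toy tokenized corpus (hypothetical data, for illustration only).
sentences = [
    "the router forwards each packet to the network".split(),
    "the switch forwards each frame to the network".split(),
    "the gateway forwards each packet to the network".split(),
    "she ate an apple in the garden yesterday".split(),
]
committee = ["router", "switch"]    # assumed seed domain terms
candidates = ["gateway", "apple"]   # assumed term-extractor output

kept = filter_terms(candidates, committee, sentences)
```

In this toy setting, a candidate whose contexts match those of the committee terms is kept, and one that appears in unrelated contexts is dropped. Enlarging the corpus with more domain text gives previously unseen candidates a context vector, which is one plausible reading of how new domain terms could be recognized.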
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2008/V30/I2/64