• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2010, Vol. 32 ›› Issue (8): 78-80.doi: 10.3969/j.issn.1007130X.2010.

• 论文 • 上一篇    下一篇

一种采用小波包分析及RBFN的民族文种识别方法

郭〓海,赵晶莹,韦宗伟   

  1. (大连民族学院计算机科学与工程学院,辽宁 大连 116600)
  • 收稿日期:2009-08-25 修回日期:2009-12-03 出版日期:2010-07-25 发布日期:2010-07-28
  • 作者简介:郭海(1979),男,黑龙江哈尔滨人,硕士,讲师,研究方向为少数民族信息处理和模式识别; 赵晶莹,硕士,讲师,研究方向为少数民族信息处理和模式识别;韦宗伟,研究方向为少数民族信息处理和模式识别。
  • 基金资助:

    国家自然科学基金资助项目(60803096);国家民委项目(07DL07)

A Method of Chinese Minority Script Identification Using Wavelet Packet Decomposition and RBFN

GUO Hai,ZHAO Jingying,WEI Zongwei   

  1. (School of Computer Science and  Engineering,Dalian Nationalities University,Dalian 116600,China)
  • Received:2009-08-25 Revised:2009-12-03 Online:2010-07-25 Published:2010-07-28

摘要:

随着我国计算机技术的发展,少数民族信息处理已经逐渐成熟起来,少数民族文字识别研究已经成为一个热点。本文提出一种基于小波包特征与径向基网络的少数民族文字种类识别方法,该方法采用小波包能量和小波包能量比例分布的特征描述,利用径向基函数神经网络对少数民族文种进行分类识别。通过构建六种常用的少数民族文字及汉字、英文共八种文字的样本库,采用本文的方法对样本库进行了训练和测试。实验结果显示,本文的方法对于少数民族文种识别的平均精度好于小波特征及传统的分类方法。

关键词: 少数民族文字, 文种识别, 小波分析, 径向基网络

Abstract:

With the fast development of computer technology, Chinese minority information processing has become mature gradually, and the research on Chinese minority information processing focuses on the Chinese minority optical character recognition. The method of identification of the kinds of Chinese minority scripts based on wavelet packet analysis and Radial Basis Function Network (RBFN) is presented which adopts the feature of wavelet packet energy and wavelet packet energy distribution proportion, and constructs multivariate classification in the radial basis function Network. By building a data set which contains 6 kinds of common Chinese minority scripts, 8 kinds of Chinese and English in total, we train and test the dataset by means of the method in this paper. Obviously, the result shows that the method outperforms the traditional classification and wavelet feature.

Key words: Chinese minority script;script identification;wavelet packet analysis;radial basis function network (RBFN)