• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

计算机工程与科学 ›› 2020, Vol. 42 ›› Issue (11): 2073-2079.

• 人工智能与数据挖掘 • 上一篇    下一篇

数字化藏文古籍中多样性字体的实现方法研究

朱倩倩,车文刚,苗晗   

  1. (昆明理工大学信息工程与自动化学院, 云南 昆明 650500)
  • 收稿日期:2019-08-14 修回日期:2020-03-06 接受日期:2020-11-25 出版日期:2020-11-25 发布日期:2020-11-30

An implementation method of diversified  fonts in digital Tibetan ancient books#br#
#br#

ZHU Qianqian,CHE Wengang,MIAO Han#br#

#br#
  

  1. (Faculty of Information Engineering and Automation,Kunming University of Science and Technology,Kunming 650500,China)


  • Received:2019-08-14 Revised:2020-03-06 Accepted:2020-11-25 Online:2020-11-25 Published:2020-11-30

摘要: 计算机成为数据共享和信息交流的工具之后,统一的计算机字体使得文字失去了手写字的多样性与离散性。文字是文化传播和文明传承的关键因素,许多古籍电子化以后失去了原版古籍中具有文化背景和历史意义的特色字体。例如堪称藏族文化一绝的具有多样性和离散性的雕刻字体。为了解决这个问题,提出了将藏文古籍中雕刻字体数字化的方法。结合投影法与连通域法切分古籍图像;通过GIST特征算法实现图像文字的识别;采用SIFT特征算法实现图像字体风格分类,获取古籍中不同风格的雕刻字体;提出字体多样性表达算法实现古籍中雕刻字体的多样性和离散性。研究的目的是传承和保护雕刻字体,具有重要的文化研究和传承意义。


关键词: 藏文古籍, 图像分割, 藏文识别, 字体分类, 字体多样性表达算法

Abstract: After the computer became a tool for data sharing and information exchange, the unified computer font has made the text lose the diversity and discreteness of handwriting. Text is the crucial factor for the spread of culture and civilization. Many electronic books have lost the characteristic fonts with cultural background and historical significance in the original ancient books after the digitalization. One example is the sculpted typeface with diversity and discreteness that can be called a Tibetan culture. In order to solve this problem, a research method of digitizing engraving fonts in ancient Tibetan books is proposed. Firstly, the projection method and the connected domain method are used to segment the ancient book image. Secondly, the GIST feature algorithm is used to realize the image text recognition. Thirdly, the SIFT feature algorithm is used to implement the image font style classification, and diffe rent styles of carved fonts in the ancient books are obtained. A font diversity expression algorithm is proposed to realize the diversity and discreteness of carved fonts in ancient books. The purpose of the research is to achieve the inheritance and protection of engraving fonts, which has important cultural research and inheritance significance.


Key words: Tibetan ancient book, image segmentation, Tibetan recognition, font classification, font diversity expression algorithm