北京大学学报(自然科学版)

基于语义构件的甲骨文字库自动生成技术研究

吴琴霞1,2,栗青生1,2,高峰1,2   

  1. 1. 安阳师范学院计算机与信息工程学院, 安阳 455002; 2. 甲骨文数字化工程技术研究中心, 安阳 455002;
  • 收稿日期:2013-06-17 出版日期:2014-01-20 发布日期:2014-01-20

Study on the Technique of Automatic Generation of Oracle Characters Based on Semantic Component

WU Qinxia1,2, LI Qingsheng1,2, GAO Feng1,2   

  1. 1. School of Computer and Information Engineering, Anyang Normal University, Anyang 455002; 2. Institute of Digital Inscriptions on Bones/Tortoise Shells, Anyang 455002;
  • Received:2013-06-17 Online:2014-01-20 Published:2014-01-20

摘要: 针对甲骨文字形多变、异体字多等特点, 通过对甲骨字的构成分析, 提出一种基于语义构件的甲骨文字库自动生成方法。该方法以动态描述库为基础, 通过算法提取甲骨字的构件特征信息, 重组笔元生成语义构件, 再给语义构件加上特征描述生成构件知识库。通过仿射变换重复使用语义构件自动生成任意甲骨字。实验表明, 该方法能有效解决甲骨文无字库输入的实现, 还可以解决甲骨字编码、构件统计、未释字的考释等。

关键词: 甲骨文, 构件, 字形描述, 有向笔段, 仿射变换

Abstract: According to the features of Oracle characters, such as changing shape and variant font froms, an automatic generation methode of Oracle Bone script font library based on semantic component is proposed by analyzing the structure of Oracle Bone characters. The method is based on the dynamic description library. Firstly an algorithm is used to get the feature information of the component Oracle characters; secondly the semantic component is made by reorganization strokes; finally the component library base is generated by feature descriptions for component. The semantic component library is designed with affine transform, from which each Oracle character could be generated. Experiments show that this method can effectively solve the problem of Oracle character input without word stock, and can also solve the Oracle character encoding, component statistics, and interpretation of the incognizant Oracle character, etc.

Key words: Oracle character, component, glyph description, stroke-segments-vector, affine transform

中图分类号: