Acta Scientiarum Naturalium Universitatis Pekinensis ›› 2018, Vol. 54 ›› Issue (2): 243-248.DOI: 10.13209/j.0479-8023.2017.152

Previous Articles     Next Articles

A Study of Articulatory Features Based Detection of Mandrain Pronunciation Erroneous Tendency for Automatic Annotation

WEI Xing, WANG Wei, CHEN Jingping, XIE Yanlu, ZHANG Jinsong   

  1. Advanced Innovation Center for Language Resource and Intelligence Research Funds of State Language Commission, School of Information Science, Beijing Language and Culture University, Beijing 100083
  • Received:2017-06-05 Revised:2017-09-05 Online:2018-03-20 Published:2018-03-20
  • Contact: XIE Yanlu, E-mail: xieyanlu(at)blcu.edu.cn

基于发音特征的汉语发音偏误自动标注

魏星, 王玮, 陈静萍, 解焱陆, 张劲松   

  1. 语言资源高精尖创新中心, 北京语言大学信息科学学院, 北京 100083
  • 通讯作者: 解焱陆, E-mail: xieyanlu(at)blcu.edu.cn
  • 基金资助:
    国家语言文字工作委员会科研项目(ZDI135-51)、北京语言大学梧桐创新平台项目(16PT05)、北京语言大学研究生创新基金项目(17YCX140)和语言资源高精尖创新中心项目(451122500)资助

Abstract:

For the purpose of relieving the time cost and inconformity in annotation, the authors use an articulatory features based mispronunciation detection system to give an Top-N feedback and use this feedback to assist manual annotation. As a result, the consistency rate of phoneme labels in proposed system increases from 80.7% to 92.48%. In addition, the time cost for annotating each sentence reduce from 10 to 3 minutes. The results indicate that proposed automatic annotation system is practical, and there is also a room for further improvement.

Key words: articulatory features (AFs), pronunciation erroneous tendency (PET), automatic annotation

摘要:

针对发音偏误检测系统语音标注费时、费力和标注不一致的问题, 基于发音特征, 构建偏误检测系统, 给出Top-N的识别结果, 通过praat软件呈现机器初步标注文本, 在此基础上进行人工二次标注。实验结果表明, 与单纯的人工标注相比, 所提出的自动标注加人工二次标注方法在标注一致性上从80.7%提高到92.48%, 平均每个句子的标注时间从10分钟减少到3分钟。所提方法有效地提高了人工标注的效率, 可以在有限时间内为识别系统提供更多可靠的标注语料。

关键词: 发音特征, 发音偏误趋势, 自动标注

CLC Number: