Acta Scientiarum Naturalium Universitatis Pekinensis ›› 2017, Vol. 53 ›› Issue (2): 239-246.DOI: 10.13209/j.0479-8023.2017.029

• Orginal Article • Previous Articles     Next Articles

A Study of Articulatory Features Based Detection of Pronunciation Erroneous Tendency

Leyuan QU, Yanlu XIE(), Jinsong ZHANG   

  1. School of Information Science, Beijing Language and Culture University, Beijing 100083
  • Received:2016-07-29 Revised:2016-10-01 Online:2016-11-30 Published:2017-03-20
  • Contact: Yanlu XIE


屈乐园, 解焱陆(), 张劲松   

  1. 北京语言大学信息科学学院, 北京 100083
  • 通讯作者: 解焱陆
  • 基金资助:


This paper proposed to apply senone log-likelihood ratio based articulatory features (AFs) to improve pronunciation erroneous tendency (PET) detection performance. The feedback information of articulation-placement and articulation-manner could be derived from the definition of PET. The framework of the method involved two main steps. 1) A bank of attribute extractors based on neural networks were trained to estimate the log-likelihood ratio (LLR) for each senone at a frame level. 2) AFs composed of those LLRs outputted from each attribute extractor were used for detecting PETs. Results demonstrated that the proposed system had better performance than the baseline system using MFCC. Moreover, substantial improvements were obtained by combining AFs with MFCC, achieving a lower false rejection rate of 5.0%, a lower false acceptance rate of 30.8% and a higher diagnostic accuracy of 89.8%.

Key words: articulatory features (AFs), pronunciation erroneous tendency (PET), computer assisted pronuncia-tion training (CAPT), senone log-likelihood ratios


为了提升计算机辅助发音训练(CAPT)系统中发音偏误趋势(PET)的检测效果, 确保反馈信息的准确性与有效性, 提出一种基于对数似然比的发音特征方法。该方法将多个基于深度神经网络的发音特征提取器用于生成帧级别的对数似然比, 然后将对数似然比组成的发音特征用于PET的检测, 为学习者提供发音位置和发音方法的正音信息。实验结果表明, 发音特征对PET的检测效果优于常用声学特征(MFCC, PLP和fBank), 当发音特征与MFCC特征相结合时, 可以进一步提升性能, 达到错误接受率为5.0%, 错误拒绝率为30.8%, 诊断正确率为89.8%的检测效果。

关键词: 发音特征, 发音偏误趋势, 计算机辅助发音训练, 对数似然比

CLC Number: