Acta Scientiarum Naturalium Universitatis Pekinensis
Previous Articles Next Articles
WU Shuzhen1, CHENG Qiansheng2
Received:
吴淑珍1, 程乾生2
Abstract: A speech recognition method is described, that is based on a combination of finite-state vector quantization(FSVQ) and dynamic spectral features. FSVQ is a recallable vector quantization system, which also uses past information for optimizing the code book, and is more effective for speech recognition. The characteristics of a speech signal are represented by time sequences of LPC cepstral coefficients, the dynamic spectral features and log-energy. According to pronunciation feature of Mandarin, the distance values were weighted for the parts of word termination. The experimental results show that the depended speaker speech recognition rate is 98%.
Key words: finite-state vector quantization, LPC cepstral coefficients, dynamic spectral feature, dynamic time warpping, state transition function
摘要: 结合动态谱特性的语音识别研究,阐述了一种有限状态矢量量化(FSVQ)方法。FSVQ利用了过去的信息来选择合适的码本进行编码,对于语音识别更为有效。改进了所使用的语音特征参量,除了LPC倒谱系数外,结合使用了动态谱特征和能量的对数值,并根据汉语发音特征对语音信号端点进行一种加权处理。实验结果表明:与说话人有关的孤立词识别率达到98%。
关键词: 有限状态矢量量化, LPC倒谱系数, 动态谱特性, 动态规整, 状态转移函数
CLC Number:
TN912
WU Shuzhen,CHENG Qiansheng. A Study on Speech Recognition for Isolate Words[J]. Acta Scientiarum Naturalium Universitatis Pekinensis.
吴淑珍, 程乾生. 一种孤立词语音识别方法研究[J]. 北京大学学报(自然科学版).
Add to citation manager EndNote|Ris|BibTeX
URL: https://xbna.pku.edu.cn/EN/
https://xbna.pku.edu.cn/EN/Y2001/V37/I1/67