维吾尔语大词汇语音识别系统识别单元研究

北京大学学报（自然科学版）

维吾尔语大词汇语音识别系统识别单元研究

努尔麦麦提.尤鲁瓦斯,吾守尔.斯拉木,热依曼.吐尔逊

新疆大学信息科学与工程学院, 乌鲁木齐 830046;

收稿日期:2013-06-14 出版日期:2014-01-20 发布日期:2014-01-20

Research on Recognition Units of Large Vocabulary Speech Recognition System of Uyghur

Nurmemet Yolwas, Wushour Silamu, Reyiman Tursun

College of Information Science and Engineering, Xinjiang University, Urumqi 830046;

Received:2013-06-14 Online:2014-01-20 Published:2014-01-20

摘要/Abstract

摘要： 维吾尔语是一种黏着语, 单词不太适合作为维吾尔语大词汇连续语音识别系统识别单元。针对维吾尔语大词汇连续语音识别系统中的识别单元选择问题, 设计更适合维吾尔语的子词识别单元, 提出维吾尔语单词和子词相结合的组合识别单元构建方法, 并对单词、子词和组合识别单元的语言模型和语音识别性能进行评价。实验结果表明, 所提出的识别单元在单元数量、语言模型复杂度等方面表现出更加优越的性能, 并且使识别系统的单词错误率比基于单词的系统相对减少22%。

关键词: 维吾尔语, 大词汇, 语音识别, 识别单元

Abstract: Uyghur is an agglutinative language and words are not optimal recognition units for Uyghur LVCSR systems. With regard to recognition unit selection problem in Uyghur LVCSR systems, a more suitable recognition units for Uyghur likes sub-word is designed, and the combining recognition units of word and sub-word are proposed. The performance of language models and speech recognition are evaluated on different recognition units. Experiment results show that the proposed recognition units outperforms word units in terms of unit size, language model perplexity, and can give a relative word error rate reduction of 22% over the word based system.

Key words: Uyghur, LVCSR, speech recognition, recognition unit

中图分类号:

TP391

努尔麦麦提.尤鲁瓦斯,吾守尔.斯拉木,热依曼.吐尔逊. 维吾尔语大词汇语音识别系统识别单元研究[J]. 北京大学学报（自然科学版）.

Nurmemet Yolwas,Wushour Silamu,Reyiman Tursun. Research on Recognition Units of Large Vocabulary Speech Recognition System of Uyghur[J]. Acta Scientiarum Naturalium Universitatis Pekinensis.

导出引用管理器 EndNote|Ris|BibTeX

链接本文: https://xbna.pku.edu.cn/CN/

https://xbna.pku.edu.cn/CN/Y2014/V50/I1/149

[1]	张新路, 李晓, 杨雅婷, 王磊, 董瑞. 面向维汉神经机器翻译的双向重排序模型分析[J]. 北京大学学报自然科学版, 2020, 56(1): 31-38.
[2]	吐尔洪·吾司曼, 杨雅婷, 艾孜孜·吐尔逊, 程力. 字符级的维吾尔语形态协同分析方法[J]. 北京大学学报自然科学版, 2019, 55(1): 47-54.
[3]	周楠, 赵悦, 李要嫱, 徐晓娜, 才旺拉姆, 吴立成. 基于瓶颈特征的藏语拉萨话连续语音识别研究[J]. 北京大学学报（自然科学版）, 2018, 54(2): 249-254.
[4]	甄斌,吴玺宏,刘志敏,迟惠生. 语音识别和说话人识别中各倒谱分量的相对重要性[J]. 北京大学学报（自然科学版）, 2001, 37(3): 371-378.