北京大学学报(自然科学版)

说话人识别的参量研究和语音库建设

吴淑珍,吴阿华   

  1. 北京大学无线电系,北京,100871
  • 收稿日期:1994-09-23 出版日期:1995-05-20 发布日期:1995-05-20

A Study of Parameters on Speaker Recognition and Creation of Speech Database

WU Suzhen, WU Ahua   

  1. Department of Radio Electronics, Peking University, Beijing, 100871
  • Received:1994-09-23 Online:1995-05-20 Published:1995-05-20

摘要: 本文对说话人识别中的几个基本问题进行了研究。语音参量是说话人识别的基础,用矢量量化方法,使用自建的语音库中的材料,研究了说话人识别中的各种参量的效果。实验表明,所采用的参量中,一种混合参量MC最好,倒谱系数CE次之。

关键词: 说话人识别, 语音参量, 矢量量化, 倒谱系数, 线性预测编码

Abstract: Describes briefly a study of a few fundamental problemson Speaker Recognition. Speech parameters are the base and a speech database is needful for speaker recognition. Our study used VQ technic and materiel of speech database which created by us. It is shown by experiments that in some introduced parameters a mixed parameter is the best, secondly is cepstral coefficient.

Key words: speaker recognition, speech parameter, VQ, cepstral coefficient, linear predictive coding

中图分类号: