一般拓扑结构的非齐次隐含马尔科夫模型及其在中、英文语种辨识中的应用

王作英; 孙健

doi:10.3724/SP.J.1146.2005.01128

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名

邮箱

手机号码

标题

留言内容

验证码

一般拓扑结构的非齐次隐含马尔科夫模型及其在中、英文语种辨识中的应用

doi: 10.3724/SP.J.1146.2005.01128

王作英,
孙健

计量
- 文章访问数: 3177
- HTML全文浏览量: 94
- PDF下载量: 1146
- 被引次数: 0
出版历程
- 收稿日期: 2005-09-09
- 修回日期: 2006-01-06
- 刊出日期: 2007-04-19

The Inhomogeneous HMM with General Topological Structure and Its Application in Language Identification between Mandarin and English

摘要

摘要: 为了充分利用语音信号中的段长信息，该文提出了一种具有一般拓扑结构的非齐次隐含Markov模型(Hidden Markov Model, HMM)，并将其应用于中、英文语种辨识(Language IDentification, LID)系统。非齐次HMM既很好地描述了语音信号的发生过程，又准确地利用了状态的段长信息和语言中的上下文连接结构信息，对于中、英文语种辨识系统，非齐次的HMM系统辨识性能好于齐次的HMM模型。而在非齐次的HMM中，同段长为均匀分布相比，段长分布为正态分布时系统的辨识性能更好，表明段长确实是一种重要的语种区分信息之一，且正态分布较均匀分布更接近于真实的段长分布。
- 语种辨识;非齐次隐含Markov模型;段长分布
Abstract: In order to use duration information in Language IDentification (LID) efficiently, the inhomogeneous Hidden Markov Model (HMM) with general topological structure is proposed, and is used to identify the language between Mandarin and English also. Because the inhomogeneous HMM with general topologic structure not only describes the duration of state more accurately than HMM, but also uses the structure information of specific language phonetics more effectively, the LID system based on the inhomogeneous HMM with general topological structure has better performance than the homogeneous HMM. For the LID system based on inhomogeneous HMM with different duration distribution, the norm distribution has better performance than the uniform distribution, it shows that the state duration is an important cue for language identification and the norm distribution can model the duration more accurately than the uniform distribution.

HTML全文

参考文献(1)

[1] Zissman M A and Berkling K M. Automatic language identification[J].Speech Communication.2001, 35(1-2):115- [2] Zissman M A. Automatic language identification using Gauss mixture and hidden Markov models, In: 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP-93, Minneapolis, Minnesota, USA, 1993, 2: 399-402. [3] House A S and Neuburg E P. Toward automatic identification of the language of an utterance. I. Preliminary methodological considerations. J. Acoust. Soc. Amer, 1977, 62(3): 708-713. [4] 王作英，肖熙. 基于段长分布的HMM语音识别模型. 电子学报, 2004, 32(1): 46-50. Wang Zuo-ying and Xiao Xi. Duration distribution based HMM speech recognition models. Acta Electronica Sinica, 2004, 32(1): 46-50. [5] Wang Z Y and Gao H G. An inhomogeneous HMM speech recognition algorithm. Chinese Journal of Electronics, 1998, 7(1): 73-77.

施引文献

资源附件(0)

访问统计

计量

文章访问数: 3177
HTML全文浏览量: 94
PDF下载量: 1146
被引次数: 0

留言板

一般拓扑结构的非齐次隐含马尔科夫模型及其在中、英文语种辨识中的应用

doi: 10.3724/SP.J.1146.2005.01128

计量

出版历程

The Inhomogeneous HMM with General Topological Structure and Its Application in Language Identification between Mandarin and English

计量

出版历程

目录