Wang Zuo-ying, Sun Jian. The Inhomogeneous HMM with General Topological Structure and Its Application in Language Identification between Mandarin and English[J]. Journal of Electronics & Information Technology, 2007, 29(4): 867-869. doi: 10.3724/SP.J.1146.2005.01128
Citation:
Wang Zuo-ying, Sun Jian. The Inhomogeneous HMM with General Topological Structure and Its Application in Language Identification between Mandarin and English[J]. Journal of Electronics & Information Technology, 2007, 29(4): 867-869. doi: 10.3724/SP.J.1146.2005.01128
Wang Zuo-ying, Sun Jian. The Inhomogeneous HMM with General Topological Structure and Its Application in Language Identification between Mandarin and English[J]. Journal of Electronics & Information Technology, 2007, 29(4): 867-869. doi: 10.3724/SP.J.1146.2005.01128
Citation:
Wang Zuo-ying, Sun Jian. The Inhomogeneous HMM with General Topological Structure and Its Application in Language Identification between Mandarin and English[J]. Journal of Electronics & Information Technology, 2007, 29(4): 867-869. doi: 10.3724/SP.J.1146.2005.01128
In order to use duration information in Language IDentification (LID) efficiently, the inhomogeneous Hidden Markov Model (HMM) with general topological structure is proposed, and is used to identify the language between Mandarin and English also. Because the inhomogeneous HMM with general topologic structure not only describes the duration of state more accurately than HMM, but also uses the structure information of specific language phonetics more effectively, the LID system based on the inhomogeneous HMM with general topological structure has better performance than the homogeneous HMM. For the LID system based on inhomogeneous HMM with different duration distribution, the norm distribution has better performance than the uniform distribution, it shows that the state duration is an important cue for language identification and the norm distribution can model the duration more accurately than the uniform distribution.
[1] Zissman M A and Berkling K M. Automatic language identification[J].Speech Communication.2001, 35(1-2):115- [2] Zissman M A. Automatic language identification using Gauss mixture and hidden Markov models, In: 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP-93, Minneapolis, Minnesota, USA, 1993, 2: 399-402. [3] House A S and Neuburg E P. Toward automatic identification of the language of an utterance. I. Preliminary methodological considerations. J. Acoust. Soc. Amer, 1977, 62(3): 708-713. [4] 王作英,肖熙. 基于段长分布的HMM语音识别模型. 电子学报, 2004, 32(1): 46-50. Wang Zuo-ying and Xiao Xi. Duration distribution based HMM speech recognition models. Acta Electronica Sinica, 2004, 32(1): 46-50. [5] Wang Z Y and Gao H G. An inhomogeneous HMM speech recognition algorithm. Chinese Journal of Electronics, 1998, 7(1): 73-77.