Huang Wei, Dai Bei-qian, Li Hui. Speaker Identification Based on Classify Feature Sub-space Gaussian Mixture Model and Neural Net Fusion[J]. Journal of Electronics & Information Technology, 2004, 26(10): 1607-1612.
This paper proposes a speaker identification system based on the fusion of classified Feature Sub-space Gaussian Mixture Models and a Neural Net (FS-GMM/NN). Clustering analysis partitions the speakers' training feature vectors into subsets, and a separate Gaussian Mixture Model (GMM) is trained on each subset, with the number of mixtures chosen according to the number of feature vectors in that subset. Finally, the outputs of all the subset GMMs are fused by a Neural Net (NN). In a text-independent speaker identification experiment with 100 male speakers, the FS-GMM/NN system outperforms the Baseline Gaussian Mixture Model (B-GMM) in identification performance and noise robustness while using fewer mixtures and shorter test speech. Moreover, the training of FS-GMM/NN is more effective.
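The first two steps described above (partitioning the feature vectors into subsets, then sizing each subset's GMM by how many vectors it holds) might be sketched as follows. This is only an illustration under stated assumptions: the abstract does not name the clustering algorithm or the rule that maps subset size to mixture count, so the nearest-centroid assignment, the proportional allocation rule, and the function names `partition_by_nearest_centroid` and `allocate_mixtures` are hypothetical. GMM training and the NN fusion stage are omitted.

```python
import math


def partition_by_nearest_centroid(vectors, centroids):
    """Assign each feature vector to the subset of its nearest centroid.

    Assumption: centroids come from some clustering step (e.g. k-means);
    the abstract does not say which clustering method is used.
    """
    subsets = [[] for _ in centroids]
    for v in vectors:
        distances = [math.dist(v, c) for c in centroids]
        subsets[distances.index(min(distances))].append(v)
    return subsets


def allocate_mixtures(subset_sizes, total_mixtures):
    """Give each subset a mixture count proportional to its size.

    Assumption: proportional allocation with a floor of one mixture per
    subset. Because of rounding and the floor, the counts need not sum
    exactly to total_mixtures.
    """
    total_vectors = sum(subset_sizes)
    if total_vectors == 0:
        raise ValueError("no feature vectors to allocate mixtures for")
    return [max(1, round(total_mixtures * n / total_vectors))
            for n in subset_sizes]


# Toy 2-D "feature vectors" forming two obvious groups.
vectors = [(0.0, 0.1), (0.2, 0.0), (5.0, 5.1), (5.2, 4.9), (4.9, 5.0)]
centroids = [(0.0, 0.0), (5.0, 5.0)]

subsets = partition_by_nearest_centroid(vectors, centroids)
sizes = [len(s) for s in subsets]
mixtures = allocate_mixtures(sizes, total_mixtures=10)
```

In a fuller pipeline, each subset would then train its own GMM with the allocated number of mixtures, and the per-subset likelihood scores would form the input to the fusion network.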