Feature Mean Distance Based Speaker Clustering for Short Speech Segments

Li Yan-Xiong; Wu Yong; He Qian-Hua

doi:10.3724/SP.J.1146.2011.01139

Volume 34 Issue 6

Jul. 2012



Li Yan-Xiong, Wu Yong, He Qian-Hua. Feature Mean Distance Based Speaker Clustering for Short Speech Segments[J]. Journal of Electronics & Information Technology, 2012, 34(6): 1404-1407. doi: 10.3724/SP.J.1146.2011.01139


cstr: 32379.14.SP.J.1146.2011.01139


Received Date: 2011-11-03
Revision Received Date: 2012-02-24
Publish Date: 2012-06-19

Abstract


A speaker clustering algorithm for short speech segments is proposed, based on the Feature Mean Distance (FMD). First, a distance measure, the FMD, is introduced to represent the similarity between two clusters at the feature level rather than the model level. Then, the two clusters with the minimum FMD are iteratively merged until the minimum FMD exceeds an adaptive threshold. Experimental results show an average improvement of 5% in F measure over the AHC+BIC based algorithm. In addition, the proposed algorithm is 4.68 times faster than the AHC+BIC based algorithm.
Keywords: Speech signal processing; Speaker clustering; Feature Mean Distance (FMD); Short speech segments
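
The merging procedure summarized in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not give the exact FMD formula or the rule for the adaptive threshold, so the sketch assumes that the FMD is the Euclidean distance between the mean feature vectors of two clusters and that the threshold is a hypothetical factor `alpha` times the mean of the initial pairwise distances. The names `feature_mean_distance`, `fmd_clustering` and `alpha` are all illustrative.

```python
import numpy as np


def feature_mean_distance(a, b):
    """Assumed FMD: Euclidean distance between the mean feature vectors.

    a, b: arrays of shape (frames, feature_dim).
    """
    return float(np.linalg.norm(a.mean(axis=0) - b.mean(axis=0)))


def fmd_clustering(segments, alpha=1.0):
    """Merge the closest pair of clusters until the minimum distance
    exceeds the threshold. Returns clusters as lists of segment indices."""
    clusters = [[i] for i in range(len(segments))]
    feats = [np.asarray(s, dtype=float) for s in segments]
    if len(clusters) < 2:
        return clusters

    # Assumed adaptive threshold (not from the paper):
    # alpha times the mean of all initial pairwise distances.
    initial = [feature_mean_distance(feats[i], feats[j])
               for i in range(len(feats))
               for j in range(i + 1, len(feats))]
    threshold = alpha * float(np.mean(initial))

    while len(clusters) > 1:
        # Find the pair of clusters with the minimum distance.
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = feature_mean_distance(feats[i], feats[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        if d > threshold:
            break  # the closest pair is still too far apart: stop merging
        # Merge cluster j into cluster i by pooling indices and frames.
        clusters[i] += clusters[j]
        feats[i] = np.vstack([feats[i], feats[j]])
        del clusters[j], feats[j]
    return clusters


# Toy data: segments 0 and 2 share one mean, segments 1 and 3 another.
segments = [
    [[0, 0], [1, 1]],
    [[10, 10], [11, 11]],
    [[0, 1], [1, 0]],
    [[10, 11], [11, 10]],
]
print(fmd_clustering(segments))  # [[0, 2], [1, 3]]
```

Because the distance is computed from feature means, no per-cluster model has to be trained or re-estimated at each merge, which is consistent with the feature-level (rather than model-level) comparison described in the abstract.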