A Voice Activity Detection Method Based on Blind Source Separation for Microphone Array Signals
doi: 10.3724/SP.J.1146.2005.00717
Abstract: This paper proposes a method for simultaneous voice activity detection (VAD) on the multiple channels of a microphone array in a directional noise field. In such a field, the noise components received by the different microphones are mutually correlated, so blind source separation (BSS) can be applied to any two array signals to separate the directional noise from the speech source and obtain a relatively clean speech signal. A long-term speech information method is applied to this separated speech signal to obtain its VAD result, and the generalized correlation method is used to estimate the time delay of each microphone signal relative to it. Taking the VAD result of the relatively clean speech signal as a reference and shifting it by the corresponding delay yields the VAD results for all microphone channels at once. Computer simulations show that, under a variety of directional-noise conditions, the method detects voice activity well in multichannel microphone signals corrupted by additive noise.
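The last two steps described in the abstract (estimating each channel's delay by generalized correlation, then shifting the reference VAD result by that delay) can be sketched as follows. This is only a minimal illustration under stated assumptions, not the authors' implementation: the PHAT weighting is assumed here as one common choice among generalized correlation weightings, the VAD result is assumed to be a per-sample 0/1 track, the function names `gcc_phat_delay` and `shift_vad` are hypothetical, and the BSS and long-term VAD stages are omitted.

```python
import numpy as np


def gcc_phat_delay(sig, ref, max_lag):
    """Estimate the delay of `sig` relative to `ref`, in samples.

    Uses generalized cross-correlation with PHAT weighting (an assumed
    choice of weighting). A positive result means `sig` lags `ref`.
    """
    n = len(sig) + len(ref)              # zero-pad to avoid circular wrap-around
    SIG = np.fft.rfft(sig, n=n)
    REF = np.fft.rfft(ref, n=n)
    cross = SIG * np.conj(REF)           # cross-power spectrum
    cross /= np.abs(cross) + 1e-12       # PHAT: keep phase, discard magnitude
    cc = np.fft.irfft(cross, n=n)
    # Reorder so the array covers lags -max_lag .. +max_lag.
    cc = np.concatenate((cc[-max_lag:], cc[:max_lag + 1]))
    return int(np.argmax(cc)) - max_lag


def shift_vad(vad, delay):
    """Shift a per-sample 0/1 VAD track by `delay` samples, padding with 0."""
    out = np.zeros_like(vad)
    if delay > 0:
        out[delay:] = vad[:-delay]
    elif delay < 0:
        out[:delay] = vad[-delay:]
    else:
        out[:] = vad
    return out


def vad_for_all_channels(channels, clean_speech, clean_vad, max_lag):
    """Produce one VAD track per microphone channel from the reference VAD."""
    results = []
    for ch in channels:
        delay = gcc_phat_delay(ch, clean_speech, max_lag)
        results.append(shift_vad(clean_vad, delay))
    return results
```

In this sketch `clean_speech` stands for the relatively clean signal produced by the separation stage and `clean_vad` for its VAD decision; `max_lag` would be chosen from the array geometry and sampling rate.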