Ma Xiao-hong, Liang Li-li, Yin Fu-liang. A Voice Activity Detection Method Based on Blind Source Separation for Microphone Array Signals[J]. Journal of Electronics & Information Technology, 2007, 29(3): 589-592. doi: 10.3724/SP.J.1146.2005.00717
Citation:
Ma Xiao-hong, Liang Li-li, Yin Fu-liang. A Voice Activity Detection Method Based on Blind Source Separation for Microphone Array Signals[J]. Journal of Electronics & Information Technology, 2007, 29(3): 589-592. doi: 10.3724/SP.J.1146.2005.00717
Ma Xiao-hong, Liang Li-li, Yin Fu-liang. A Voice Activity Detection Method Based on Blind Source Separation for Microphone Array Signals[J]. Journal of Electronics & Information Technology, 2007, 29(3): 589-592. doi: 10.3724/SP.J.1146.2005.00717
Citation:
Ma Xiao-hong, Liang Li-li, Yin Fu-liang. A Voice Activity Detection Method Based on Blind Source Separation for Microphone Array Signals[J]. Journal of Electronics & Information Technology, 2007, 29(3): 589-592. doi: 10.3724/SP.J.1146.2005.00717
A Voice Activity Detection (VAD) method for microphone array signals in directional noise field is proposed. As the noises received by different microphones are correlated with each other in directional noise field, relatively pure speech can be derived from any two array signals by using Blind Source Separation (BSS) method. The generalized correlation method is used to estimate time delay between this relatively pure signal and every channel signals of microphone array. In the same time, a long-term speech information method is applied to the relatively pure speech signal to obtain its VAD result. Then this VAD result is used as reference to produce those of all array signals by the time shifting of it according to each time delay values. Simulation results illustrate the validity of the proposed method.
[1] Gustafsson T, Rao B D, and Trivedi M. Source localization in reverberant environments: modeling and statistical analysis[J].IEEE Trans. on Speech and Audio Processing.2003, 11(6):791-803 [2] Gannot S and Cohen I. Speech enhancement based on the general transfer function GSC and postfiltering[J].IEEE Trans. on Speech and Audio Process.2004, 12 (6):561-571 [3] Tanyer S G and zer H. Voice activity detection in nonstationary noise[J].IEEE Trans. on Speech and Audio Process.2000, 8 (4):478-482 [4] Ramrez J, Segura J C, and Bentez C, et al.. Efficient voice activity detection algorithms using long-term speech information[J].Speech Communication.2004, 42 (3-4):271-287 [5] Chen J F and Ser W. Speech detection using microphone array[J].Electronics Letters.2000, 36(2):181-182 [6] Cao X R and Liu R W. General approach to blind source separation[J].IEEE Trans. on Signal Processing.1996, 44(3):562-571 [7] Cardoso J F. Blind signal separation: Statistical principles[J].Proce. IEEE.1998, 86(10):2009-2025 [8] Comon P. Independent component analysis, A new concept? Signal Processing, 1994, 36(3): 287-314. [9] Hyvarinen A and Oja E. A fast fixed-point algorithm for independent component analysis[J].Neural Computation.1997, 9(7):1483-1492 [10] Siow Yong Low, Nordholm S, and Togneri R. Convolutive blind signal separation with post-processing[J].IEEE Trans. on Speech and Audio Processing.2004, 12(5):539-548 [11] Knapp C and Carter G. The generalized correlation method for estimation of time delay[J].IEEE Trans. on Acoustics, Speech, and Signal Process.1976, 24(4):320-327