Chen Guo-ming, Zhao Li, Zou Cai-rong. Speech Enhancement Based on Masking Properties and Short-Time Spectral Amplitude Estimation[J]. Journal of Electronics & Information Technology, 2007, 29(4): 863-866. doi: 10.3724/SP.J.1146.2005.01338
Citation:
Chen Guo-ming, Zhao Li, Zou Cai-rong. Speech Enhancement Based on Masking Properties and Short-Time Spectral Amplitude Estimation[J]. Journal of Electronics & Information Technology, 2007, 29(4): 863-866. doi: 10.3724/SP.J.1146.2005.01338
Chen Guo-ming, Zhao Li, Zou Cai-rong. Speech Enhancement Based on Masking Properties and Short-Time Spectral Amplitude Estimation[J]. Journal of Electronics & Information Technology, 2007, 29(4): 863-866. doi: 10.3724/SP.J.1146.2005.01338
Citation:
Chen Guo-ming, Zhao Li, Zou Cai-rong. Speech Enhancement Based on Masking Properties and Short-Time Spectral Amplitude Estimation[J]. Journal of Electronics & Information Technology, 2007, 29(4): 863-866. doi: 10.3724/SP.J.1146.2005.01338
In this paper a single channel speech enhancement method for noisy speech signals is proposed, which is based on masking properties of the human auditory system and power spectral density estimation. During the estimation of power spectrum of the speech, the estimator can be modified by the MMSE method and the estimator of short -time spectral amplitude filter is determined by masking properties. In this way, the best trade off among the reduction of noise, the speech distortion and the level of musical residual noise can be found. Experimental results demonstrate the improved performance of the proposed algorithms.
[1] Boll S F. Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. on Acoustics Speech Signal Processing, 1979, 27(2): 112-130. [2] Ephraim Y and Malah D. Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator[J].IEEE Trans. on Acoustic Speech Signal Processing.1984, 32(10):1109-1121 [3] Johnston J D. Transform coding of audio signals using perceptual noise criteria[J].IEEE J. on Selected Areas in Communications.1988, 6(2):314-323 [4] Virag N. Single channel speech enhancement based on masking properties of the human auditory system[J].IEEE Trans. on Speech and Audio Processing.1999, 7(2):126-137 [5] Tsoukalas D E, Mourjopoulos J N, and Kokkinakis G. Speech enhancement based on audible noise suppression[J].IEEE Trans. on Speech and Audio Processing.1997, 5(6):497-514 [6] Painter T and Spanias A. Perceptual coding of digital audio[J].Proc. IEEE.2000, 88(4):451-515 [7] Azirami A A, Bouquin R J L, and Faucon G. Optimizing speech enhancement by exploiting masking properties of the human ears. ICASSP, Detroit, 1995, I: 800-803. [8] Gustafsson S, Jax P, and Vary P. A novel psychoacoustically motivated audio enhancement algorithm preserving background noise characteristics. ICASSP, Seattle, 1998, I: 397-400. [9] Cohen I. Relaxed statistical model for speech enhancement and a priori SNR estimation[J].IEEE Trans. on Speech and Audio Processing.2005, 13 (5):870-881