A Speech Enhancement Algorithm Based on Short-Time Spectral Estimation and the Masking Effect of the Human Ear
doi: 10.3724/SP.J.1146.2005.01338
Speech Enhancement Based on Masking Properties and Short-Time Spectral Amplitude Estimation
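Abstract: This paper combines short-time spectral estimation with the masking effect of the human ear to propose a single-channel speech enhancement algorithm. Under the MMSE criterion, the algorithm tracks the speech with non-fixed parameters, and it uses the masking effect to determine the transfer function of the enhancement filter dynamically so that the filter adapts to changes in the speech signal. Experimental results show that the denoised speech has little distortion and that musical noise is well suppressed.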
Keywords:
- Speech enhancement; auditory masking; musical noise
Abstract: This paper proposes a single-channel speech enhancement method for noisy speech signals that is based on the masking properties of the human auditory system and on power spectral density estimation. The estimate of the speech power spectrum is refined with the MMSE method, and the short-time spectral amplitude filter is determined from the masking properties. In this way, the best trade-off among noise reduction, speech distortion, and the level of residual musical noise can be found. Experimental results demonstrate the improved performance of the proposed algorithm.
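The idea in the abstract, an MMSE-style spectral gain whose strength is set by the masking threshold, can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes a decision-directed a priori SNR estimate with a fixed smoothing factor `alpha` (the paper uses non-fixed parameters), a Wiener-type gain in place of the MMSE short-time spectral amplitude estimator, and a masking threshold that is supplied as an input rather than computed. All function and parameter names are hypothetical, and the noise power is assumed to be strictly positive.

```python
import math


def decision_directed_xi(prev_gain, prev_gamma, gamma, alpha=0.98):
    """A priori SNR estimate for one frequency bin (decision-directed rule)."""
    # Weighted mix of the previous frame's clean-speech estimate and the
    # current instantaneous estimate max(gamma - 1, 0).
    return alpha * prev_gain ** 2 * prev_gamma + (1.0 - alpha) * max(gamma - 1.0, 0.0)


def masked_gain(xi, noise_psd, mask_threshold):
    """Wiener-type gain, relaxed where the residual noise would be inaudible."""
    wiener = xi / (1.0 + xi)
    # Gain at which the attenuated noise power G^2 * noise_psd equals the
    # masking threshold; in this sketch, attenuating further is treated as
    # unnecessary because the residual noise is assumed to be masked.
    audible_limit = min(1.0, math.sqrt(mask_threshold / noise_psd))
    return max(wiener, audible_limit)


def enhance_frame(noisy_power, noise_psd, mask_threshold, prev_gain, prev_gamma):
    """Return (gains, gammas) for one frame; every argument is a per-bin list."""
    gains, gammas = [], []
    for y2, n, t, g0, gam0 in zip(noisy_power, noise_psd, mask_threshold,
                                  prev_gain, prev_gamma):
        gamma = y2 / n  # a posteriori SNR (assumes n > 0)
        xi = decision_directed_xi(g0, gam0, gamma)
        gains.append(masked_gain(xi, n, t))
        gammas.append(gamma)
    return gains, gammas
```

In use, the returned `gains` would multiply the noisy spectral amplitudes of the current frame, and `gains` and `gammas` would be passed back in as `prev_gain` and `prev_gamma` for the next frame. Taking the larger of the two gains means a bin is attenuated only as far as needed to push the noise under the masking threshold, which is one common way to trade noise reduction against speech distortion.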