基于噪声被掩蔽概率的优化语音增强方法

卜凡亮; 王为民; 戴启军; 陈砚圃

基于噪声被掩蔽概率的优化语音增强方法

1.
北京大学信息科学技术学院北京 100871
2.
西安交通大学生命科学技术学院西安 710049
3.
西安通信学院计算中心西安 710106

计量
- 文章访问数: 2630
- HTML全文浏览量: 75
- PDF下载量: 811
- 被引次数: 0
出版历程
- 收稿日期: 2004-05-27
- 修回日期: 2004-12-16
- 刊出日期: 2005-05-19

Optimizing Speech Enhancement Based on Noise Masked Probability

摘要

摘要: 利用听觉系统的掩蔽特性,提出了一种优化的语音增强方法。研究表明,噪声被语音掩蔽的概率是噪声强度和听觉掩蔽阈值的函数。考虑到噪声在带噪语音中的出现具有不确定性,各语音谱分量的最终估计由对带噪语音的谱分量和用传统的增强方法估计的谱分量的加权求得,加权因子由噪声被掩蔽概率确定。语音增强性能的评估结果表明,这种优化的语音增强方法在减少语音失真与加强噪声抑制之间取得了良好的折衷,减少了语音的听觉失真, 有效地抑制了音乐噪声,提高了增强语音的清晰度。
- 语音增强; 听觉掩蔽效应; 语音清晰度; 音乐噪声
Abstract: An optimal approach for enhancing a speech signal degraded by uncorrelated stationary additive noise, which exploits auditory perception properties, is proposed. The speech spectra estimate is performed in two cases: noisy speech spectra for noise masked and classical estimate for noise unmasked. Taking account into the uncertainty of the noise presence, the enhanced speech signal spectra are obtained by a weighted sum of these two estimates, where the weights are given by the noise masked probability. The performance of the proposed speech enhancement approach has been evaluated with speech distortion and informal listening tests. Comparing with Aziranis method and classical estimator, results show that a better compromise between reducing speech distortion and reinforcing noise suppression has been made, speech distortion has been decreased apparently, musical noise has been suppressed and speech articulation has been improved.

HTML全文

参考文献(1)

Ephraim Y, Malah D. Speech enhancement using a minimum mean square error short-time spectral amplitude estimator. IEEE Trans. onASSP, 1984, 32(6): 1109- 1121.[2]Cappe O. Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor[J].IEEE Trans. on Speech and Audio Processing.1994, 2(3):345-[3]McAulay R J, Malpass M L. Speech enhancement using a soft decision noise suppression filter[J].IEEE Trans. on ASSP.1980,28(2):137-[4]曹志刚,郑文涛.基于短时谱最小均方误差估计的语音增强和剩余噪声衰减.电子学报,1993,21(4):7-12.[5]陆生礼,时龙兴,余崇智,等.听觉模拟的语音增强方法.声学学报,1996,21(6):879-883.[6]Virag N. Single channel speech enhancement based on masking properties of the human auditory system[J].IEEE Trans. on Speech and Audio Processing.1999, 7(2):126-[7]Tsoukalas D E, M Paraskevas, Mourjopoulos J N. Speech enhancement using psychoacoustic criteria. ICASSP, Salt Lake City, 1993, Ⅱ: 359 - 362.[8]Azirani A, Jeannes R L B, Faucon G. Optimizing speech enhancement by exploiting masking properties of the human ear.ICASSP, Detroit, 1995, I: 800 - 803.[9]沈永欢,梁在中,许履瑚,等.实用数学手册.北京:科学出版社,1997:477-479.[10]Bu Fanliang, Hou Zhen, Wen Yuan, et al.. An estimation of noise parameters in noisy speech signals. NCMMSC6, Shenzhen, 2001:71 - 74.[11]Johnston J D. Transform coding of audio signal using perceptual noise criteria[J].IEEE J. on Select. Areas Commun.1988, 6(2):314-

施引文献

资源附件(0)

访问统计