A Method Based on Fractional Spectral Subtraction for Speech Enhancement

Wang Zhenli (王振力); Zhang Xiongwei (张雄伟)

doi:10.3724/SP.J.1146.2005.01002

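Funding:

Supported by the Natural Science Foundation of Jiangsu Province (BK2006001) and the Jiangsu Province Laboratory of Image Processing and Image Communication (KJS03036)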

Publication history
- Received: 2005-08-15
- Revised: 2006-04-12
- Published: 2007-05-19

A Method Based on Fractional Spectral Subtraction for Speech Enhancement

Abstract

This paper proposes a speech enhancement method based on fractional spectral subtraction (FSS). The method applies the fractional Fourier transform (FRFT) to the noisy speech signal, subtracts an estimated fractional noise spectrum from the resulting fractional speech-plus-noise spectrum, and recovers the denoised speech with the inverse fractional Fourier transform. Theoretical analysis shows that an optimal fractional order exists at which speech and noise are best separated in the fractional transform domain, which improves the quality of the enhanced speech. Computer simulations on male and female speech corrupted by additive white Gaussian noise show that the proposed method outperforms conventional spectral subtraction (SS) in both SNR improvement and Itakura distance reduction, and that musical noise in the enhanced speech is markedly suppressed.

Keywords: fractional Fourier transform; speech enhancement; spectral subtraction; musical noise
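
The three steps named in the abstract (forward fractional transform, subtraction of an estimated noise spectrum, inverse fractional transform) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the names `frft` and `fss_enhance`, the frame length, the Hann window with 50% overlap, the noise estimate taken from the first few frames, the half-wave rectification, and the reuse of the noisy phase. In particular, `frft` is a simple weighted-type fractional power of the unitary DFT, used here as a stand-in; the paper's own discrete FRFT algorithm is not specified in the abstract and may well differ.

```python
import numpy as np


def frft(x, a):
    """Weighted-type fractional power of the unitary DFT (illustrative stand-in).

    Order a = 1 gives the ordinary unitary DFT, a = 0 the identity, and
    orders add: frft(frft(x, a), b) equals frft(x, a + b).
    """
    x = np.asarray(x, dtype=complex)
    # Integer powers of the unitary DFT F: F^0 = I, F^1, F^2 (index reversal), F^3 = F^-1.
    powers = [
        x,
        np.fft.fft(x, norm="ortho"),
        np.roll(x[::-1], 1),
        np.fft.ifft(x, norm="ortho"),
    ]
    k = np.arange(4)
    out = np.zeros_like(x)
    for j, p in enumerate(powers):
        # Weight of F^j in the spectral expansion of F^a.
        c = np.mean(np.exp(-0.5j * np.pi * k * (a - j)))
        out += c * p
    return out


def fss_enhance(noisy, a, frame_len=256, noise_frames=5):
    """Frame-wise magnitude subtraction in the order-a domain (hypothetical sketch)."""
    noisy = np.asarray(noisy, dtype=float)
    hop = frame_len // 2
    # Periodic Hann window: copies shifted by half a frame sum to one.
    window = np.hanning(frame_len + 1)[:-1]
    n_frames = 1 + (len(noisy) - frame_len) // hop
    spectra = [
        frft(noisy[i * hop:i * hop + frame_len] * window, a)
        for i in range(n_frames)
    ]
    # Assumption: the first few frames contain noise only.
    noise_mag = np.mean([np.abs(s) for s in spectra[:noise_frames]], axis=0)
    out = np.zeros(len(noisy))
    for i, s in enumerate(spectra):
        # Subtract the noise magnitude, clip at zero, keep the noisy phase.
        mag = np.maximum(np.abs(s) - noise_mag, 0.0)
        frame = frft(mag * np.exp(1j * np.angle(s)), -a)
        # Overlap-add the real part of the inverse transform.
        out[i * hop:i * hop + frame_len] += frame.real
    return out
```

With `a = 1.0` the sketch reduces to conventional magnitude spectral subtraction on the DFT. One way to explore the abstract's claim about an optimal order would be to sweep `a` over a grid and compare an objective measure such as output SNR against a known clean signal.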


