A Method Based on Fractional Spectral Subtraction for Speech Enhancement

Wang Zhen-li; Zhang Xiong-wei

doi:10.3724/SP.J.1146.2005.01002

Volume 29 Issue 5

Jan. 2011

Turn off MathJax

Article Contents

Article Navigation > Journal of Electronics & Information Technology > 2007 > 29(5): 1096-1100

Wang Zhen-li, Zhang Xiong-wei. A Method Based on Fractional Spectral Subtraction for Speech Enhancement[J]. Journal of Electronics & Information Technology, 2007, 29(5): 1096-1100. doi: 10.3724/SP.J.1146.2005.01002

Citation:

Wang Zhen-li, Zhang Xiong-wei. A Method Based on Fractional Spectral Subtraction for Speech Enhancement[J]. Journal of Electronics & Information Technology, 2007, 29(5): 1096-1100. doi: 10.3724/SP.J.1146.2005.01002

Citation:

PDF( 303 KB)

A Method Based on Fractional Spectral Subtraction for Speech Enhancement

doi: 10.3724/SP.J.1146.2005.01002 cstr: 32379.14.SP.J.1146.2005.01002

Received Date: 2005-08-15
Rev Recd Date: 2006-04-12
Publish Date: 2007-05-19

Abstract

Abstract

A method based on fractional spectral subtraction for speech enhancement (FSS) is proposed. It applies FRFT (FRactional Fourier Transform) to noisy speech, the estimated fractional noise spectrum is then subtracted from the derived fractional speech-noise spectrum. Finally, the denoised speech is obtained by inverse fractional Fourier transform. Theory analysis indicates that the new method can find an optimal fractional order, which can best separate speech for noisy data in the fractional Fourier domain. The result is that the performance of the enhanced speech is effectively improved. Computer simulation shows that the SNR improvement amount and Itakura distance decrease amount of the proposed method are superior to those of Spectral Subtraction (SS) for male/female speech corrupted by additive white Gaussian noise. In addition, the music noise of the enhanced speech is remarkably suppressed.

FullText(HTML)

References(1)

References

Ozaktas H M, Barshan B, Mendlovic D, and Onural L. Convolution, filtering. and multiplexing in fractional Fourier domains and their relation to chirp and wavelet transforms. J. Opt. Soc. Amer. A, 1994, 11(2): 547-559.[2]Namias V. The fractional Fourier transform and its application in quantum mechanics[J].J. Inst. Maths. Application.1980, 25(1):241-265[3]McBride A C and Kerr F H. On Namiass fractional Fourier transform[J].IMAJ. Applied Math.1987, 39(2):159-175[4]Almeida L B. The fractional Fourier transform and time-frequency representations[J].IEEE Trans. on Signal Processing.1994, 42(11):3084-3091[5]Ozaktas H M, Arikan O, Kutay M A, and Bozdagi G. Digital computation of The fractional Fourier transform[J].IEEE Trans. on Signal Processing.1996, 44(9):2141-2150[6]Candan C, Kutay M A, and Ozaktas H M. The discrete fractional Fourier transform[J].IEEE Trans. on Signal Processing.2000, 48(5):1329-1337[7]Vogel K and Risken H. Determination of quasiprobability distributions in terms of probability distributions for the rotated quadrature phase[J].Phys. Rev. A.1989, 40(5):2847-2849[8]Mendlovic D, Ozaktas H M, and Lohmann A W. Fractional correlation[J].Appl. Opt.1995, 34(2):303-309[9]Fonollosa J R and Nikias C L. A new positive time-frequency distribution. in Pro. IEEE Int. Conf. Acoust., Speech, Signal Processing, New Jersey, 1994: 301-304.[10]Boll S F. Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. on Acoust, Speech, Signal Processing, 1979, ASSP-27(2): 113-120.[11]Sim B L, Tong Y C, Chang J S, and Tan C T. A parametric formulation of the generalized spectral subtraction method[J].IEEE Trans. on. Speech and Audio Processing.1998, 6(4):328-337[12]Berouti M, Schwartz R, and Makhoul J. Enhancement of speech corrupted by acoustic noise. in Proc. ICASSP, Washington DC, 1979: 208-211.[13]Kushner W M, Vladimir G, Wu C, Nguyen V, and Damoulakis J N. The effects of subtractive-type speech enhancement/noise reduction algorithms on parameter estimation for improved recognition and coding in high noise environments. In Proc. ICASSP, Glasgow Scotland, 1989: 211-214.[14]Widrow B, et al.. Adaptive noise cancelling, principles and applications[J].IEEE Proc.1975, 63(12):1692-1716[15]Frazier R H.[J].Samsam S, Braida L D, and Oppenheim A V. Enhancement of speech by adaptive filtering. ICASSP76, Philadelphia.1976,:-[16]Gibson J D, Koo B, and Gray S D. Filtering of colored noise for speech enhancement and coding. IEEE Trans. on Acoust., Speech, Signal Processing, 1991, ASSP-39(8): 1732-1742.[17]Lim J S and Oppenheim A V. All-pole modeling of degraded speech. IEEE Trans. on Acoust., Speech, Signal Processing, 1978, ASSP-26(3): 197-210.[18]Ephraim Y. A Bayesian estimation approach for speech enhancement using Hidden Markov models[J].IEEE Trans. on Signal Processing.1992, 40(4):725-735[19]Ephraim Y, Malah D, and Juang H. On the application of Hidden Markov models for enhancing noise speech[J].IEEE Trans. on Acoust., Speech, Signal Processing.1989, 37(12):1846-1856[20]Knecht W G, Schenkel M E, and Moschytz G S. Neural network filters for speech enhancement[J].IEEE Trans. on Speech. Audio Processing.1995, 3(6):433-438[21]Donoho D L. De-Noising by Soft-Thresholding[J].IEEE Trans. on Inform. Theory.1995, 41(3):613-627[22]Donoho D L and Johnstone I M. Adapting to Unknown Smoothness via Wavelet Shrinkage[J].Journal of the American Statistical Association.1995, 90(432):1200-1224[23]Donoho D L, Johnstone I M, Kerkyacharian G and Picard D. Wavelet Shrink: Asymptopia. Journal of the Royal Statistical Society, Series B, 1995, 57(2): 301-369.[24]Ephraim Y and Van Trees H L. A signal subspace approach for speech enhancement[J].IEEE Trans. on Speech Audio Processing.1995, 3(4):251-266[25]Mittal U and Phamdo N. Signal/noise KLT based approach for enhancing speech degraded by colored noise[J].IEEE Trans. on Speech Audio Processing.2000, 8(2):159-167[26]Rezayee A and Gazor S. An adaptive KLT approach for speech enhancement[J].IEEE Trans. on Speech Audio Processing.2001, 9(2):87-95[27]Santhanam B and McClellan J H. The discrete rotational Fourier transform. IEEE Trans. on Signal Processing, 1996, 42(4): 994-998.[28]Pei Soo-Chang, Yeh Min-Huang, and Tseng Chien-Cheng. Discrete fractional Fourier Transforms based on orthogonal projections[J].IEEE Trans. on Signal Processing.1999, 47(5):1335-1347[29]平先军，陶然, 周思永等. 一种新的分数阶Fourier变换快速算法. 电子学报, 2001, 29(3): 406-408.[30]Wang D L and Lim J S. The unimportance of phase in speech enhancement. IEEE Trans. on Acoust., Speech, Signal Processing, 1982, ASSP-30(8): 679-681.[31]Picinbono B. Random Signals and Systems. Englewood Cliffs, NJ: Prentice Hall, 1993: 136-138.[32]Spiegel M R. Schaums Outline of Theory and Problems of Mathematical Handbook of Formulas and Tables. Int.ed. New York: McGrawHill, 1990.

Relative Articles

Supplements(0)

Cited By

Proportional views