Volume 29 Issue 5
Wang Zhen-li, Zhang Xiong-wei. A Method Based on Fractional Spectral Subtraction for Speech Enhancement[J]. Journal of Electronics & Information Technology, 2007, 29(5): 1096-1100. doi: 10.3724/SP.J.1146.2005.01002

A Method Based on Fractional Spectral Subtraction for Speech Enhancement

doi: 10.3724/SP.J.1146.2005.01002
  • Received Date: 2005-08-15
  • Revision Received Date: 2006-04-12
  • Publish Date: 2007-05-19
  • Abstract: A speech enhancement method based on fractional spectral subtraction (FSS) is proposed. The method applies the fractional Fourier transform (FRFT) to the noisy speech, subtracts the estimated fractional noise spectrum from the resulting fractional spectrum of the noisy speech, and recovers the denoised speech with the inverse FRFT. Theoretical analysis indicates that the method can find an optimal fractional order that best separates speech from noise in the fractional Fourier domain, and the quality of the enhanced speech improves as a result. Computer simulations show that, for male and female speech corrupted by additive white Gaussian noise, the proposed method achieves a larger SNR improvement and a larger reduction in Itakura distance than conventional spectral subtraction (SS). In addition, the musical noise in the enhanced speech is markedly suppressed.
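
The three steps in the abstract (forward FRFT, subtraction of the noise estimate, inverse FRFT) can be sketched in Python as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say which discrete FRFT is used, so the sketch substitutes a simple weighted-sum fractional power of the unitary DFT, which is possible because the fourth power of the unitary DFT is the identity. The function names `frft`, `estimate_noise_mag`, and `fractional_spectral_subtraction`, the magnitude-domain subtraction with half-wave rectification, and the reuse of the noisy phase are all assumptions made for the example.

```python
import numpy as np


def frft(x, a):
    """Apply a fractional power F^a of the unitary DFT to a 1-D signal.

    Assumption: weighted-sum form F^a = sum_{m=0..3} c_m(a) F^m, valid
    because F^4 = I. This is a stand-in for whichever discrete FRFT the
    paper actually uses.
    """
    x = np.asarray(x, dtype=complex)
    k = np.arange(4)
    # c_m(a) = (1/4) * sum_k exp(i*pi*k*(m - a)/2)
    coeffs = [np.exp(1j * np.pi * k * (m - a) / 2).mean() for m in range(4)]
    powers = [
        x,                             # F^0 x
        np.fft.fft(x, norm="ortho"),   # F^1 x (unitary DFT)
        np.roll(x[::-1], 1),           # F^2 x: index reversal n -> -n mod N
        np.fft.ifft(x, norm="ortho"),  # F^3 x = F^-1 x
    ]
    return sum(c * p for c, p in zip(coeffs, powers))


def estimate_noise_mag(noise_frames, a):
    """Average fractional-domain magnitude over noise-only frames."""
    return np.mean([np.abs(frft(f, a)) for f in noise_frames], axis=0)


def fractional_spectral_subtraction(noisy, noise_mag, a):
    """Denoise one frame by magnitude subtraction in the order-a domain."""
    spec = frft(noisy, a)
    # Subtract the noise magnitude and clip negative results to zero.
    mag = np.maximum(np.abs(spec) - noise_mag, 0.0)
    # Assumption: keep the phase of the noisy fractional spectrum.
    cleaned = mag * np.exp(1j * np.angle(spec))
    # Inverse transform is the order -a transform; any residual
    # imaginary part is discarded.
    return frft(cleaned, -a).real


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = np.arange(256)
    clean = np.sin(2 * np.pi * 0.05 * n)          # toy stand-in for speech
    noisy = clean + 0.3 * rng.standard_normal(n.size)
    noise_frames = 0.3 * rng.standard_normal((20, n.size))
    a = 0.8                                        # hypothetical order
    enhanced = fractional_spectral_subtraction(
        noisy, estimate_noise_mag(noise_frames, a), a
    )
    print(enhanced.shape)
```

With `a = 1` the transform is the ordinary unitary DFT, so the sketch reduces to conventional spectral subtraction on a single frame. Searching for the optimal order described in the abstract would mean repeating the call over a grid of `a` values and scoring each result, which is not shown here.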