Multi-band Spectral Subtraction of Speech Enhancement Based on Maximum Posteriori Phase Estimation

LI Zhen; WU Wenjin; ZHANG Qin; REN Hui

doi:10.11999/JEIT161381

Volume 39 Issue 9

Sep. 2017




Citation:

LI Zhen, WU Wenjin, ZHANG Qin, REN Hui. Multi-band Spectral Subtraction of Speech Enhancement Based on Maximum Posteriori Phase Estimation[J]. Journal of Electronics & Information Technology, 2017, 39(9): 2282-2286. doi: 10.11999/JEIT161381


Funds:

The National Science and Technology Planning Project (2012BAH38F00)

Received Date: 2016-12-21
Revised Date: 2017-04-25
Publish Date: 2017-09-19

Abstract

Spectral subtraction is widely used for speech enhancement because it is simple and easy to implement. The method subtracts an estimate of the noise magnitude spectrum from the magnitude spectrum of the noisy signal and leaves the phase of the noisy signal unchanged. Reusing the noisy phase introduces estimation error, especially at low SNR, and inaccurate noise estimation produces musical noise. This paper proposes a multi-band spectral subtraction algorithm based on maximum a posteriori (MAP) phase estimation. Experimental results show that the proposed method outperforms the conventional method, especially at low SNR.
Keywords:

- Speech enhancement
- Maximum posteriori phase estimation
- Multi-band spectral subtraction
- Low SNR
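
For orientation, the conventional method that the abstract describes (subtract a noise magnitude estimate, keep the noisy phase) can be sketched in a few lines of NumPy. This is only an illustration of the baseline, not the multi-band MAP-phase algorithm proposed in the paper. The function name `spectral_subtract`, the frame length, the hop size, the spectral floor, and the assumption that the first few frames contain noise only are all illustrative choices that do not come from the paper.

```python
import numpy as np


def spectral_subtract(noisy, noise_frames=5, frame_len=256, hop=128, floor=0.02):
    """Basic magnitude spectral subtraction that reuses the noisy phase.

    Illustrative sketch only; all parameter values are assumptions.
    """
    noisy = np.asarray(noisy, dtype=float)
    # Periodic Hann window: with 50% overlap the shifted windows sum to 1,
    # so plain overlap-add reconstructs the signal without extra scaling.
    window = np.hanning(frame_len + 1)[:-1]
    n_frames = 1 + (len(noisy) - frame_len) // hop

    # Short-time Fourier analysis.
    frames = np.stack(
        [noisy[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    spec = np.fft.rfft(frames, axis=1)
    mag, phase = np.abs(spec), np.angle(spec)

    # Assumption: the first `noise_frames` frames contain noise only.
    noise_mag = mag[:noise_frames].mean(axis=0)

    # Subtract the noise magnitude estimate; a small spectral floor keeps
    # the result non-negative.
    clean_mag = np.maximum(mag - noise_mag, floor * mag)

    # Recombine with the unchanged noisy phase, as in the conventional method.
    clean_frames = np.fft.irfft(clean_mag * np.exp(1j * phase), n=frame_len, axis=1)

    # Overlap-add synthesis.
    out = np.zeros(len(noisy))
    for i in range(n_frames):
        out[i * hop : i * hop + frame_len] += clean_frames[i]
    return out
```

Calling `spectral_subtract(x)` on a one-dimensional array `x` whose opening frames are noise-only returns an array of the same length. Because the phase is copied from the noisy signal and a single noise estimate is used for every frame, this baseline shows the two weaknesses the abstract names: phase error at low SNR and musical noise from inaccurate noise estimation.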


