Speech Enhancement with Multivariate Laplace Speech Model

Zhou Bin; Zou Xia; Zhang Xiong-Wei

doi:10.3724/SP.J.1146.2011.01312

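Funding:

Supported by the Natural Science Foundation of Jiangsu Province (BK2009059) and the National Postdoctoral Science Foundation of China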

Publication history
- Received: 2011-12-12
- Revised: 2012-04-01
- Published: 2012-07-19

Speech Enhancement with Multivariate Laplace Speech Model

Zhou Bin*,
Zou Xia,
Zhang Xiong-Wei

Abstract

Abstract: Traditional short-time spectral estimation algorithms for speech enhancement usually assume that the spectral components of speech are mutually independent, which ignores the correlation between them. To address this problem, this paper proposes a new short-time spectral estimation algorithm based on a multivariate Laplace distribution model. First, the Discrete Cosine Transform (DCT) coefficients of speech are assumed to follow a multivariate Laplace distribution, so that the correlation between spectral components can be exploited. Then, using the Gaussian scale mixture representation of multivariate random vectors, an analytical expression for the Minimum-Mean-Square-Error (MMSE) estimate of the speech DCT coefficient vector is derived. Furthermore, the speech presence probability under the proposed model is derived and used to modify the MMSE estimator. Experimental results show that the proposed algorithm outperforms traditional speech enhancement methods in suppressing background noise and reducing speech distortion.
- Speech enhancement
- Minimum-Mean-Square-Error (MMSE)
- Multivariate Laplace distribution model
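
The abstract does not give the estimator itself, so the following is only a minimal numerical sketch of the general idea of MMSE estimation under a Gaussian scale mixture, not the paper's closed-form result. It assumes the symmetric multivariate Laplace vector can be written as x = sqrt(z) g with g Gaussian and z exponentially distributed with unit mean, that the noise is white Gaussian with a known variance, and that the speech covariance is known. The function name `gsm_laplace_mmse` and all parameters are hypothetical, and the integral over the scale z is approximated by Gauss-Laguerre quadrature instead of being solved analytically.

```python
import numpy as np


def gsm_laplace_mmse(y, speech_cov, noise_var, num_nodes=32):
    """Numerically approximate E[x | y] for y = x + n.

    Assumed model (illustrative, not taken from the paper):
      x = sqrt(z) * g,  g ~ N(0, speech_cov),  z ~ Exp(1)
      n ~ N(0, noise_var * I)
    """
    y = np.asarray(y, dtype=float)
    # Gauss-Laguerre nodes and weights approximate integrals of
    # f(z) * exp(-z) over z >= 0, which matches the Exp(1) prior on z.
    nodes, weights = np.polynomial.laguerre.laggauss(num_nodes)
    # Diagonalise the speech covariance once; for every z the noisy
    # covariance z * C + noise_var * I shares the same eigenvectors.
    eigvals, eigvecs = np.linalg.eigh(speech_cov)
    y_rot = eigvecs.T @ y
    # Variance of y along each eigen-direction, for each quadrature node.
    var = nodes[:, None] * eigvals[None, :] + noise_var
    # Log-likelihood of y given z, up to a constant shared by all nodes.
    log_lik = -0.5 * np.sum(np.log(var) + y_rot**2 / var, axis=1)
    # Discrete posterior over the scale z at the quadrature nodes.
    post = weights * np.exp(log_lik - log_lik.max())
    post /= post.sum()
    # Conditional on z the MMSE estimate is a Wiener filter; average
    # its per-direction gain under the posterior of z.
    gain = (nodes[:, None] * eigvals[None, :]) / var
    x_rot = (post[:, None] * gain * y_rot[None, :]).sum(axis=0)
    return eigvecs @ x_rot


if __name__ == "__main__":
    # Toy 2-dimensional example with correlated components (made-up values).
    cov = np.array([[1.0, 0.6], [0.6, 1.0]])
    noisy = np.array([1.5, -0.5])
    print(gsm_laplace_mmse(noisy, cov, noise_var=0.5))
```

Because every conditional Wiener gain lies between 0 and 1, the estimate always shrinks the noisy vector, and the off-diagonal entries of `speech_cov` are what let correlated components inform each other. A speech presence probability, as mentioned in the abstract, would further scale this estimate; it is not modelled here.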