高级搜索

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

基于混合线性变换的语声转换算法

简志华 杨震

简志华, 杨震. 基于混合线性变换的语声转换算法[J]. 电子与信息学报, 2007, 29(7): 1700-1702. doi: 10.3724/SP.J.1146.2006.00787
引用本文: 简志华, 杨震. 基于混合线性变换的语声转换算法[J]. 电子与信息学报, 2007, 29(7): 1700-1702. doi: 10.3724/SP.J.1146.2006.00787
Jian Zhi-hua, Yang Zhen. An Algorithm for Voice Conversion Based on Mixtures of Linear Transformation[J]. Journal of Electronics & Information Technology, 2007, 29(7): 1700-1702. doi: 10.3724/SP.J.1146.2006.00787
Citation: Jian Zhi-hua, Yang Zhen. An Algorithm for Voice Conversion Based on Mixtures of Linear Transformation[J]. Journal of Electronics & Information Technology, 2007, 29(7): 1700-1702. doi: 10.3724/SP.J.1146.2006.00787

基于混合线性变换的语声转换算法

doi: 10.3724/SP.J.1146.2006.00787
基金项目: 

江苏省青蓝工程项目(QL003YZ)资助课题

An Algorithm for Voice Conversion Based on Mixtures of Linear Transformation

  • 摘要: 针对在没有对称语音库的情况下,该文提出了一种基于混合线性变换的语声转换算法,在最大似然估计准则下,使用EM迭代算法计算变换函数的参量。为了减小线性加权对语音谱包络的平滑作用,使用线性调频Z变换来调节语音信号的LPC系数。客观评测和主观感受的实验结果都表明,基于混合线性变换的语声转换算法也可以取得与传统语声转换技术相当的转换效果,解除了传统语声转换技术需要对称语音库的要求。
  • Childers D G, Wu K, and Hicks D M, et al.. Voice conversion[J].Speech Communication.1989, 8(2):147-158[2]Abe M, Nakamura S, Shikano K, and Kuwabara H. Voice conversion through vector quantization. IEEE Proceedings of ICASSP, New York, USA, Apr. 11-14, 1988: 565-568.[3]Arslan L M. Speaker transformation algorithm using segmental codebooks[J].Speech Communication.1999, 28(3):211-226[4]Narendranath M, Murthy H A, and Rajendran S, et al.. Transformation of formants for voice conversion using artificial neural networks[J].Speech Communication.1995, 16(2):207-216[5]Iwahashi N and Sagisaka Y. Speech spectrum conversion based on speaker interpolation and multi-functional representation with weighting by radial basis function networks[J].Speech Communication.1995, 16(2):139-151[6]Stylianou Y, Cappe O, and Moulines E. Continuous Probabilistic Transform for Voice Conversion[J].IEEE Trans on Speech and Audio Processing.1998, 6(2):131-142[7]Kain A and Macon M W. Spectral voice conversion for text-to-speech synthesis. IEEE Proceedings of ICASSP, Seattle, USA, May 12-15, 1998: 285-288.[8]Smits R and Yegnanarayana B. Determination of instants of significant excitation in speech using group delay function[J].IEEE Trans. on Speech and Audio Processing.1995, 3(5):325-333[9]Diakoloukas V D and Digalakis V V. Maximum likelihood stochastic transformation adaptation of hidden Markov models[J].IEEE Trans. on Speech and Audio Processing.1999, 7(2):177-187[10]Wang T T. The segmented chirp z-transform and its application in spectrum analysis[J].IEEE Trans. on Instrumentation and Measurement.1990, 39(2):318-324[11]Rao K S and Yegnanarayana B. Prosody modification using instants of significant excitation[J].IEEE Trans. on Audio, Speech and Language.2006, 14(3):972-980[12]Hasan M M, Nasr A M, and Sultana S. An approach to voice conversion using feature statistical mapping[J].Applied Acoustics.2005, 66(5):513-532
  • 加载中
计量
  • 文章访问数:  3090
  • HTML全文浏览量:  105
  • PDF下载量:  762
  • 被引次数: 0
出版历程
  • 收稿日期:  2006-06-06
  • 修回日期:  2006-10-30
  • 刊出日期:  2007-07-19

目录

    /

    返回文章
    返回