Non-uniform and Part-searching-area Algebraic Codebook for Code Excited Linear Prediction Speech Coder
Abstract: This paper presents non-uniform and part-searching-area algebraic codebooks based on the Algebraic Code Excited Linear Prediction (ACELP) speech coding algorithm. The non-uniform algebraic codebook is derived from the non-uniform statistical properties of the pulses in the algebraic codebook, and the part-searching-area algebraic codebook is derived from the periodicity of the algebraic codebook excitation vector. Together, the two techniques compensate for the shortage of signed pulses in the algebraic codebook at low bit rates. To preserve pitch continuity when these techniques are used, the coder applies different pitch estimation methods to speech and non-speech segments. Subjective and objective test results indicate that applying these techniques to a 4 kb/s dispersed-pulse CELP (DP-CELP) speech coder clearly improves the quality of the reconstructed speech, especially for female speakers.