Volume 29 Issue 1
Jan.  2011
Liu Hao-jie, Du Li-min, Fu Yue-wen. F0 Contour Optimization and Its Rules in Chinese[J]. Journal of Electronics & Information Technology, 2007, 29(1): 71-75. doi: 10.3724/SP.J.1146.2005.00452

F0 Contour Optimization and Its Rules in Chinese

doi: 10.3724/SP.J.1146.2005.00452
  • Received Date: 2005-04-22
  • Revised Date: 2005-10-15
  • Publish Date: 2007-01-19
  • In a rule-based speech synthesis system, the fundamental frequency contour (F0 contour) of an utterance is shaped by many phonetic functional units and is not a simple concatenation of the F0 contours of neighboring syllables. To improve the naturalness of synthesized speech, this paper proposes a forward method of F0 contour optimization within Chinese prosodic chunks that integrates contextual factors (such as stress, syllable distortion, and articulation rate) into the F0 contour. Based on this optimization framework, the paper then inversely extracts the optimization parameters (top-line, bottom-line, smoothness, distortion, and stress) from clustered F0 contours of monosyllabic, disyllabic, and trisyllabic chunks using the minimum mean square error (MMSE) principle. It further analyzes how position and tone influence these parameters. The analysis indicates that the extracted parameters are reliable and that the optimization theory is reasonable overall, so rules for the optimization parameters can be derived for different prosodic chunks in a speech synthesis system. In a listening test, the intelligibility score rose from 3.25 before optimization to 3.35 after, and the naturalness score rose from 2.9 to 3.31.
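The abstract describes extracting parameters such as the top-line and bottom-line from F0 contours under a minimum mean square error criterion. The sketch below illustrates that kind of fit in its simplest form and is not the authors' model: it assumes a hypothetical normalized tone template with values in [0, 1] and fits only two parameters, so that the contour is approximated as bottom * (1 - template) + top * template. The function name `fit_top_bottom`, the template, and the sample values are all illustrative assumptions; smoothness, distortion, and stress are not modeled here.

```python
import numpy as np


def fit_top_bottom(f0, template):
    """Least-squares fit of f0 ~ bottom * (1 - template) + top * template.

    `template` is a hypothetical normalized tone shape in [0, 1]; it is an
    assumption made for illustration, not something taken from the paper.
    Returns (bottom, top, mean squared error).
    """
    f0 = np.asarray(f0, dtype=float)
    g = np.asarray(template, dtype=float)
    # The model is linear in (bottom, top), so the minimum mean square
    # error solution is an ordinary least-squares problem.
    design = np.column_stack([1.0 - g, g])
    (bottom, top), *_ = np.linalg.lstsq(design, f0, rcond=None)
    residual = design @ np.array([bottom, top]) - f0
    mse = float(np.mean(residual ** 2))
    return float(bottom), float(top), mse


# Illustrative data only: a rising template and a synthetic contour in Hz.
template = np.linspace(0.0, 1.0, 11)
observed = 120.0 * (1.0 - template) + 200.0 * template

bottom, top, mse = fit_top_bottom(observed, template)
print(bottom, top, mse)
```

Because this toy model is linear in its two parameters, the fit has a closed-form solution. A model with nonlinear parameters, as the fuller set of parameters in the abstract suggests, would generally need an iterative optimizer instead.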
