基于先验知识的三音子模型聚类结构自适应策略
doi: 10.3724/SP.J.1146.2006.00200
Transcendental Information Based Triphone Model Tying Structure Adaptation Strategy
-
摘要: 该文提出了一种基于先验知识的三音子模型聚类结构自适应策略,可以在规模很小的自适应语音库条件下改善三音子声学模型的聚类结构使之更适合应用对象的协同发音特点。以基本声学模型训练过程中的三音子模型聚类结果作为先验知识的聚类中心,依据基本声学模型对自适应语音库的分割,按照最大似然准则迭代地重估新的聚类中心和模型聚类结构。实验表明:基于先验知识的三音子模型聚类结构自适应策略可以在不足两小时的自适应语音库上实现三音子模型聚类结构重估,在针对汉语母语说话人的英语声学模型实验中,该文的模型聚类结构自适应策略可以将系统识别率从74.59%提高到83.63%。
-
关键词:
- 语音识别;三音子模型;模型聚类
Abstract: A Transcendental Information Based (TIB) triphone model tying structure adaptation strategy is delivered, and this strategy can improve the triphone model tying structure to suit the target co-pronunciation features with small amount of adaptation data. The TIB triphone model tying structure adaptation strategy uses the baseline acoustic models triphone model tying result as the transcendental model clustering center, with the adaptation data alignment by the baseline acoustic model, re-estimate the TIB triphone model clustering center and model tying structure recursively under maximum likelihood principle. The experiments show that the TIB triphone model tying structure adaptation strategy can improve the triphone model tying structure with only 2 hours adaptation corpus, and in the experiment of English acoustic model for Chinese speakers, the TIB strategy will increase the recognition accuracy rate from 74.59% to 83.63%. -
Lee K F and Hon H W. Speaker-independent phone recognition using hidden Markov models[J].IEEE Trans on ASSP.1989, 37(11):1641-1648[2]Chang E, Shi Y, Zhou J L, and Huang C. Speech lab in a box: A Mandarin speech toolbox to jumpstart speech related research. Eurospeech 2001, Aalborg, Denmark, 2001.[3]Young S and Evermann G, et al.. The HTK Book (for HTK Version 3.2). Cambridge University Engineering Department, 2002.[4]Lazarides A, Normandin Y, and Kuhn R. Improving decision trees for acoustic modeling. Proceedings of ICSLP'96. Philadelphia, 1996: 1053-1056.[5]Liang W Q, Liu J, and Liu R S. An automatic pronunciation quality assessing algorithm for computer assisted language learning. Chinese Journal of Electronics, 2005, 14(4):639-643.
计量
- 文章访问数: 3482
- HTML全文浏览量: 87
- PDF下载量: 1234
- 被引次数: 0