An Application of Mispronunciation Detecting Network for Computer Assisted Language Learning System
-
摘要: 该文通过将计算机辅助语言学习(Computer Assisted Language Learning, CALL)系统的标准发音网络加入插入和删除路径的方法提出了一种发音错误检测新算法:检错音网络(Error-Detecting Network of Pronunciation, EDNP)错误检测算法。该算法首先对待测语音进行EDNP错误检测,然后通过对检错音网络的二级识别结果使用一级多候选词图进行错误召回的策略,进行错误检测,该算法易于实现并具有平台无关性。实验结果表明:该文提出的EDNP错误检测方法在中国四级考生语音测试库上使删除错误虚警率和漏报率分别达到7.38%和12.25%,插入错误虚警率和漏报率分别达到4.94%和26.17%,且客观评分与专家评分相关度比强制对齐方法的相关度提高了4.29%。Abstract: This paper put forth a technique called Error-Detecting Network of Pronunciation (EDNP) that is applied to Computer Assisted Language Learning (CALL) system. By comparison with state-of-the-art CALL systems, the application of this kind of network is to insert mispronunciation detection routes into task-specific Finite State Grammar (FSG) network and avoid constructing complex mispronounced models. The detailed procedures of how to construct mispronunciation detection network and how to perform an error callback strategy are introduced in this paper. The algorithm is simply to be implemented and is independent to any speech toolkit. The experiments show that the application of this network achieves a False Acceptance Rate (FAR) of 7.38%, as well as a False Rejection Rate (FRR) of 12.25% for the deletion errors and achieves a FAR of 4.94%, as well as a FRR of 26.17% for the insertion errors. Furthermore, compared to traditional forced alignment, there is 4.29% improvement to correlation rate between the objective and the subjective pronunciation quality evaluation by using EDNP.
计量
- 文章访问数: 4074
- HTML全文浏览量: 84
- PDF下载量: 849
- 被引次数: 0