Volume 34 Issue 3
Mar. 2012
Zhang Wen-Lin, Niu Tong, Zhang Lian-Hai, Li Bi-Cheng. Rapid Speaker Adaptation Based on Maximum-likelihood Variable Subspace[J]. Journal of Electronics & Information Technology, 2012, 34(3): 571-575. doi: 10.3724/SP.J.1146.2011.00839

Rapid Speaker Adaptation Based on Maximum-likelihood Variable Subspace

doi: 10.3724/SP.J.1146.2011.00839
  • Received Date: 2011-08-15
  • Rev Recd Date: 2011-11-21
  • Publish Date: 2012-03-19
  • A new rapid speaker adaptation method based on a maximum-likelihood variable subspace is proposed. A set of bases of the speaker space is obtained by performing Principal Component Analysis (PCA) on the Speaker Dependent (SD) model parameters of the training speakers. Unlike conventional subspace-based methods, a subset of these bases is chosen dynamically for each speaker during adaptation, using the maximum likelihood criterion. The new speaker's model is constrained to the subspace spanned by the chosen bases. Because fewer free parameters are required, the new method can estimate a more robust SD model from a very small amount of adaptation data. Speech recognition experiments show that the new method outperforms both the eigenvoice method and the MLLR method, in supervised mode as well as in unsupervised mode.
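
The per-speaker basis selection described in the abstract can be illustrated with a minimal sketch. The abstract does not give the selection procedure, so everything here is an assumption made for illustration: the bases are taken to be orthonormal (as PCA would give), the likelihood is a unit-variance Gaussian on the model mean vector so that the likelihood gain of basis k is proportional to its squared projection coefficient, and the names `select_bases` and `adapt` are hypothetical. It is not the authors' implementation.

```python
from typing import List, Sequence, Tuple


def dot(u: Sequence[float], v: Sequence[float]) -> float:
    """Plain inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def select_bases(
    bases: Sequence[Sequence[float]],
    mean: Sequence[float],
    observed: Sequence[float],
    k: int,
) -> Tuple[List[int], List[float]]:
    """Pick the k bases with the largest likelihood gain for one speaker.

    Assumption: orthonormal bases and a unit-variance Gaussian likelihood,
    so the ML weight of each basis is its projection coefficient and the
    log-likelihood gain is proportional to that coefficient squared.
    """
    offset = [x - m for x, m in zip(observed, mean)]
    coeffs = [dot(e, offset) for e in bases]
    # Rank bases by likelihood gain (squared coefficient), largest first.
    ranked = sorted(range(len(bases)), key=lambda i: coeffs[i] ** 2, reverse=True)
    chosen = sorted(ranked[:k])
    return chosen, [coeffs[i] for i in chosen]


def adapt(
    bases: Sequence[Sequence[float]],
    mean: Sequence[float],
    chosen: Sequence[int],
    weights: Sequence[float],
) -> List[float]:
    """Rebuild the speaker model inside the subspace spanned by the chosen bases."""
    model = list(mean)
    for i, w in zip(chosen, weights):
        model = [m + w * e for m, e in zip(model, bases[i])]
    return model


# Toy data: four orthonormal bases in a 4-dimensional "speaker space".
toy_bases = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
]
toy_mean = [0.0, 0.0, 0.0, 0.0]
toy_observed = [0.1, 2.0, -0.05, -1.5]  # rough estimate from adaptation data

chosen, weights = select_bases(toy_bases, toy_mean, toy_observed, k=2)
adapted = adapt(toy_bases, toy_mean, chosen, weights)
```

In this toy case the two bases with the largest coefficients are kept and the small, noisy components are discarded. A fixed-subspace method such as eigenvoice would instead use the same leading bases for every speaker. With real acoustic models the likelihood would be weighted by Gaussian occupancies and covariances, and the selection would no longer reduce to a simple ranking of coefficients.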

