Volume 29 Issue 3
Jan.  2011
Zheng De-quan, Li Sheng, Zhao Tie-jun, Yu Hao. Research on Automatic Text Classification Based on a Hybrid Language Model[J]. Journal of Electronics & Information Technology, 2007, 29(3): 601-605. doi: 10.3724/SP.J.1146.2005.01015

Research on Automatic Text Classification Based on a Hybrid Language Model

doi: 10.3724/SP.J.1146.2005.01015
  • Received Date: 2005-08-17
  • Rev Recd Date: 2006-01-11
  • Publish Date: 2007-03-19
  • As the volume of information available on the Internet and corporate intranets continues to increase, text classification has become one of the key technologies for organizing and processing large amounts of document data. This paper presents a novel method for Chinese text categorization that combines ontology with statistical methods. First, a linguistic ontology knowledge bank is acquired for each class by learning from that class's training corpus, so that each bank characterizes one category. For a given document, an evaluation value is then computed against each class's linguistic ontology knowledge bank, and the document is assigned to the category with the highest evaluation value. The method is compared with Bayes, k-nearest neighbor, and support vector machine classifiers. Preliminary experimental results show that the method outperforms these previous approaches.
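The decision rule described in the abstract (build one model per class from its training corpus, evaluate a document against every class model, and pick the highest evaluation value) can be illustrated with a minimal sketch. The abstract does not specify how the linguistic ontology knowledge banks are built or how the evaluation value is computed, so the sketch below substitutes a plain per-class term-count table with add-one smoothed log-likelihood scoring. The function names `train_class_models`, `score`, and `classify`, and the toy training data, are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def train_class_models(labeled_docs):
    """Build one term-count table per class.

    ASSUMPTION: this table is only a stand-in for the paper's per-class
    linguistic ontology knowledge bank, whose structure is not given here.
    """
    models = {}
    for label, tokens in labeled_docs:
        models.setdefault(label, Counter()).update(tokens)
    return models


def score(tokens, counts, vocab_size):
    """Add-one smoothed log-likelihood of the tokens under one class table.

    ASSUMPTION: the paper's actual evaluation value is not specified.
    """
    total = sum(counts.values())
    return sum(math.log((counts[t] + 1) / (total + vocab_size)) for t in tokens)


def classify(tokens, models):
    """Score the document against every class and return the best label."""
    vocab = set()
    for counts in models.values():
        vocab.update(counts)
    scores = {
        label: score(tokens, counts, len(vocab))
        for label, counts in models.items()
    }
    return max(scores, key=scores.get), scores


# Toy, made-up training data: (class label, tokenized document).
training = [
    ("sports", ["match", "team", "goal", "team"]),
    ("finance", ["stock", "market", "bank", "market"]),
]
models = train_class_models(training)
label, scores = classify(["team", "goal"], models)
print(label)
```

For Chinese text, the token lists would come from a word segmenter applied beforehand; that step is outside this sketch.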
