Advanced Search
Volume 33 Issue 1
Feb.  2011
Turn off MathJax
Article Contents
Wang Jun, Li Zhou-Jun, Hu Xia, Hu Bi-Yun. A Novel Composite Kernel and Application to Question Retrieval[J]. Journal of Electronics & Information Technology, 2011, 33(1): 129-135. doi: 10.3724/SP.J.1146.2010.00268
Citation: Wang Jun, Li Zhou-Jun, Hu Xia, Hu Bi-Yun. A Novel Composite Kernel and Application to Question Retrieval[J]. Journal of Electronics & Information Technology, 2011, 33(1): 129-135. doi: 10.3724/SP.J.1146.2010.00268

A Novel Composite Kernel and Application to Question Retrieval

doi: 10.3724/SP.J.1146.2010.00268
  • Received Date: 2010-03-23
  • Rev Recd Date: 2010-07-05
  • Publish Date: 2011-01-19
  • Question retrieval plays important role in question and answering systems. The main problem is how to measure the similarity between candidate questions and query question. This paper presents a tree kernel based method, named weighted tree kernel, to calculate the similarity of sentences structures and proposes improvements to the original tree kernel algorithm. In order to reduce the effect on tree kernel bringing by syntactic parsing, a composite kernel is proposed based on the weighted tree kernel and two other string kernels, which can capture syntax, part-of-speech and lexical level information of a sentence, to calculate the semantic similarity between question sentences. Experimental results on Yahoo!Answers dataset show that the proposed method outperforms traditional vector space model based methods by 24.02% in question retrieval accuacry.
  • loading
  • Burke R D, Hammond K J, and Kulyukin V A, et al.. Question answering from frequently asked question files: experiments with the faq finder system[J]. AI Magazine, 1997, 18(2): 57-66.[2]Jijkoun V and De Rijke M. Retrieving answers from frequently asked questions pages on the web [C]. In CIKM05: Proceedings of the 14th ACM international conference on Information and knowledge management, Bremen, Germany, 2005: 84-90.[3]Cao Xin.[J].Cong Gao, and Cui Bin, et al.. The use of categorization information in language models for question retrieval [C]. In CIKM09: Proceeding of the 18th ACM conference on Information and knowledge management, Hong Kong, China.2009,:-[4]Duan Hui-zhong, Cao Yun-bo, and Lin Chin-yew, et al.. Searching questions by identifying questions topic and question focus [C]. In ACL-08: HLT: Proceeding of the 46th annual meeting of the association for computational linguistics: Human Language Technologies. Columbus, OH, USA, 2008: 156-164.Xue Xiao-bing, Jeon J, and Croft W B. Retrieval models for question and answer archives [C]. In SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, New York, NY, USA, 2008: 475-482.[5]Wang Kai.[J].Ming Zhao-yan, and Chua Tat-seng. A syntactic tree matching approach to finding similar questions in community-based QA services [C]. In SIGIR09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, Boston, MA, USA.2009,:-[6]Collins M and Duffy N. Convolution Kernels for Natural Language [M]. Advances in Neural Information Processing Systems 14, MIT press, 2001: 625-632.[7]Zhao Shu-bin and Grishman R. Extracting relations with integrated information using kernel methods [C]. In ACL05: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, Ann Arbor, Michigan, USA, 2005: 419-426.[8]Lodhih H, Saunders G, and Shawe-Taylor J, et al.. Text classification using string kernels [J].Journal of Machine Learning Research.2002, 2(Feb):419-444[9]Choon Hui-teo and Vishwanathan S V N. Fast and space efficient string kernels using suffix arrays [C]. In ICML06: Proceedings of the 23rd international conference on Machine learning, Pittsburgh, Pennsylvania, USA, 2006: 929-936.[10]Cancedda N, Gaussier E, and Goutte C, et al.. Word sequence kernels [J].The Journal of Machine Learning Research.2003, 3(Feb):1059-1082[11]Joachims T, De Thorsten J, and Cristianini N, et al.. Composite kernels for hypertext categorization. In ICML: International Conference on Machine Learning, Williams College, USA, 2001: 250-257.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (3602) PDF downloads(970) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return