高级搜索

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

一种新的复合核函数及在问句检索中的应用

王君 李舟军 胡侠 胡必云

王君, 李舟军, 胡侠, 胡必云. 一种新的复合核函数及在问句检索中的应用[J]. 电子与信息学报, 2011, 33(1): 129-135. doi: 10.3724/SP.J.1146.2010.00268
引用本文: 王君, 李舟军, 胡侠, 胡必云. 一种新的复合核函数及在问句检索中的应用[J]. 电子与信息学报, 2011, 33(1): 129-135. doi: 10.3724/SP.J.1146.2010.00268
Wang Jun, Li Zhou-Jun, Hu Xia, Hu Bi-Yun. A Novel Composite Kernel and Application to Question Retrieval[J]. Journal of Electronics & Information Technology, 2011, 33(1): 129-135. doi: 10.3724/SP.J.1146.2010.00268
Citation: Wang Jun, Li Zhou-Jun, Hu Xia, Hu Bi-Yun. A Novel Composite Kernel and Application to Question Retrieval[J]. Journal of Electronics & Information Technology, 2011, 33(1): 129-135. doi: 10.3724/SP.J.1146.2010.00268

一种新的复合核函数及在问句检索中的应用

doi: 10.3724/SP.J.1146.2010.00268
基金项目: 

国家973规划项目( 2007CB310803)资助课题

A Novel Composite Kernel and Application to Question Retrieval

  • 摘要: 问句检索在问答系统中有着重要的作用,其核心问题在于研究查询问句与候选问句之间的相似性计算问题,实现问句之间的高精度匹配。该文采用树核函数的方法计算问句之间的结构相似性,并针对原有算法的不足,做了相应的改进。为降低句法解析器性能对树核函数的影响,该文在改进的树核函数基础上,将其与字符串核结合,提出了一种能同时融合问句的句法信息,词性信息和词序信息的复合核函数,用以计算问句之间的综合语义相似性。在社区问答系统Yahoo!Answer的数据上进行测试,相对传统的基于词频的特征向量法,问句检索平均准确率提高了24.02%。
  • Burke R D, Hammond K J, and Kulyukin V A, et al.. Question answering from frequently asked question files: experiments with the faq finder system[J]. AI Magazine, 1997, 18(2): 57-66.[2]Jijkoun V and De Rijke M. Retrieving answers from frequently asked questions pages on the web [C]. In CIKM05: Proceedings of the 14th ACM international conference on Information and knowledge management, Bremen, Germany, 2005: 84-90.[3]Cao Xin.[J].Cong Gao, and Cui Bin, et al.. The use of categorization information in language models for question retrieval [C]. In CIKM09: Proceeding of the 18th ACM conference on Information and knowledge management, Hong Kong, China.2009,:-[4]Duan Hui-zhong, Cao Yun-bo, and Lin Chin-yew, et al.. Searching questions by identifying questions topic and question focus [C]. In ACL-08: HLT: Proceeding of the 46th annual meeting of the association for computational linguistics: Human Language Technologies. Columbus, OH, USA, 2008: 156-164.Xue Xiao-bing, Jeon J, and Croft W B. Retrieval models for question and answer archives [C]. In SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, New York, NY, USA, 2008: 475-482.[5]Wang Kai.[J].Ming Zhao-yan, and Chua Tat-seng. A syntactic tree matching approach to finding similar questions in community-based QA services [C]. In SIGIR09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, Boston, MA, USA.2009,:-[6]Collins M and Duffy N. Convolution Kernels for Natural Language [M]. Advances in Neural Information Processing Systems 14, MIT press, 2001: 625-632.[7]Zhao Shu-bin and Grishman R. Extracting relations with integrated information using kernel methods [C]. In ACL05: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, Ann Arbor, Michigan, USA, 2005: 419-426.[8]Lodhih H, Saunders G, and Shawe-Taylor J, et al.. Text classification using string kernels [J].Journal of Machine Learning Research.2002, 2(Feb):419-444[9]Choon Hui-teo and Vishwanathan S V N. Fast and space efficient string kernels using suffix arrays [C]. In ICML06: Proceedings of the 23rd international conference on Machine learning, Pittsburgh, Pennsylvania, USA, 2006: 929-936.[10]Cancedda N, Gaussier E, and Goutte C, et al.. Word sequence kernels [J].The Journal of Machine Learning Research.2003, 3(Feb):1059-1082[11]Joachims T, De Thorsten J, and Cristianini N, et al.. Composite kernels for hypertext categorization. In ICML: International Conference on Machine Learning, Williams College, USA, 2001: 250-257.
  • 加载中
计量
  • 文章访问数:  3602
  • HTML全文浏览量:  100
  • PDF下载量:  970
  • 被引次数: 0
出版历程
  • 收稿日期:  2010-03-23
  • 修回日期:  2010-07-05
  • 刊出日期:  2011-01-19

目录

    /

    返回文章
    返回