Volume 30 Issue 3
Dec.  2010
Li Wei-jiang, Zhao Tie-jun, Wang Xian-gang. A SMT-based Approach for Query Expansion in Information Retrieval[J]. Journal of Electronics & Information Technology, 2008, 30(3): 725-729. doi: 10.3724/SP.J.1146.2006.01382

A SMT-based Approach for Query Expansion in Information Retrieval

doi: 10.3724/SP.J.1146.2006.01382
  • Received Date: 2006-09-26
  • Rev Recd Date: 2007-01-26
  • Publish Date: 2008-03-19
  • In practical information retrieval applications such as search engines, the query a user submits usually contains only a few keywords. This causes a vocabulary mismatch between the words in relevant documents and those in the user's query, which seriously degrades retrieval performance. Based on an analysis of how queries are generated, this paper proposes a new query expansion method built on a statistical machine translation (SMT) model. The approach uses the SMT model to extract terms that relate documents to queries, then adds those terms to the query. Experiments on TREC data collections show that the proposed SMT-based query expansion consistently improves on the language model method without expansion by 12% to 17%. Compared with pseudo feedback, a popular query expansion approach, the proposed method achieves competitive average precision.
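The abstract does not give the details of the translation model, so the following is only a minimal sketch of the general idea, under stated assumptions: it trains an IBM Model 1 style lexical translation table with EM on (query, document) term pairs, taking the query as the source side, and then appends the highest-scoring document terms to the query. The function names `train_model1` and `expand_query`, the training direction, the absence of a NULL word, and the averaging used to score candidates are all illustrative choices, not the authors' method.

```python
from collections import defaultdict


def train_model1(pairs, iterations=10):
    """EM for an IBM Model 1 style table t[q][w], approximating P(w | q).

    pairs: list of (query_terms, doc_terms), each a list of strings.
    Assumption: the query is the source side and no NULL word is used.
    """
    # Initialise uniformly over the document terms each query term co-occurs with.
    t = defaultdict(dict)
    for query, doc in pairs:
        for q in query:
            for w in doc:
                t[q][w] = 1.0
    for q in t:
        n = len(t[q])
        for w in t[q]:
            t[q][w] = 1.0 / n

    for _ in range(iterations):
        count = defaultdict(lambda: defaultdict(float))
        total = defaultdict(float)
        # E-step: split each document term's unit of mass across the query terms.
        for query, doc in pairs:
            for w in doc:
                z = sum(t[q][w] for q in query)
                for q in query:
                    c = t[q][w] / z
                    count[q][w] += c
                    total[q] += c
        # M-step: renormalise the expected counts for each query term.
        for q in count:
            for w in count[q]:
                t[q][w] = count[q][w] / total[q]
    return t


def expand_query(query, t, k=3):
    """Append the k terms w with the highest average t[q][w] over the query."""
    scores = defaultdict(float)
    for q in query:
        for w, p in t.get(q, {}).items():
            if w not in query:
                scores[w] += p / len(query)
    # Sort by descending score, breaking ties alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return list(query) + [w for w, _ in ranked[:k]]
```

In a real system the (query, document) pairs would have to come from some relevance signal, and the expanded query would then be scored by the retrieval model; both steps are outside this sketch.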


