Fan Dong-Mei, Lu Zhi-Mao, Zhang Ru-Bo, Pan Shu-Shen. Chinese Word Sense Disambiguation Based on Bayesian Model Improved by Information Gain[J]. Journal of Electronics & Information Technology, 2008, 30(12): 2926-2929. doi: 10.3724/SP.J.1146.2007.00868
Citation:
Fan Dong-Mei, Lu Zhi-Mao, Zhang Ru-Bo, Pan Shu-Shen. Chinese Word Sense Disambiguation Based on Bayesian Model Improved by Information Gain[J]. Journal of Electronics & Information Technology, 2008, 30(12): 2926-2929. doi: 10.3724/SP.J.1146.2007.00868
Fan Dong-Mei, Lu Zhi-Mao, Zhang Ru-Bo, Pan Shu-Shen. Chinese Word Sense Disambiguation Based on Bayesian Model Improved by Information Gain[J]. Journal of Electronics & Information Technology, 2008, 30(12): 2926-2929. doi: 10.3724/SP.J.1146.2007.00868
Citation:
Fan Dong-Mei, Lu Zhi-Mao, Zhang Ru-Bo, Pan Shu-Shen. Chinese Word Sense Disambiguation Based on Bayesian Model Improved by Information Gain[J]. Journal of Electronics & Information Technology, 2008, 30(12): 2926-2929. doi: 10.3724/SP.J.1146.2007.00868
Word Sense Disambiguation (WSD) is one of the key issues and difficulties in natural language processing. WSD is usually considered as an issue about pattern classification to study, which feature selection, is an important component. In this paper, according to Nave Bayesian Model (NBM) assumption, a feature selection method based on information gain is proposed to improve NBM. Location information concealed in the context of ambiguous word is mined through information gain, to improve the knowledge acquisition efficiency of Bayesian model, thereby improving the word-sense classification. The eight ambiguous words are tested in the experiment. The experimental results show that improved Bayesian model is more correct than the NBM an average of 3.5 percentage points. The accuracy rise is bigger and the improvement effect is outstanding. These results prove also the method put forward in this paper is efficacious.