A collocation-based WSD model: RFR-SUM

Qu Weiguang<sup>*</sup>; Sui Zhifang; Ji Genlin; Yu Shiwen; Zhou Junsheng

摘要

In this paper, the concept of Relative Frequency Ratio (RFR) is presented to evaluate the strength of collocation. Based on RFR, a WSD Model RFR-SUM is put forward to disambiguate polysemous Chinese word sense. It selects 9 frequently used polysemous words as examples, and achieves the average precision up to 92.50% in open test. It has compared the model with Naive Bayesian Model and Maximum Entropy Model. The results show that the precision by RFR-SUM Model is 5.95% and 4.48% higher than that of Na:ive Bayesian Model and Maximum Entropy Model respectively. It also tries to prune RFR lists. The results reveal that leaving only 5% important collocation information can keep almost the same precision. At the same time, the speed is 20 times higher.

出版日期2007
单位北京大学

收藏分享被引(1) 浏览

更新时间：2018-08-02 19:37

A collocation-based WSD model: RFR-SUM

摘要

产品服务

站内浏览

服务支持

联系方式

科研之友