Discriminative reranking approach to spelling correction

Zhang Yang<sup>*</sup>; He Pi Lian; Xiang Wei; Li Mu

doi:10.3724/SP.J.1001.2008.00557

摘要

This paper proposes an approach to spelling correction. It reranks the output of an existing spelling corrector, Aspell. A discriminative model (Ranking SVM) is employed to improve upon the initial ranking, using additional features as evidence. These features are derived from state-of-the-art techniques in spelling correction, including edit distance, letter-based n-gram, phonetic similarity and noisy channel model. This paper also presents a method to automatically extract training samples from the query log chain. The system outperforms the baseline Aspell greatly, as well as the previous models and several off-the-shelf systems (e.g. spelling corrector in Microsoft Word 2003). The experimental results based on query chain pairs are comparable to that based on manually-annotated pairs, with 32.2%/32.6% reduction in error rate, respectively.

出版日期2008
单位天津大学

全文

访问全文

收藏分享被引浏览

更新时间：2018-08-03 07:55

Discriminative reranking approach to spelling correction

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友