An ontology-based semantic clustering algorithm for accounting text

Jiang Yanhui<sup>*</sup>; Li Mo; Yao Kaohua

摘要

The feature selection and semantic similarity computing between texts are essential components of accounting text clustering. In the past, several approaches for generic text feature selection and similarity computing by exploiting different measures (vector space model, words frequency, thesauri, domain corpora, etc.) have been proposed. However, accounting field is different from general field. Accounting has its own concepts and rules. These generic methods are not so suitable for accounting text clustering. In this paper, a novel accounting ontology-based feature selection and similarity computing algorithm for accounting text is proposed. Firstly, characterizing the accounting texts, we get a terms vector. Secondly, terms vector is mapped into concept of accounting ontology and converted into concept vector. Based on the structure of concept, the semantic similarity between texts is computed. Then, trough an improved clustering method, accounting texts are clustered effectively. The experiments results imply that our proposal outperforms most of the previous measures as well as eliminates some of their limitations.

出版日期2013

全文

访问全文

收藏分享被引浏览

更新时间：2018-08-03 07:55

An ontology-based semantic clustering algorithm for accounting text

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友