A NOVEL KERNEL FOR TEXT CLASSIFICATION BASED ON SEMANTIC AND STATISTICAL INFORMATION

Yao, Haipeng<sup>*</sup>; Zhang, Bo; Zhang, Peiying; Li, Maozhen

doi:10.4149/cai_2018_4_992

摘要

In text categorization, a document is usually represented by a vector space model which can accomplish the classification task, but the model cannot deal with Chinese synonyms and polysemy phenomenon. This paper presents a novel approach which takes into account both the semantic and statistical information to improve the accuracy of text classification. The proposed approach computes semantic information based on HowNet and statistical information based on a kernel function with class-based weighting. According to our experimental results, the proposed approach could achieve state-of-the-art or competitive results as compared with traditional approaches such as the k-Nearest Neighbor (KNN), the Naive Bayes and deep learning models like convolutional networks.

出版日期2018
单位北京邮电大学

全文

访问全文

收藏分享被引(2) 浏览

更新时间：2023-11-16 12:04

A NOVEL KERNEL FOR TEXT CLASSIFICATION BASED ON SEMANTIC AND STATISTICAL INFORMATION

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友