Multiclass Boosting with Adaptive Group-Based kNN and Its Application in Text Categorization

La, Lei<sup>*</sup>; Guo, Qiao; Yang, Dequan; Cao, Qimin

doi:10.1155/2012/793490

摘要

AdaBoost is an excellent committee-based tool for classification. However, its effectiveness and efficiency in multiclass categorization face the challenges from methods based on support vector machine (SVM), neural networks (NN), naive Bayes, and k-nearest neighbor (kNN). This paper uses a novel multi-class AdaBoost algorithm to avoid reducing the multi-class classification problem to multiple two-class classification problems. This novel method is more effective. In addition, it keeps the accuracy advantage of existing AdaBoost. An adaptive group-based kNN method is proposed in this paper to build more accurate weak classifiers and in this way control the number of basis classifiers in an acceptable range. To further enhance the performance, weak classifiers are combined into a strong classifier through a double iterative weighted way and construct an adaptive group-based kNN boosting algorithm (AGkNN-AdaBoost). We implement AGkNN-AdaBoost in a Chinese text categorization system. Experimental results showed that the classification algorithm proposed in this paper has better performance both in precision and recall than many other text categorization methods including traditional AdaBoost. In addition, the processing speed is significantly enhanced than original AdaBoost and many other classic categorization algorithms.

出版日期2012
单位北京理工大学

全文

访问全文

收藏分享被引(11) 浏览

更新时间：2024-04-07 21:36

Multiclass Boosting with Adaptive Group-Based kNN and Its Application in Text Categorization

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友