摘要
Text classification is the foundation and core of text mining. Naive Bayes is an effective method for text classification. This paper improves the accuracy of Naive Bayes classification using improved information gain,one of methods of feature extraction, by reducing the impact of low -frequency words. In this paper, we use a widely corpus of NLTK. According to the test results, The accuracy of the classification improved significantly.
- 出版日期2015
- 单位中国传媒大学