A Text Classifier of English Movie Reviews Based on Information Gain

作者:Jin Lianjing*; Gong Wei; Fu Wenlong; Wu Hongbin
来源:3rd International Conference on Applied Computing and Information Technology (ACIT 2015) 2nd International Conference on Computational Science and Intelligence (CSI 2015), 2015-07-12 to 2015-07-16.
DOI:10.1109/ACIT-CSI.2015.86

摘要

Text classification is the foundation and core of text mining. Naive Bayes is an effective method for text classification. This paper improves the accuracy of Naive Bayes classification using improved information gain,one of methods of feature extraction, by reducing the impact of low -frequency words. In this paper, we use a widely corpus of NLTK. According to the test results, The accuracy of the classification improved significantly.

  • 出版日期2015
  • 单位中国传媒大学

全文