A novel classification approach based on Naive Bayes for Twitter sentiment analysis

作者:Song Junseok; Kim Kyung Tae; Lee Byungjun; Kim Sangyoung; Youn Hee Yong*
来源:KSII Transactions on Internet and Information Systems, 2017, 11(6): 2996-3011.
DOI:10.3837/tiis.2017.06.011

摘要

With rapid growth of web technology and dissemination of smart devices, social networking service(SNS) is widely used. As a result, huge amount of data are generated from SNS such as Twitter, and sentiment analysis of SNS data is very important for various applications and services. In the existing sentiment analysis based on the Naive Bayes algorithm, a same number of attributes is usually employed to estimate the weight of each class. Moreover, uncountable and meaningless attributes are included. This results in decreased accuracy of sentiment analysis. In this paper two methods are proposed to resolve these issues, which reflect the difference of the number of positive words and negative words in calculating the weights, and eliminate insignificant words in the feature selection step using Multinomial Naive Bayes(MNB) algorithm. Performance comparison demonstrates that the proposed scheme significantly increases the accuracy compared to the existing Multivariate Bernoulli Naive Bayes(BNB) algorithm and MNB scheme.

  • 出版日期2017-6-30