摘要

This paper presents an empirical study of machine learn techniques to text categorization. Specifically aim to classify news about coffee market according with categories from coffee supply chain. The objective is to measure the performance of three types of algorithms: Naive Bayes based, Tree based and Support Vector Machine (SVM). A database with news collected from web and labeled by human expert analysts is used in a learning phase. Then automatic classify news extracted from web following the same steps and terms as human according to their relevance for each learned category. The test in a real database shows a better performance by Naive Bayes based Algorithms for this specific case.

  • 出版日期2015-7

全文