Applying computational intelligence methods for predicting the sales of newly published books in a real editorial business management environment

作者:Castillo Pedro A*; Mora Antonio M; Faris Hossam; Merelo J J; Garcia Sanchez Pablo; Fernandez Ares Antonio J; De las Cuevas Paloma; Garcia Arenas Maria I
来源:Knowledge-Based Systems, 2017, 115: 133-151.
DOI:10.1016/j.knosys.2016.10.019

摘要

When a new book is launched the publisher faces the problem of how many books should be printed for delivery to bookstores; printing too many is the main issue, since it implies a loss of investment due to inventory excess, but printing too few will also have a negative economic impact. In this paper, we are tackling the problem of predicting total sales in order to print the right amount of books and doing so even before the book has reached the stores. A real dataset including the complete sales data for books published in Spain across several years has been used. We have conducted an analysis in three stages: an initial exploratory analysis, by means of data visualisation techniques; a feature selection process, using different techniques to find out what are the variables that have more impact on sales; and a regression or prediction stage, in which a set of machine learning methods has been applied to create forecasting models for book sales. The obtained models are able to predict sales from pre-publication data with remarkable accuracy, and can be visualised as simple decision trees. Thus, these can be used as decision aid tools for publishers, which can provide a reliable guidance on the decision process of publishing a book. This is also shown in the paper by addressing four example cases of representative publishers, regarding their number of sales and the number of different books they sell.

  • 出版日期2017-1-1