Training subset selection in Hourly Ontario Energy Price forecasting using time series clustering-based stratification

Lopez Karol Lina; Gagne Christian<sup>*</sup>; Castellanos Dominguez German; Orozco Alzate Mauricio

doi:10.1016/j.neucom.2014.12.052

摘要

Training a given learning-based forecasting method to a satisfactory level of performance often requires a large dataset. Indeed, any data-driven methods require having examples that are providing a satisfactory representation of what we wish to model to work properly. This often implies using large datasets to be sure that the phenomenon of interest is properly sampled. However, learning from time series composed of too many samples can also be a problem, given that the computational requirements of the learning algorithms can easily grow following a polynomial complexity according to the training set size. In order to identify representative examples of a dataset, we are proposing a methodology using clustering-based stratification of time series to select a training data subset. The principle for constructing a representative sample set using this method consists in selecting heterogeneous instances picked from all the various clusters composing the dataset. Results obtained show that with a small number of training examples, obtained through the proposed clustering-based stratification, we can preserve the performance and improve the stability of models such as artificial neural networks and support vector regression, while training at a much lower computational cost. We illustrate the methodology through forecasting the one-step ahead Hourly Ontario Energy Price (HOEP).

出版日期2015-5-25

全文

访问全文

收藏分享被引(4) 浏览

更新时间：2021-04-13 07:22

Training subset selection in Hourly Ontario Energy Price forecasting using time series clustering-based stratification

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友