Predictive analytics of crude oil prices by utilizing the intelligent model search engine

Bekiroglu K., Duru O., Gulay E., Su R., Lagoa C.

APPLIED ENERGY, vol.228, pp.2387-2397, 2018 (SCI-Expanded) identifier

  • Publication Type: Article / Article
  • Volume: 228
  • Publication Date: 2018
  • Doi Number: 10.1016/j.apenergy.2018.07.071
  • Journal Name: APPLIED ENERGY
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Page Numbers: pp.2387-2397
  • Keywords: Predictive analytics, Oil prices, Model mining, Time series forecasting, UNIVARIATE TIME-SERIES, ACCURACY, MACROECONOMY, MINIMIZATION, MOVEMENTS, FORECASTS, SYSTEMS, ARIMA
  • Dokuz Eylül University Affiliated: Yes


This paper proposes an intelligent model search engine (IMSE), an integrated model selection algorithm, subject to the out of sample predictive performance and given set of explanatory variables for forecasting crude oil prices. In the conventional applications of energy price forecasting, models are selected based on preliminary assumptions on causality and model structure (e.g. lag length in lagged variables). Relaxation of those assumptions would cause over-fitting and reduce the degree of freedom. Considering the ultimate objective of forecasting models, any variations of models may be tested in the out-of-sample period, and the optimization problem can be redefined as minimization of post-sample error metric in a validation set. By this, data mining would be a legitimate operation for economic forecasting, and it also proves required conditions usually tested by diagnostic tests such as Akaike Information Criterion for model quality. IMSE is a multi-input/single output difference equation based approach which allows users to test various models (for given set of explanatory variables) as well as various order of lagged inputs (lag length) without a priori assumption or theoretical basis except defining set of potential inputs. Finally, it selects the best model subject to predictive accuracy in a validation set. Empirical results indicated that the proposed algorithm significantly outperformed a broad range of benchmark methodologies as well as proving that certain assumptions of econometric approach (e.g. statistical significance of explanatory variables) are independent of predictive performance.