Multistep Ahead Forecasting of WTI Crude Oil Prices Using Time Series and Machine Learning Models

Main Article Content

Muhammad Shafiq, Mohammad Abiad, Noor Zali Khan, Ihtisham ul Haq

Abstract

Crude oil is a critical energy source and a raw material for various products, including fuels (gasoline, diesel, and jet fuel), lubricants, petrochemicals, and asphalt. It undergoes refining processes to separate and convert it into usable products. Multiple factors, including economic indicators, geopolitical events, supply and demand dynamics, and technological advancements influence crude oil prices. Its global market has prices often benchmarked to grades like Brent Crude and West Texas Intermediate (WTI). In this article, the daily WTI crude oil prices are multi-step ahead forecasted using four different methods which are autoregressive integrated moving average (ARIMA), random forest (RF), k-nearest neighbors (kNN), and autoregressive neural networks (ARNN). The data set consists of daily WTI crude oil prices from February 13, 2014, to February 13, 2024, divided into 80% training set and 20% validation set. The performance of the methods is evaluated by one-step and seven-step ahead forecasting using root mean square error (RMSE) and mean absolute percentage error (MAPE) as the accuracy measures. The results show that ARIMA outperforms the methods for one and seven-step ahead forecasting, followed by ARNN, kNN, and RF. The study provides useful insights for investors and policymakers in the oil market.

Article Details

References

  1. D. Langevin, S. Poteau, I. Hénaut, J.F. Argillier, Crude Oil Emulsion Properties and Their Application to Heavy Oil Transportation, Oil Gas Sci. Technol. 59 (2004), 511–521. https://doi.org/10.2516/ogst:2004036.
  2. K. Lang, B.R. Auer, The Economic and Financial Properties of Crude Oil: A Review, N. Am. J. Econ. Financ. 52 (2020), 100914. https://doi.org/10.1016/j.najef.2019.01.011.
  3. M. Monge, L.A. Gil-Alana, F. Pérez de Gracia, Crude Oil Price Behaviour Before and After Military Conflicts and Geopolitical Events, Energy 120 (2017), 79–91. https://doi.org/10.1016/j.energy.2016.12.102.
  4. J. Zhao, Exploring the Influence of the Main Factors on the Crude Oil Price Volatility: An Analysis Based on Garch-Midas Model with Lasso Approach, Resour. Policy 79 (2022), 103031. https://doi.org/10.1016/j.resourpol.2022.103031.
  5. Z. Liu, Z. Ding, T. Lv, J.S. Wu, W. Qiang, Financial Factors Affecting Oil Price Change and Oil-Stock Interactions: A Review and Future Perspectives, Nat. Hazards 95 (2018), 207–225. https://doi.org/10.1007/s11069-018-3473-y.
  6. J. Nasir, M. Aamir, Z.U. Haq, S. Khan, M.Y. Amin, et al., A New Approach for Forecasting Crude Oil Prices Based on Stochastic and Deterministic Influences of Lmd Using Arima and Lstm Models, IEEE Access 11 (2023), 14322–14339. https://doi.org/10.1109/access.2023.3243232.
  7. M. Hasan, M.Z. Abedin, P. Hajek, K. Coussement, M.N. Sultan, et al., A Blending Ensemble Learning Model for Crude Oil Price Forecasting, Ann. Oper. Res. (2024). https://doi.org/10.1007/s10479-023-05810-8.
  8. K. Zhang, M. Hong, Forecasting Crude Oil Price Using Lstm Neural Networks, Data Sci. Financ. Econ. 2 (2022), 163–180. https://doi.org/10.3934/dsfe.2022008.
  9. L. Gasper, H. Mbwambo, Forecasting Crude Oil Prices by Using Arima Model: Evidence from Tanzania, J. Account. Financ. Audit. Stud. 9 (2023), 158–175. https://doi.org/10.32602/jafas.2023.017.
  10. J. Selvi, R.K. Shree, J. Krishnan, Forecasting Crude Oil Price Using ARIMA Models, Int. J. Adv. Res. 7 (2018), 334–343.
  11. V.P. Ariyanti, Tristyanti Yusnitasari, Comparison of Arima and Sarima for Forecasting Crude Oil Prices, J. RESTI 7 (2023), 405–413. https://doi.org/10.29207/resti.v7i2.4895.
  12. A. Almeida, A.A. Golpe, J.M. Martín-Alvarez, J.C. Vides, Brent Vs. West Texas Intermediate in the Us Petro Derivatives Price Formation, Pet. Sci. 21 (2024), 729–739. https://doi.org/10.1016/j.petsci.2023.09.007.
  13. Y. Yang, T. Yang, S. Chen, C. Tong, Exploring the Non-Linearity of West Texas Intermediate Crude Oil Price from Exchange Rate of Us Dollar and West Texas Intermediate Crude Oil Production, Energy Strat. Rev. 41 (2022), 100854. https://doi.org/10.1016/j.esr.2022.100854.
  14. R.H. Shumway, D.S. Stoffer, R.H. Shumway, D.S. Stoffer, ARIMA Models, in: Time Series Analysis and Its Applications: With R Examples, pp. 75–163, (2017).
  15. R.A. Ahmed, A.B. Shabri, Daily Crude Oil Price Forecasting Model Using ARIMA, Generalized Autoregressive Conditional Heteroscedastic and Support Vector Machines, Am. J. Appl. Sci. 11 (2014), 425–432. https://doi.org/10.3844/ajassp.2014.425.432.
  16. L. Breiman, Random Forests, Mach. Learn. 45 (2001), 5–32. https://doi.org/10.1023/a:1010933404324.
  17. S.J. Rigatti, Random Forest, J. Insur. Med. 47 (2017), 31–39.
  18. A. Tealab, H. Hefny, A. Badr, Forecasting of Nonlinear Time Series Using Ann, Futur. Comput. Inform. J. 2 (2017), 39–47. https://doi.org/10.1016/j.fcij.2017.05.001.
  19. M. Adil, R. Ullah, S. Noor, N. Gohar, Effect of Number of Neurons and Layers in an Artificial Neural Network for Generalized Concrete Mix Design, Neural Comput. Appl. 34 (2020), 8355–8363. https://doi.org/10.1007/s00521-020-05305-8.
  20. A. Shabri, R. Samsudin, Daily Crude Oil Price Forecasting Using Hybridizing Wavelet and Artificial Neural Network Model, Math. Probl. Eng. 2014 (2014), 201402. https://doi.org/10.1155/2014/201402.
  21. S. Sharma, S. Sharma, A. Athaiya, Activation Functions in Neural Networks, Int. J. Eng. Appl. Sci. Technol. 04 (2020), 310–316. https://doi.org/10.33564/ijeast.2020.v04i12.054.
  22. F. Martínez, M.P. Frías, M.D. Pérez, A.J. Rivera, A Methodology for Applying K-Nearest Neighbor to Time Series Forecasting, Artif. Intell. Rev. 52 (2017), 2019–2037. https://doi.org/10.1007/s10462-017-9593-z.
  23. F.H. Al-Qahtani, S.F. Crone, Multivariate K-Nearest Neighbour Regression for Time Series Data – A Novel Algorithm for Forecasting Uk Electricity Demand, in: The 2013 International Joint Conference on Neural Networks (IJCNN), IEEE, 2013, pp. 1-8. https://doi.org/10.1109/ijcnn.2013.6706742.
  24. Y. Lee, C. Wei, T. Cheng, C. Yang, Nearest-neighbor-based Approach to Time-Series Classification, Decis. Support Syst. 53 (2012), 207–217. https://doi.org/10.1016/j.dss.2011.12.014.
  25. B.K. Nelson, Time Series Analysis Using Autoregressive Integrated Moving Average (ARIMA) Models, Acad. Emerg. Med. 5 (1998), 739–744. https://doi.org/10.1111/j.1553-2712.1998.tb02493.x.
  26. R. Hossain, D. Timmer, Machine Learning Model Optimization with Hyper Parameter Tuning Approach, Glob. J. Comput. Sci. Technol. D Neural Artif. Intell 21 (2021), 6–13.
  27. A.C. Petrica, S. Stancu, A. Tindeche, Limitation of Arima Models in Financial and Monetary Economics, Theor. Appl. Econ. 23 (2016), 19–42.