Forecasting time trends of fatal motor vehicle crashes in Iran using an ensemble learning algorithm

Pages 44-49 | Received 16 Mar 2022, Accepted 23 Sep 2022, Published online: 24 Oct 2022
 

Abstract

Objective

This study aimed to introduce the random forest (RF) method as a tool for short-term crash frequency prediction and to compare its forecast accuracy with that of the classical seasonal autoregressive integrated moving average (SARIMA) model in the multivariate time-series analysis of crash counts.

Methods

Fatal crashes reported by the police and intercity traffic flow recorded by loop detectors were aggregated monthly at the national level for intercity highways, from Farvardin 1395 to Mordad 1400 (65 months in the Iranian calendar). The first 55 months were used as the training sample, and the remaining 10 months were used as the test sample. The Box-Jenkins and random forest machine learning methods were employed for short-term crash frequency prediction. The mean absolute percentage error (MAPE) was used to compare the forecast accuracy of the developed models.
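The evaluation setup described above can be sketched in a few lines. This is a minimal illustration, not the authors' code: the function names `chronological_split` and `mape` are hypothetical, the monthly counts are placeholder numbers rather than the study's data, and the naive last-value forecast only stands in for a real model.

```python
def chronological_split(series, n_train):
    """Split a time-ordered sequence into train/test parts without shuffling."""
    if not 0 < n_train < len(series):
        raise ValueError("n_train must be between 1 and len(series) - 1")
    return series[:n_train], series[n_train:]


def mape(actual, forecast):
    """Mean absolute percentage error, in percent."""
    if len(actual) != len(forecast):
        raise ValueError("actual and forecast must have the same length")
    if any(a == 0 for a in actual):
        raise ValueError("MAPE is undefined when an actual value is zero")
    total = sum(abs((a - f) / a) for a, f in zip(actual, forecast))
    return 100.0 * total / len(actual)


# Placeholder series of 65 monthly counts (NOT the study's data).
monthly_counts = [100 + (i % 12) for i in range(65)]

# First 55 months for training, remaining 10 for testing, as in the abstract.
train, test = chronological_split(monthly_counts, 55)

# Stand-in forecast: repeat the last training value (an assumption for
# illustration only; the study fits SARIMA and random forest models).
naive_forecast = [train[-1]] * len(test)
naive_error = mape(test, naive_forecast)
```

Keeping the split chronological matters for time series: shuffling before splitting would let the model see months that come after the ones it is asked to forecast.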

Results

The random forest model with traffic flow, crash year, and month as exogenous variables (MAPE = 2.6) outperformed the best SARIMA(1,0,0)(1,0,0) model with a seasonal period of 12 and traffic flow as the regressor (MAPE = 5.7).
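For readers unfamiliar with the notation, a SARIMA(1,0,0)(1,0,0) model with seasonal period 12 and a single regressor can be written as follows. This assumes the common regression-with-SARIMA-errors parameterization, which the abstract does not confirm; the symbols (y_t for the monthly crash count, x_t for traffic flow, B for the backshift operator) are introduced here for illustration only.

```latex
\[
y_t = \beta_0 + \beta_1 x_t + \eta_t, \qquad
(1 - \phi_1 B)\,(1 - \Phi_1 B^{12})\,\eta_t = \varepsilon_t,
\]
\[
\text{where } B^{k}\eta_t = \eta_{t-k}
\text{ and } \varepsilon_t \text{ is white noise.}
\]
```

Expanding the product gives \(\eta_t = \phi_1 \eta_{t-1} + \Phi_1 \eta_{t-12} - \phi_1 \Phi_1 \eta_{t-13} + \varepsilon_t\), so the error in a given month depends on the previous month, the same month one year earlier, and their interaction.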

Conclusions

This study suggests that random forest, an ensemble learning algorithm, is a better crash prediction tool than the classical Box-Jenkins method because it accounts for non-linear dependencies in crash count time series. The results also show that the multivariate SARIMA (SARIMAX) model significantly outperforms its univariate counterpart because it accounts for the simultaneous effects of exogenous variables.

Disclosure statement

There are no potential influences that may undermine the objectivity and integrity of the publication.

Data availability

The datasets generated and analyzed during the current study are available from the corresponding author on reasonable request.

Additional information

Funding

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
