Abstract
Traffic accidents in the Middle East are a primary concern for governments and local communities owing to the large numbers of fatalities, injuries and economic losses. Many analytical methods have been used in the literature to analyze the accidents database. One of the recent methods in this domain is the data-mining techniques. In this paper, we evaluate the performance of a well-known data mining technique called Random Forests (RF) in predicting the severity of road accidents based on 5973 accidents occurred in Abu Dhabi over a period of 6 years (2008–2013). The factors studied in this paper include: five accident-related attributes (year, day, time, reason of accident, and accident type), six driver-related attributes (gender, nationality, age, seat belt use, casualty status, degree of injury), and five road-related attributes (lighting, road surface, speed limit, lane numbers, and weather). The severity of the accident was classified into one of four classes (Minor, Moderate, Severe, and Death). RF was then used to build a prediction model using 10-fold cross validation method. The overall model predication performance was 68.5%. The generated model was found to perform poorly on the underrepresented classes (Death and Severe). As a result, the original data was transformed into a balanced data set using Minority Oversampling Technique (SMOTE). The performance of RF on the balanced data was 78.19% with 14% improvement. In order to validate the performance of the RF model, an ordered probit model was also used as a comparative benchmark. The accuracy of the ordered probit model was 59.5%, and 34% for the original and balanced data sets respectively. It was obvious that RF technique outperforms the ordered probit method in predicting the severity or road traffic accidents.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Baluni, P., Raiwani, Y.P.: Vehicular accident analysis using neural network. Int. J. Emerg. Technol. Adv. Eng. 4(9), 161–164 (2014)
Shankar, V., Mannering, F., Barfield, W.: Statistical analysis of accident severity on rural freeways. Accid. Anal. Prev. 28(3), 391–401 (1996)
Alkheder, S., Taamneh, M., Taamneh, S.: Severity prediction of traffic accident using an artificial neural network. J. Forecasting 36(1), 100–108 (2017)
Taamneh, M., Taamneh, S., Alkheder, S.: Clustering-based classification of road traffic accidents using hierarchical clustering and artificial neural networks. Int. J. Inj. Control Saf. Promot. 24(3), 388–395 (2017)
Chang, L.-Y., Wang, H.-W.: Analysis of traffic injury severity: An application of non-parametric classification tree techniques. Accid. Anal. Prev. 38(5), 1019–1027 (2006)
Taamneh, M., Alkheder, S., Taamneh, S.: Data-mining techniques for traffic accident modeling and prediction in the United Arab Emirates. J. Transp. Saf. Secur. 9(2), 146–166 (2017)
Kashani, A.T., Mohaymany, A.S.: Analysis of the traffic injury severity on two-lane, two-way rural roads based on classification tree models. Saf. Sci. 49(10), 1314–1320 (2011)
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Bramer, M.: Avoiding overfitting of decision trees. In: Principles of Data Mining. Springer, London, pp. 121–136 (2013)
Breiman, L.: Manual on setting up, using, and understanding random forests v3. 1. Statistics Department, University of California Berkeley, CA, USA 1 (2002)
Liaw, A., Wiener, M.: Classification and regression by randomForest. R News 2(3), 18–22 (2002)
Krishnaveni, S., Hemalatha, M.: A perspective analysis of traffic accident using data mining techniques. Int. J. Comput. Appl. 23(7), 40–48 (2011)
Das, A., Abdel-Aty, M., Pande, A.: Using conditional inference forests to identify the factors affecting crash severity on arterial corridors. J. Saf. Res. 40(4), 317–327 (2009)
Haleem, K., Gan, A.: Effect of driver’s age and side of impact on crash severity along urban freeways: A mixed logit approach. J. Saf. Res. 46, 67–76 (2013)
Harb, R., et al.: Exploring precrash maneuvers using classification trees and random forests. Accid. Anal. Prev. 41(1), 98–107 (2009)
Lee, J., Nam, B., Abdel-Aty, M.: Effects of pavement surface conditions on traffic crash severity. J. Transp. Eng. 141(10), 04015020 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Taamneh, S., Taamneh, M. (2019). Evaluation of the Performance of Random Forests Technique in Predicting the Severity of Road Traffic Accidents. In: Stanton, N. (eds) Advances in Human Aspects of Transportation. AHFE 2018. Advances in Intelligent Systems and Computing, vol 786. Springer, Cham. https://doi.org/10.1007/978-3-319-93885-1_78
Download citation
DOI: https://doi.org/10.1007/978-3-319-93885-1_78
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-93884-4
Online ISBN: 978-3-319-93885-1
eBook Packages: EngineeringEngineering (R0)