Skip to main content

Evaluation of the Performance of Random Forests Technique in Predicting the Severity of Road Traffic Accidents

  • Conference paper
  • First Online:
Advances in Human Aspects of Transportation (AHFE 2018)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 786))

Included in the following conference series:

Abstract

Traffic accidents in the Middle East are a primary concern for governments and local communities owing to the large numbers of fatalities, injuries and economic losses. Many analytical methods have been used in the literature to analyze the accidents database. One of the recent methods in this domain is the data-mining techniques. In this paper, we evaluate the performance of a well-known data mining technique called Random Forests (RF) in predicting the severity of road accidents based on 5973 accidents occurred in Abu Dhabi over a period of 6 years (2008–2013). The factors studied in this paper include: five accident-related attributes (year, day, time, reason of accident, and accident type), six driver-related attributes (gender, nationality, age, seat belt use, casualty status, degree of injury), and five road-related attributes (lighting, road surface, speed limit, lane numbers, and weather). The severity of the accident was classified into one of four classes (Minor, Moderate, Severe, and Death). RF was then used to build a prediction model using 10-fold cross validation method. The overall model predication performance was 68.5%. The generated model was found to perform poorly on the underrepresented classes (Death and Severe). As a result, the original data was transformed into a balanced data set using Minority Oversampling Technique (SMOTE). The performance of RF on the balanced data was 78.19% with 14% improvement. In order to validate the performance of the RF model, an ordered probit model was also used as a comparative benchmark. The accuracy of the ordered probit model was 59.5%, and 34% for the original and balanced data sets respectively. It was obvious that RF technique outperforms the ordered probit method in predicting the severity or road traffic accidents.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Baluni, P., Raiwani, Y.P.: Vehicular accident analysis using neural network. Int. J. Emerg. Technol. Adv. Eng. 4(9), 161–164 (2014)

    Google Scholar 

  2. Shankar, V., Mannering, F., Barfield, W.: Statistical analysis of accident severity on rural freeways. Accid. Anal. Prev. 28(3), 391–401 (1996)

    Article  Google Scholar 

  3. Alkheder, S., Taamneh, M., Taamneh, S.: Severity prediction of traffic accident using an artificial neural network. J. Forecasting 36(1), 100–108 (2017)

    Article  MathSciNet  Google Scholar 

  4. Taamneh, M., Taamneh, S., Alkheder, S.: Clustering-based classification of road traffic accidents using hierarchical clustering and artificial neural networks. Int. J. Inj. Control Saf. Promot. 24(3), 388–395 (2017)

    Article  Google Scholar 

  5. Chang, L.-Y., Wang, H.-W.: Analysis of traffic injury severity: An application of non-parametric classification tree techniques. Accid. Anal. Prev. 38(5), 1019–1027 (2006)

    Article  Google Scholar 

  6. Taamneh, M., Alkheder, S., Taamneh, S.: Data-mining techniques for traffic accident modeling and prediction in the United Arab Emirates. J. Transp. Saf. Secur. 9(2), 146–166 (2017)

    Article  Google Scholar 

  7. Kashani, A.T., Mohaymany, A.S.: Analysis of the traffic injury severity on two-lane, two-way rural roads based on classification tree models. Saf. Sci. 49(10), 1314–1320 (2011)

    Article  Google Scholar 

  8. Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)

    Article  Google Scholar 

  9. Bramer, M.: Avoiding overfitting of decision trees. In: Principles of Data Mining. Springer, London, pp. 121–136 (2013)

    Google Scholar 

  10. Breiman, L.: Manual on setting up, using, and understanding random forests v3. 1. Statistics Department, University of California Berkeley, CA, USA 1 (2002)

    Google Scholar 

  11. Liaw, A., Wiener, M.: Classification and regression by randomForest. R News 2(3), 18–22 (2002)

    Google Scholar 

  12. Krishnaveni, S., Hemalatha, M.: A perspective analysis of traffic accident using data mining techniques. Int. J. Comput. Appl. 23(7), 40–48 (2011)

    Google Scholar 

  13. Das, A., Abdel-Aty, M., Pande, A.: Using conditional inference forests to identify the factors affecting crash severity on arterial corridors. J. Saf. Res. 40(4), 317–327 (2009)

    Article  Google Scholar 

  14. Haleem, K., Gan, A.: Effect of driver’s age and side of impact on crash severity along urban freeways: A mixed logit approach. J. Saf. Res. 46, 67–76 (2013)

    Article  Google Scholar 

  15. Harb, R., et al.: Exploring precrash maneuvers using classification trees and random forests. Accid. Anal. Prev. 41(1), 98–107 (2009)

    Google Scholar 

  16. Lee, J., Nam, B., Abdel-Aty, M.: Effects of pavement surface conditions on traffic crash severity. J. Transp. Eng. 141(10), 04015020 (2015)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Salah Taamneh .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Taamneh, S., Taamneh, M. (2019). Evaluation of the Performance of Random Forests Technique in Predicting the Severity of Road Traffic Accidents. In: Stanton, N. (eds) Advances in Human Aspects of Transportation. AHFE 2018. Advances in Intelligent Systems and Computing, vol 786. Springer, Cham. https://doi.org/10.1007/978-3-319-93885-1_78

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-93885-1_78

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-93884-4

  • Online ISBN: 978-3-319-93885-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics