• Title of article

    A Data Mining Framework to Identify Important Factors of Fatigue and Drowsiness Accidents

  • Author/Authors

    tavakoli kashani ، ali Civil Engineering Department - Iran University of Science and Technology , Rakhshani Moghadam ، Marziyeh Civil Engineering Department - Iran University of Science and Technology , Amirifar ، Saeideh Civil Engineering Department - Iran University of Science and Technology

  • From page
    421
  • To page
    432
  • Abstract
    Fatigue and drowsiness are the major factors contributing to accidents worldwide. According to statistics, 20 to 40 percent of traffic accidents in Iran are due to drivers’ fatigue. This study aims to identify the most important variables affecting the occurrence of fatigue and drowsiness accidents based on the classification and regression tree (CART) method. At first, 859, 378 police crash data of provinces Tehran, Fars, and Mazandaran during seven years (2011-2018) were segmented into homogeneous groups using the two-step clustering algorithm. Next, an oversampling technique is applied to deal with the crash data imbalance problem. Finally, the classification and regression tree combined with the boosting algorithm increases the accuracy of the models. The results of the classification tree showed that the main variables affecting the occurrence of fatigue and drowsiness accidents are: road type, time of day, road traffic direction, local land use, shoulder type, vehicle type, control type, and collision type. Moreover, the road type variable was the only significant factor in residential suburban areas of Mazandaran and Fars provinces. Also, the common variable in residential urban areas of all three provinces was the time of day. It was concluded that the combination of the CART algorithm with oversampling and boosting increased the accuracy of the models. Identifying influential factors in fatigue and drowsiness accidents in the three mentioned provinces could improve the engineering and executive interactions and appropriate educational programs.
  • Keywords
    Fatigue and drowsiness , Clustering analysis , Imbalance data , Classification and Regression Tree , Boosting
  • Journal title
    AUT Journal of Civil Engineering
  • Journal title
    AUT Journal of Civil Engineering
  • Record number

    2736002