DSpace Repository

Crash severity analysis in distracted driving using unlabeled and imbalanced data: A novel approach using Robust Two-Phase Ensemble Predictor

Show simple item record

dc.contributor.author Bag, Subhajit.
dc.contributor.author Maity, Saptashwa.
dc.contributor.author Sarkar, Sobhan.
dc.date.accessioned 2022-10-28T07:59:44Z
dc.date.available 2022-10-28T07:59:44Z
dc.date.issued 2022-10
dc.identifier.citation Subhajit Bag, Saptashwa Maity & Sobhan Sarkar (Oct. 2022). Crash severity analysis in distracted driving using unlabeled and imbalanced data: A novel approach using Robust Two-Phase Ensemble Predictor. 2022 International Conference on Data Analytics for Business and Industry (ICDABI), 88-92. IEEE. en_US
dc.identifier.uri https://doi.org/10.1109/ICDABI56818.2022.10041646
dc.identifier.uri http://idr.iimranchi.ac.in:8080/xmlui/handle/123456789/1604
dc.description.abstract Distracted driving plays a pivotal role in road accidents. Therefore, prediction of the crash severity due to distracted driving is essential. Although several machine learning techniques exist for such prediction, it is difficult to use them in case of the unavailability of class labels and class imbalance issues. Moreover, there is a severe lack of research considering environmental factors and driver’s behaviour to predict the crash severity. To address the issues, in this study, a robust two-phase ensemble prediction model has been developed, considering the geolocation information and driver’s behaviour. An analysis of the unlabeled and high-dimensional data is generally challenging. We perform dimensionality reduction using t-SNE, followed by agglomerative hierarchical clustering to get labelled data. We have used Synthetic Minority Over-sampling Technique (SMOTE) to mitigate the class imbalance issue. Subsequently, we observe that some localities have much more severe crashes, so we develop a feature considering the geolocation information. Then, we create a novel predictor called Robust Two-Phase Ensemble Predictor (R2PEP) to predict the crash severity. The performance of the proposed model has been compared with five state-of-the-art algorithms using a dataset we obtained from the Nevada Department of Transportation. The comparison demonstrates the superiority of our model over the other models, with an accuracy of 99.6%. en_US
dc.language.iso en en_US
dc.publisher 2022 International Conference on Data Analytics for Business and Industry (ICDABI) en_US
dc.subject Distracted driving en_US
dc.subject Clustering en_US
dc.subject Class imbalance en_US
dc.subject Crash severity prediction en_US
dc.subject IIM Ranchi en_US
dc.title Crash severity analysis in distracted driving using unlabeled and imbalanced data: A novel approach using Robust Two-Phase Ensemble Predictor en_US
dc.type Conference Paper en_US


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record