DSpace Repository

Behavioral insights and hotspot identification: Integrating natural language processing, machine learning and geospatial analyses of cyclist crashes

Show simple item record

dc.contributor.author Malaghan, Vinayak Devendra
dc.date.accessioned 2026-05-14T10:28:09Z
dc.date.available 2026-05-14T10:28:09Z
dc.date.issued 2025-08
dc.identifier.uri https://www.sciencedirect.com/science/article/pii/S136984782500169X
dc.identifier.uri http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/21318
dc.description.abstract In response to the rising trend in the promotion and adoption of cycling, ensuring cyclist safety is paramount. Understanding behavioural causes of crashes and identifying collision hotspots is important; however, the efforts are hindered by underreporting and limited data on all types of incidents, including near misses. Addressing these challenges, this study analyses text data reported on dedicated active travel collision platforms to categorize incidents and uncover behavioural patterns contributing to collisions. The reported text data is grouped into distinct themes applying Term Frequency-Inverse Document Frequency (TF-IDF) vectorization, and clustering. Additionally, the advanced geospatial technique Getis-Ord Gi* statistic is computed to identify spatial clustering of collisions and categorize geographical regions as hotspots and cold spots. Key themes contributing to collisions are grouped as follows: ‘close pass incidents,’ ‘blocked bicycle lanes,’ ‘cyclist incidents on tram tracks,’ ‘roundabout incidents,’ ‘left turn incidents,’ ‘incidents between buses and cyclists,’ ‘incidents involving cyclists and trucks,’ ‘incidents related to traffic lights and pedestrian crossings,’ and ‘turning incidents at intersections.’ Moreover, the hotspots from these incidents are located at or near the intersections of regional roads in the Central Business District (CBD) and on the peripheral regional roads encapsulating the CBD in Dublin, Ireland. This study advances the state of the art by utilizing an alternative data source, ‘crash descriptions’ from cyclist crashes, through the application of innovative machine learning techniques and advanced geospatial analyses. The insights from the unique themes and identified hotspots enhance understanding of risky behaviours and their spatial distribution, contributing to ongoing efforts to foster a safer cycling environment. en_US
dc.language.iso en en_US
dc.publisher Elsevier en_US
dc.subject Civil engineering en_US
dc.subject Behavioral analyses en_US
dc.subject Collision hotspots en_US
dc.subject Cyclist safety en_US
dc.title Behavioral insights and hotspot identification: Integrating natural language processing, machine learning and geospatial analyses of cyclist crashes en_US
dc.type Article en_US


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account