Threat assessment from naturalistic video-data: How to detect, classify, and estimate the position of multiple road users from cameras

dc.contributor.authorFang, Luhan
dc.contributor.authorWu, Yahui
dc.contributor.departmentChalmers tekniska högskola / Institutionen för mekanik och maritima vetenskapersv
dc.contributor.departmentChalmers University of Technology / Department of Mechanics and Maritime Sciencesen
dc.contributor.examinerDozza, Marco
dc.contributor.supervisorPai, Rahul Rajendra
dc.contributor.supervisorRasch, Alexander
dc.date.accessioned2024-06-20T13:11:12Z
dc.date.available2024-06-20T13:11:12Z
dc.date.issued2024
dc.date.submitted
dc.description.abstractEach year, over one million people die in traffic-related crashes, with more than half involving vulnerable road users (VRUs) such as pedestrians, cyclists, and escooterists. To mitigate the crashes, it is important to understand the causation mechanisms of such crashes and analyze the interactions and behaviors of VRUs. Using the knowledge of these crash mechanisms and rider behavior, threat assesment algorithms can be developed to identify anomaly in rider behavior and likelihood of a crash. Naturalistic data has proven to be an effective tool in traffic safety research to identify the road user interaction and crash mechanisms. However, as the data tends to be large, traditionally the safety critical events have been identified based on kinematic trigger. However, in some safety critical scenarios, the vehicles may not exhibit any change in kinematics, which often leads to their exclusion from analysis. Given the sheer volume of data, manually reviewing each video requires significant time and resources making it unviable. With the development of computer vision, investigating the video footage can be very effective. Therefore, by using computer vision methods, we can accurately identify different road users within the footage. Once a road user is identified, machine learning models can be utilized to convert the 2D position on the video frame into real-world positions and kinematics of the road users. Threat assessment algorithms can use these positions and kinematics to determine safety-critical scenarios. This approach makes identification of safety critical scenarios using videos feasible in addition to making efficient use of the collected data. This thesis proposes a system that first utilizes the version 7 of You Only Look Once (YOLOv7) model to conduct object detection, providing the bounding boxes for target road users, including pedestrians, cars, cyclists, e-scooterists, and other micromobility vehicles such as bicycles and e-scooters. Then, machine learning regressors are used, taking the features of bounding boxes from YOLOv7 as input, to estimate the positions of the target objects. Finally, a threat assessment algorithm, based on the range and time gap between the ego vehicle and the target object, is used to identify whether the scenario in the video footage is a critical event or not. The object detection model can generate detection results for pedestrians, cars, cyclists, e-scooterists, bicycles, and e-scooters with a mean Average Precision (mAP) of 0.893 for all classes combined. Among all the selected machine learning regressors for position estimation, the RandomForestRegressor has the highest R2 score, exceeding 0.9. With the positional information of the road users, the threat assessment algorithm can apply safety metrics to detect critical events.
dc.identifier.coursecodeMMSX30
dc.identifier.urihttp://hdl.handle.net/20.500.12380/307974
dc.language.isoeng
dc.setspec.uppsokTechnology
dc.subjectVRU
dc.subjectMicromobility
dc.subjectComputer Vision
dc.subjectPosition Estimation
dc.subjectThreat Assessment
dc.titleThreat assessment from naturalistic video-data: How to detect, classify, and estimate the position of multiple road users from cameras
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster's Thesisen
dc.type.uppsokH
local.programmeMobility engineering (MPMOB), MSc
Ladda ner
Original bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
2024 Luhan Fang & Yahui Wu.pdf
Storlek:
17.24 MB
Format:
Adobe Portable Document Format
Beskrivning:
License bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
2.35 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: