TY - JOUR
T1 - Proactive approach to enhancing safety management using deep learning classifiers for construction safety documentation
AU - Kumi, Louis
AU - Jeong, Jaewook
AU - Jeong, Jaemin
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2025/8/1
Y1 - 2025/8/1
N2 - Construction accidents remain a major global concern, highlighting the need for proactive safety measures during the design phase. Design for Safety (DfS) reports, which aim to identify and mitigate hazards during the design stage, play a crucial role in preventing accidents. However, their manual classification in South Korea is time-consuming and error-prone, reducing their effectiveness. Thus, this study presents a deep learning-based classification framework to automatically classify DfS reports based on accident types, facility types, and work types. Three deep learning architectures, Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), and a hybrid CNN-LSTM model, were developed and trained on a comprehensive dataset of 1513 DfS reports collected from government databases and construction companies' internal records. An extensive model evaluation was conducted using key performance metrics, including accuracy, precision, recall, F1 score, and Area Under the Receiver Operating Characteristic Curve (AUC ROC). The results demonstrated that the CNN model achieved the highest performance across all tasks, with an accuracy of 71 % for accident type classification, 46 % for facility type classification, and 64 % for work type classification. Additionally, CNN outperformed LSTM and CNN-LSTM in terms of precision, recall, and F1 score across all categories, indicating its superior capability in extracting spatially correlated textual patterns. Beyond model performance, this study contributes to construction safety by introducing an automated, data-driven approach to DfS report analysis, which facilitates proactive hazard identification, enhances risk prioritization, and optimizes resource allocation during the design phase.
AB - Construction accidents remain a major global concern, highlighting the need for proactive safety measures during the design phase. Design for Safety (DfS) reports, which aim to identify and mitigate hazards during the design stage, play a crucial role in preventing accidents. However, their manual classification in South Korea is time-consuming and error-prone, reducing their effectiveness. Thus, this study presents a deep learning-based classification framework to automatically classify DfS reports based on accident types, facility types, and work types. Three deep learning architectures, Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), and a hybrid CNN-LSTM model, were developed and trained on a comprehensive dataset of 1513 DfS reports collected from government databases and construction companies' internal records. An extensive model evaluation was conducted using key performance metrics, including accuracy, precision, recall, F1 score, and Area Under the Receiver Operating Characteristic Curve (AUC ROC). The results demonstrated that the CNN model achieved the highest performance across all tasks, with an accuracy of 71 % for accident type classification, 46 % for facility type classification, and 64 % for work type classification. Additionally, CNN outperformed LSTM and CNN-LSTM in terms of precision, recall, and F1 score across all categories, indicating its superior capability in extracting spatially correlated textual patterns. Beyond model performance, this study contributes to construction safety by introducing an automated, data-driven approach to DfS report analysis, which facilitates proactive hazard identification, enhances risk prioritization, and optimizes resource allocation during the design phase.
KW - Accident type
KW - Deep learning
KW - Design for safety
KW - Facility type
KW - Natural language processing
KW - Work type
UR - http://www.scopus.com/inward/record.url?scp=105002892037&partnerID=8YFLogxK
U2 - 10.1016/j.engappai.2025.110889
DO - 10.1016/j.engappai.2025.110889
M3 - Article
AN - SCOPUS:105002892037
SN - 0952-1976
VL - 153
JO - Engineering Applications of Artificial Intelligence
JF - Engineering Applications of Artificial Intelligence
M1 - 110889
ER -