TY - JOUR
T1 - ArchShapesNet
T2 - A novel dataset for benchmarking architectural building information modeling element classification algorithms
AU - Yu, Youngsu
AU - Ha, Daemok
AU - Lee, Koeun
AU - Choi, Jiwon
AU - Koo, Bonsang
N1 - Publisher Copyright:
© 2022 The Author(s). Published by Oxford University Press on behalf of the Society for Computational Design and Engineering.
PY - 2022/8/1
Y1 - 2022/8/1
N2 - Recent studies in the domain of semantic enrichment have employed artificial intelligence (AI) approaches to distinguish and classify building information modeling (BIM) elements to check their conformance with open standard data formats. Training AI algorithms requires the development of well-balanced and robust datasets of BIM elements. However, collection is difficult as sources are limited to existing models and sample libraries. This study developed a parametric augmentation approach to create synthetic copies of BIM elements, and thus rapidly supplement manually collected samples. The approach was used to create ArchShapesNet, a dataset consisting of 11 common architectural elements with an equal size of 4,000 samples per class. Two multi-view convolutional neural networks (CNN), a geometric deep learning algorithm, were trained and tested separately on ArchShapesNet and an initial dataset with sample imbalances. Results showed significant improvement in the accuracy and F1 scores, providing evidence of the utility of ArchShapesNet. The size and scope of the dataset are considered to be the first of their kind and provide a benchmark for testing the semantic integrity of BIM models. The augmentation approach also provides a general framework to create custom datasets for different specialties in the Architectural Engineering and Construction industry.
AB - Recent studies in the domain of semantic enrichment have employed artificial intelligence (AI) approaches to distinguish and classify building information modeling (BIM) elements to check their conformance with open standard data formats. Training AI algorithms requires the development of well-balanced and robust datasets of BIM elements. However, collection is difficult as sources are limited to existing models and sample libraries. This study developed a parametric augmentation approach to create synthetic copies of BIM elements, and thus rapidly supplement manually collected samples. The approach was used to create ArchShapesNet, a dataset consisting of 11 common architectural elements with an equal size of 4,000 samples per class. Two multi-view convolutional neural networks (CNN), a geometric deep learning algorithm, were trained and tested separately on ArchShapesNet and an initial dataset with sample imbalances. Results showed significant improvement in the accuracy and F1 scores, providing evidence of the utility of ArchShapesNet. The size and scope of the dataset are considered to be the first of their kind and provide a benchmark for testing the semantic integrity of BIM models. The augmentation approach also provides a general framework to create custom datasets for different specialties in the Architectural Engineering and Construction industry.
KW - BIM (Building Information Modeling)
KW - multi-view CNN
KW - parametric augmentation
KW - semantic enrichment
UR - http://www.scopus.com/inward/record.url?scp=85160011750&partnerID=8YFLogxK
U2 - 10.1093/jcde/qwac064
DO - 10.1093/jcde/qwac064
M3 - Article
AN - SCOPUS:85160011750
SN - 2288-4300
VL - 9
SP - 1449
EP - 1466
JO - Journal of Computational Design and Engineering
JF - Journal of Computational Design and Engineering
IS - 4
ER -