TY - JOUR
T1 - Vision-based multi-label detection framework for capturing occupant action and clothing information using large-scale dataset
AU - Jung, Seunghoon
AU - Jeoung, Jaewon
AU - Hong, Taehoon
AU - Jang, Hyounseung
N1 - Publisher Copyright:
© 2024 Elsevier Ltd
PY - 2024/6/1
Y1 - 2024/6/1
N2 - Capturing occupant action and clothing information is important for applying occupant-centric control (OCC) to mitigate energy overuse and improve indoor environment quality. Therefore, this study introduces a vision-based multi-label detection framework for automatically capturing occupant actions and clothes. Ultimately, a single-stage architecture is designed, providing simultaneous detection of human body bounding boxes, action classes, and clothing classes. The framework also applies the training strategy, allowing concurrent training of action and clothing data. The experiment results showed that the proposed framework reliably detects occupant actions and clothes with a mean average precision (mAP) of 45.0 % and approximately doubles the inference speed compared to the multi-stage detection framework. This advancement paves the way for enhanced OCC systems by ensuring the diversity and variability of occupant information collection.
AB - Capturing occupant action and clothing information is important for applying occupant-centric control (OCC) to mitigate energy overuse and improve indoor environment quality. Therefore, this study introduces a vision-based multi-label detection framework for automatically capturing occupant actions and clothes. Ultimately, a single-stage architecture is designed, providing simultaneous detection of human body bounding boxes, action classes, and clothing classes. The framework also applies the training strategy, allowing concurrent training of action and clothing data. The experiment results showed that the proposed framework reliably detects occupant actions and clothes with a mean average precision (mAP) of 45.0 % and approximately doubles the inference speed compared to the multi-stage detection framework. This advancement paves the way for enhanced OCC systems by ensuring the diversity and variability of occupant information collection.
KW - Computer vision
KW - Large-scale dataset
KW - Multi-label detection
KW - Occupant information
KW - Occupant-centric control
UR - https://www.scopus.com/pages/publications/85190355047
U2 - 10.1016/j.buildenv.2024.111537
DO - 10.1016/j.buildenv.2024.111537
M3 - Article
AN - SCOPUS:85190355047
SN - 0360-1323
VL - 257
JO - Building and Environment
JF - Building and Environment
M1 - 111537
ER -