TY - GEN
T1 - ZenNet-SA
T2 - 24th International Conference on Control, Automation and Systems, ICCAS 2024
AU - Nguyen, Trang
AU - Nabi, Hamza Ghulam
AU - Han, Ji Hyeong
N1 - Publisher Copyright:
© 2024 ICROS.
PY - 2024
Y1 - 2024
N2 - Facial expression recognition (FER) is an important task in computer vision, having practical applications in areas such as human-computer interaction, education, healthcare, and online monitoring. Although significant progress has been made possible by deep learning, real-world deployment of the FER system still needs to be improved, particularly with limited resources such as embedded or mobile systems. From this challenging FER task, we propose a lightweight network that combines a lightweight high performance backbone by applying ZenNAS [1] and a distillation feature part after backbone to refine features. Extensive experimental results demonstrate that the proposed ZenNet-SA achieves state-of-the-art results on three FER benchmark datasets, RAF-DB (88.53%) as well as AffectNet 7 class (65.23%) and 8 class (61.32%) in the light-weight models group with a model size less than 1M and computation costs approximately 450M Flops. Our results demonstrate a notable improvement in efficiency, with a decrease of approximately 97% in FLOPS and 91% in the number of parameters compared to POSTER V2 in terms of computational and resource costs.
AB - Facial expression recognition (FER) is an important task in computer vision, having practical applications in areas such as human-computer interaction, education, healthcare, and online monitoring. Although significant progress has been made possible by deep learning, real-world deployment of the FER system still needs to be improved, particularly with limited resources such as embedded or mobile systems. From this challenging FER task, we propose a lightweight network that combines a lightweight high performance backbone by applying ZenNAS [1] and a distillation feature part after backbone to refine features. Extensive experimental results demonstrate that the proposed ZenNet-SA achieves state-of-the-art results on three FER benchmark datasets, RAF-DB (88.53%) as well as AffectNet 7 class (65.23%) and 8 class (61.32%) in the light-weight models group with a model size less than 1M and computation costs approximately 450M Flops. Our results demonstrate a notable improvement in efficiency, with a decrease of approximately 97% in FLOPS and 91% in the number of parameters compared to POSTER V2 in terms of computational and resource costs.
KW - Facial expression recognition
KW - light-weight model
UR - http://www.scopus.com/inward/record.url?scp=85214413624&partnerID=8YFLogxK
U2 - 10.23919/ICCAS63016.2024.10773246
DO - 10.23919/ICCAS63016.2024.10773246
M3 - Conference contribution
AN - SCOPUS:85214413624
T3 - International Conference on Control, Automation and Systems
SP - 55
EP - 60
BT - 2024 24th International Conference on Control, Automation and Systems, ICCAS 2024
PB - IEEE Computer Society
Y2 - 29 October 2024 through 1 November 2024
ER -