TY - GEN
T1 - Tri-Directional Decoder for Edge Discontinuity Classification
AU - Wang, Jiayue
AU - Lee, Hyuk Jae
AU - Cho, Hansang
AU - Kang, Byungsoo
AU - Jung, Hyunmin
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Extracting and exploiting edge information is important for various computer vision tasks. According to the physical characteristics, edges are further categorized into reflectance, illumination, normal, and depth discontinuities. Previous studies for edge discontinuity classification have achieved impressive performance in extracting discontinuities, but the capacity of classification remains limited. This paper proposes TriDecTr, a novel network comprising a transformer-based encoder to improve semantic understanding and a tri-directional decoder to explore relationships among categories. Extensive experiments demonstrate that TriDecTr achieves state-of-the-art performance on the BSDS-RIND dataset with 0.531 in ODS, 0.571 in OIS, and 0.461 in AP. Moreover, TriDecTr significantly narrows the performance gap between illumination edges and the other categories from 0.191 to 0.118 in ODS.
AB - Extracting and exploiting edge information is important for various computer vision tasks. According to the physical characteristics, edges are further categorized into reflectance, illumination, normal, and depth discontinuities. Previous studies for edge discontinuity classification have achieved impressive performance in extracting discontinuities, but the capacity of classification remains limited. This paper proposes TriDecTr, a novel network comprising a transformer-based encoder to improve semantic understanding and a tri-directional decoder to explore relationships among categories. Extensive experiments demonstrate that TriDecTr achieves state-of-the-art performance on the BSDS-RIND dataset with 0.531 in ODS, 0.571 in OIS, and 0.461 in AP. Moreover, TriDecTr significantly narrows the performance gap between illumination edges and the other categories from 0.191 to 0.118 in ODS.
KW - edge detection
KW - edge discontinuity classification
KW - vision transformer
UR - http://www.scopus.com/inward/record.url?scp=85198531976&partnerID=8YFLogxK
U2 - 10.1109/ISCAS58744.2024.10558011
DO - 10.1109/ISCAS58744.2024.10558011
M3 - Conference contribution
AN - SCOPUS:85198531976
T3 - Proceedings - IEEE International Symposium on Circuits and Systems
BT - ISCAS 2024 - IEEE International Symposium on Circuits and Systems
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 IEEE International Symposium on Circuits and Systems, ISCAS 2024
Y2 - 19 May 2024 through 22 May 2024
ER -