TY - GEN
T1 - HybridNet
T2 - 21st International Conference on Control, Automation and Systems, ICCAS 2021
AU - Yeom, Sangsik
AU - Ha, Jongeun
N1 - Publisher Copyright:
© 2021 ICROS.
PY - 2021
Y1 - 2021
N2 - In this paper, we propose a HybridNet that improves performance by fusing 2D and 3D features. A voxel-based method and a projection-based method were adopted to derive the results through one scan. Our approach consists of two parallel networks, extracts features along each dimension, and converges them in a Fusion Network. In the fusion network, the voxel blocks and 2D feature maps extracted from each structure are fused to the voxel grid and then trained through convolution. For effective training of 2D networks, we use data augmentation techniques using coordinate system rotation transformation. In addition, the performance was effectively improved by using a multi-loss with weights applied to each dimension, and better performance was achieved than the result using a single loss. Our proposed method can achieve better performance by changing the performance of the 2D network and 3D network, which can be generalized using other structures.
AB - In this paper, we propose a HybridNet that improves performance by fusing 2D and 3D features. A voxel-based method and a projection-based method were adopted to derive the results through one scan. Our approach consists of two parallel networks, extracts features along each dimension, and converges them in a Fusion Network. In the fusion network, the voxel blocks and 2D feature maps extracted from each structure are fused to the voxel grid and then trained through convolution. For effective training of 2D networks, we use data augmentation techniques using coordinate system rotation transformation. In addition, the performance was effectively improved by using a multi-loss with weights applied to each dimension, and better performance was achieved than the result using a single loss. Our proposed method can achieve better performance by changing the performance of the 2D network and 3D network, which can be generalized using other structures.
KW - 3D Vision
KW - PointCloud
KW - Semantic Segmentation
UR - https://www.scopus.com/pages/publications/85121444101
U2 - 10.23919/ICCAS52745.2021.9649914
DO - 10.23919/ICCAS52745.2021.9649914
M3 - Conference contribution
AN - SCOPUS:85121444101
T3 - International Conference on Control, Automation and Systems
SP - 1509
EP - 1515
BT - 2021 21st International Conference on Control, Automation and Systems, ICCAS 2021
PB - IEEE Computer Society
Y2 - 12 October 2021 through 15 October 2021
ER -