TY - JOUR
T1 - Efficient Object Detection Acceleration Methods for Autonomous-driving Embedded Platforms
AU - Choi, Jiwoong
AU - Chun, Dayoung
AU - Lee, Hyuk Jae
AU - Kim, Hyun
N1 - Publisher Copyright:
© 2022 Institute of Electronics and Information Engineers. All rights reserved.
PY - 2022
Y1 - 2022
N2 - Object detection in autonomous vehicles is typically operated in an embedded system to reduce power consumption. The use of an object detection algorithm with high accuracy and real-time detection speed in the embedded systems is essential for ensuring safe driving. This study proposes a parallel processing method for GPU and CPU operations to enhance the detection speed of the model. In addition, this study proposes data augmentation and image resize techniques that consider the camera input size of autonomous driving, which increases the accuracy significantly while improving the detection speed. The application of these proposed schemes to a baseline algorithm, tiny Gaussian YOLOv3, improves the mean average precision by 1.14 percent points (pp) for the Berkeley Deep Drive (BDD) dataset and 1.34 pp for the KITTI dataset compared to the baseline. Furthermore, in the NVIDIA Jetson AGX Xavier, which is an embedded platform for autonomous driving, the proposed algorithm improves the detection speed by 22.54 % for the BDD, and 24.67 % for the KITTI compared to the baseline, thereby enabling high-speed real-time detection on both datasets.
AB - Object detection in autonomous vehicles is typically operated in an embedded system to reduce power consumption. The use of an object detection algorithm with high accuracy and real-time detection speed in the embedded systems is essential for ensuring safe driving. This study proposes a parallel processing method for GPU and CPU operations to enhance the detection speed of the model. In addition, this study proposes data augmentation and image resize techniques that consider the camera input size of autonomous driving, which increases the accuracy significantly while improving the detection speed. The application of these proposed schemes to a baseline algorithm, tiny Gaussian YOLOv3, improves the mean average precision by 1.14 percent points (pp) for the Berkeley Deep Drive (BDD) dataset and 1.34 pp for the KITTI dataset compared to the baseline. Furthermore, in the NVIDIA Jetson AGX Xavier, which is an embedded platform for autonomous driving, the proposed algorithm improves the detection speed by 22.54 % for the BDD, and 24.67 % for the KITTI compared to the baseline, thereby enabling high-speed real-time detection on both datasets.
KW - Autonomous driving
KW - Deep learning
KW - Embedded system
KW - NVIDIA Jetson AGX Xavier
KW - Object detection
UR - http://www.scopus.com/inward/record.url?scp=85147250201&partnerID=8YFLogxK
U2 - 10.5573/IEIESPC.2022.11.4.255
DO - 10.5573/IEIESPC.2022.11.4.255
M3 - Article
AN - SCOPUS:85147250201
SN - 2287-5255
VL - 11
SP - 255
EP - 261
JO - IEIE Transactions on Smart Processing and Computing
JF - IEIE Transactions on Smart Processing and Computing
IS - 4
ER -