Object Detection Using Policy-Based Reinforcement Learning

Keong Hun Choi, Jong Eun Ha

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

By constructing an object detection model using a deep neural network structure, it was possible to obtain faster and more accurate results than traditional models. Afterwards, through the method of applying the transformer structure by replacing the existing convolutional layer, the model structure, which was previously carried out in two stages, was simplified, and it became possible to detect the size of the object more freely. However, there are difficulties in generating new data due to the complex configuration of data used for training. As a result, a lot of resources are consumed in data generation, and it is difficult to train in response to a new environment immediately. This study presents a method for learning using simpler data. The simplified data consisted of the input image and the number of objects to be detected. To train using the data, a reinforcement learning method was applied to evaluate the output of the detection model and create and train a reward based on this.

Original languageEnglish
Title of host publication23rd International Conference on Control, Automation and Systems, ICCAS 2023
PublisherIEEE Computer Society
Pages235-237
Number of pages3
ISBN (Electronic)9788993215274
DOIs
StatePublished - 2023
Event23rd International Conference on Control, Automation and Systems, ICCAS 2023 - Yeosu, Korea, Republic of
Duration: 17 Oct 202320 Oct 2023

Publication series

NameInternational Conference on Control, Automation and Systems
ISSN (Print)1598-7833

Conference

Conference23rd International Conference on Control, Automation and Systems, ICCAS 2023
Country/TerritoryKorea, Republic of
CityYeosu
Period17/10/2320/10/23

Keywords

  • Deep learning
  • Object detection
  • Reinforcement learning
  • Transformer

Fingerprint

Dive into the research topics of 'Object Detection Using Policy-Based Reinforcement Learning'. Together they form a unique fingerprint.

Cite this