TY - GEN
T1 - Natural scene text recognition using convolutional recurrent neural network
AU - Wang, Yao
AU - Ha, Jongeun
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/8/17
Y1 - 2021/8/17
N2 - In this article, we explore the scene text recognition problem, which is one of the challenging sub-fields of computer vision. Recently, deep learning has achived state-of-the-art performance for recognition task. The convolutional recurrent neural network (CRNN) architecture is explored for this task, which consists of feature extraction, sequence modeling. Moreover, an attention mechanism is introduced in our study. Unlike many of previous scene text recognition systems, the proposed architecture has several advantages: the model can be trained using the end-to-end manner and the CRNN can deal with the sequences of arbitrary length. Comparing the detection results of several mainstream CNN network structures, the experimental results show that the accuracy of the detection results is improved, and false positives are reduced, which clearly demonstrate its effectiveness.
AB - In this article, we explore the scene text recognition problem, which is one of the challenging sub-fields of computer vision. Recently, deep learning has achived state-of-the-art performance for recognition task. The convolutional recurrent neural network (CRNN) architecture is explored for this task, which consists of feature extraction, sequence modeling. Moreover, an attention mechanism is introduced in our study. Unlike many of previous scene text recognition systems, the proposed architecture has several advantages: the model can be trained using the end-to-end manner and the CRNN can deal with the sequences of arbitrary length. Comparing the detection results of several mainstream CNN network structures, the experimental results show that the accuracy of the detection results is improved, and false positives are reduced, which clearly demonstrate its effectiveness.
KW - Attention Mechanism
KW - Convolutional recurrent neural network
KW - Scene text recognition
UR - https://www.scopus.com/pages/publications/85118926612
U2 - 10.1109/ICCSE51940.2021.9569296
DO - 10.1109/ICCSE51940.2021.9569296
M3 - Conference contribution
AN - SCOPUS:85118926612
T3 - ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education
SP - 789
EP - 793
BT - ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 16th IEEE International Conference on Computer Science and Education, ICCSE 2021
Y2 - 17 August 2021 through 21 August 2021
ER -