Paper ID | MLR-APPL-IVASR-4.10
Paper Title | ANALYSIS OF THE NOVEL TRANSFORMER MODULE COMBINATION FOR SCENE TEXT RECOGNITION
Authors | Yeon-Gyu Kim, Hyunsu Kim, Minseok Kang, Hyug-Jae Lee, Rokkyu Lee, Gunhan Park, NHN, Republic of Korea
Session | MLR-APPL-IVASR-4: Machine learning for image and video analysis, synthesis, and retrieval 4
Location | Area B
Session Time | Tuesday, 21 September, 13:30 - 15:00
Presentation Time | Tuesday, 21 September, 13:30 - 15:00
Presentation | Poster
Topic | Applications of Machine Learning: Machine learning for image & video analysis, synthesis, and retrieval
Abstract | Various methods for scene text recognition (STR) are proposed every year. These methods have dramatically improved performance in the STR field; however, they have not kept pace with progress in general-purpose research on image recognition, detection, speech recognition, and text analysis. In this paper, we evaluate the performance of several deep learning schemes for the encoder part of the Transformer in STR. First, we replace the baseline feed-forward network (FFN) module of the encoder with a squeeze-and-excitation (SE)-FFN or a cross stage partial (CSP)-FFN. Second, the overall encoder architecture is replaced with local dense synthesizer attention (LDSA) or a Conformer structure. The Conformer encoder achieves the best test accuracy in various experiments, and SE-FFN and CSP-FFN also show competitive performance when the number of parameters is considered. Visualizing the attention maps from different encoder combinations allows for qualitative performance analysis.
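To make the SE-FFN idea concrete, here is a minimal NumPy sketch of a Transformer FFN followed by a squeeze-and-excitation gate. This is an illustrative assumption, not the paper's implementation: the exact placement of the SE block, the reduction dimension, and all weight names (`w1`, `w2`, `w_sq`, `w_ex`) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def se_gate(x, w_squeeze, w_excite):
    """Squeeze-and-excitation over a (seq_len, d_model) feature map:
    global average pool over the sequence ("squeeze"), a two-layer
    bottleneck ("excite"), then sigmoid gating of each channel."""
    s = x.mean(axis=0)                                      # squeeze: (d_model,)
    e = sigmoid(np.maximum(s @ w_squeeze, 0.0) @ w_excite)  # excite:  (d_model,)
    return x * e                                            # channel-wise rescale

def se_ffn(x, w1, w2, w_squeeze, w_excite):
    """A plain Transformer FFN (linear -> ReLU -> linear) with an SE gate
    applied to its output; gate placement is an assumption for illustration."""
    h = np.maximum(x @ w1, 0.0) @ w2                        # (seq_len, d_model)
    return se_gate(h, w_squeeze, w_excite)

# Toy dimensions (hypothetical, far smaller than a real STR model).
seq_len, d_model, d_ff, d_red = 8, 16, 32, 4
x    = rng.standard_normal((seq_len, d_model))
w1   = rng.standard_normal((d_model, d_ff)) * 0.1
w2   = rng.standard_normal((d_ff, d_model)) * 0.1
w_sq = rng.standard_normal((d_model, d_red)) * 0.1
w_ex = rng.standard_normal((d_red, d_model)) * 0.1

y = se_ffn(x, w1, w2, w_sq, w_ex)
print(y.shape)  # (8, 16) -- same shape as the FFN output, channels rescaled
```

Because the gate is a sigmoid in (0, 1), the SE block can only attenuate channels, letting the encoder emphasize informative feature channels at a small parameter cost (two bottleneck matrices per layer).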