Deep Neural Networks Combined with STN for Multi-Oriented Text Detection and Recognition

Saif Hassan Katper; Abdul Rehman Gilal; Ahmad Waqas; Ae Shah Alsughayyie; Abdullah Alshanqiti; Jafreezal Jaafar

doi:10.14569/IJACSA.2020.0110424

Back

Deep Neural Networks Combined with STN for Multi-Oriented Text Detection and Recognition

Journal article

Open access

Deep Neural Networks Combined with STN for Multi-Oriented Text Detection and Recognition

Saif Hassan Katper, Abdul Rehman Gilal, Ahmad Waqas, Ae Shah Alsughayyie, Abdullah Alshanqiti and Jafreezal Jaafar

International journal of advanced computer science & applications, Vol.11(4), pp.178-184

01/01/2020

DOI: https://doi.org/10.14569/IJACSA.2020.0110424

Abstract

Computer Science

Computer Science, Theory & Methods

Science & Technology

Technology

Developing systems for interpreting visuals, such as images, videos is really challenging but important task to be developed and applied on benchmark datasets. This study solves the very challenge by using STN-OCR model consisting of deep neural networks (DNN) and Spatial Transformer Networks (STNs). The network architecture of this study consists of two stages: localization network and recognition network. In the localization network it finds and localizes text regions and generates sampling grid. Whereas, in the recognition network, text regions will be input and then this network learns to recognize text including low resolution, curved and multi-oriented text. Deep learning-based approaches require a lot of data for training effectively, therefore, this study has used two benchmark datasets, Street View House Numbers (SVHN) and International Conference on Document Analysis and Recognition (ICDAR) 2015 to evaluate the system. The STN-OCR model achieves better results than literature on these datasets.

Files and links (1)

url

https://doi.org/10.14569/IJACSA.2020.0110424View

Published (Version of record) Open

Metrics

1 Record Views

Details

Title: Deep Neural Networks Combined with STN for Multi-Oriented Text Detection and Recognition
Creators - without role: Saif Hassan Katper - Sukkur IBA University
Abdul Rehman Gilal - Sukkur IBA University
Ahmad Waqas - Sukkur IBA University
Ae Shah Alsughayyie - Taibah University
Abdullah Alshanqiti - Islamic Univ IU, Fac Comp & Informat Syst, Madinah, Saudi Arabia
Jafreezal Jaafar - Petronas
Publication Details: International journal of advanced computer science & applications, Vol.11(4), pp.178-184
Publisher: Science & Information Sai Organization Ltd
Number of pages: 7
Identifiers: 9930184708331
Academic Unit: Taibah University
Language: English
Resource Type: Journal article