Transfer Learning for Scene Text Recognition in Indian Languages

01/10/2022
by   Sanjana Gunna, et al.
9

Scene text recognition in low-resource Indian languages is challenging because of complexities like multiple scripts, fonts, text size, and orientations. In this work, we investigate the power of transfer learning for all the layers of deep scene text recognition networks from English to two common Indian languages. We perform experiments on the conventional CRNN model and STAR-Net to ensure generalisability. To study the effect of change in different scripts, we initially run our experiments on synthetic word images rendered using Unicode fonts. We show that the transfer of English models to simple synthetic datasets of Indian languages is not practical. Instead, we propose to apply transfer learning techniques among Indian languages due to similarity in their n-gram distributions and visual features like the vowels and conjunct characters. We then study the transfer learning among six Indian languages with varying complexities in fonts and word length statistics. We also demonstrate that the learned features of the models transferred from other Indian languages are visually closer (and sometimes even better) to the individual model features than those transferred from English. We finally set new benchmarks for scene-text recognition on Hindi, Telugu, and Malayalam datasets from IIIT-ILST and Bangla dataset from MLT-17 by achieving 6 and 23 further improve the MLT-17 Bangla results by plugging in a novel correction BiLSTM into our model. We additionally release a dataset of around 440 scene images containing 500 Gujarati and 2535 Tamil words. WRRs improve over the baselines by 8 Gujarati and Tamil datasets.

READ FULL TEXT

page 2

page 6

page 9

page 13

research
01/10/2022

Towards Boosting the Accuracy of Non-Latin Scene Text Recognition

Scene-text recognition is remarkably better in Latin languages than the ...
research
08/06/2023

Towards Scene-Text to Scene-Text Translation

In this work, we study the task of “visually" translating scene text fro...
research
04/09/2021

Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam

Inspired by the success of Deep Learning based approaches to English sce...
research
02/27/2022

A Multimodal German Dataset for Automatic Lip Reading Systems and Transfer Learning

Large datasets as required for deep learning of lip reading do not exist...
research
06/03/2020

Transfer Learning for British Sign Language Modelling

Automatic speech recognition and spoken dialogue systems have made great...
research
09/14/2020

Adaptive Text Recognition through Visual Matching

In this work, our objective is to address the problems of generalization...
research
03/26/2022

Joint Transformer/RNN Architecture for Gesture Typing in Indic Languages

Gesture typing is a method of typing words on a touch-based keyboard by ...

Please sign up or login with your details

Forgot password? Click here to reset