Memory Matters: Convolutional Recurrent Neural Network for Scene Text Recognition

01/06/2016
by Guo Qiang, et al.

Text recognition in natural scenes is a challenging problem because many factors affect the appearance of text. In this paper, we present a method that transcribes scene text images directly to text without requiring sophisticated character segmentation. We leverage recent advances in deep neural networks to model the appearance of scene text images with temporal dynamics. Specifically, we integrate a convolutional neural network (CNN) and a recurrent neural network (RNN), motivated by the complementary modeling capabilities of the two models. The main contribution of this work is an investigation of how temporal memory helps in a segmentation-free fashion for this specific problem. By using long short-term memory (LSTM) blocks as hidden units, our model can retain long-term memory, in contrast to hidden Markov models (HMMs), which maintain only short-term state dependencies. We conduct experiments on the Street View House Numbers dataset, which contains highly variable number images. The results demonstrate the superiority of the proposed method over traditional HMM-based methods.
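To make the claim about long-term memory concrete, the following is a minimal sketch of a single scalar LSTM cell using the standard gate equations. It is not the authors' implementation: the function names (`lstm_step`, `run_cell`), the weight layout, and the hand-picked weights are all assumptions chosen for illustration. The input stands in for a sequence of per-frame features, with one informative frame followed by uninformative ones.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def lstm_step(x, h_prev, c_prev, w):
    """One step of a scalar LSTM cell.

    w maps a gate name ("i", "f", "o", "g") to a tuple (w_x, w_h, b).
    This layout is a hypothetical convention for this sketch.
    """
    def pre(gate):
        w_x, w_h, b = w[gate]
        return w_x * x + w_h * h_prev + b

    i = sigmoid(pre("i"))      # input gate
    f = sigmoid(pre("f"))      # forget gate
    o = sigmoid(pre("o"))      # output gate
    g = math.tanh(pre("g"))    # candidate cell value
    c = f * c_prev + i * g     # memory cell update
    h = o * math.tanh(c)       # hidden output
    return h, c


def run_cell(xs, w):
    """Run the cell over a sequence and return the cell state at each step."""
    h, c = 0.0, 0.0
    states = []
    for x in xs:
        h, c = lstm_step(x, h, c, w)
        states.append(c)
    return states


# Hand-picked weights (assumed, not learned): the input gate opens only for
# a strong input, and the forget gate stays close to 1.
weights = {
    "i": (10.0, 0.0, -5.0),
    "f": (0.0, 0.0, 10.0),
    "o": (0.0, 0.0, 10.0),
    "g": (2.0, 0.0, 0.0),
}

# One informative frame followed by 19 uninformative frames.
frames = [1.0] + [0.0] * 19
cell_states = run_cell(frames, weights)
print(cell_states[0], cell_states[-1])
```

With the forget gate near 1 and the input gate nearly closed on the later frames, the cell state written at the first frame is carried almost unchanged to the last frame. This additive cell update is the mechanism that lets an LSTM keep information over many time steps.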


research
09/13/2017

Reading Scene Text with Attention Convolutional Sequence Modeling

Reading text in the wild is a challenging task in the field of computer ...
research
06/14/2015

Reading Scene Text in Deep Convolutional Sequences

We develop a Deep-Text Recurrent Network (DTRN) that regards scene text ...
research
01/21/2016

Reading Car License Plates Using Deep Convolutional Neural Networks and LSTMs

In this work, we tackle the problem of car license plate detection and r...
research
06/16/2018

Offline Extraction of Indic Regional Language from Natural Scene Image using Text Segmentation and Deep Convolutional Sequence

Regional language extraction from a natural scene image is always a chal...
research
11/05/2019

Improving Long Handwritten Text Line Recognition with Convolutional Multi-way Associative Memory

Convolutional Recurrent Neural Networks (CRNNs) excel at scene text reco...
research
09/06/2017

Scene Text Recognition with Sliding Convolutional Character Models

Scene text recognition has attracted great interests from the computer v...
research
03/17/2022

Knowledge Graph-Enabled Text-Based Automatic Personality Prediction

How people think, feel, and behave, primarily is a representation of the...
