Optical Script Identification for multi-lingual Indic-script

08/10/2023
by   Sidhantha Poddar, et al.
0

Script identification and text recognition are some of the major domains in the application of Artificial Intelligence. In this era of digitalization, the use of digital note-taking has become a common practice. Still, conventional methods of using pen and paper is a prominent way of writing. This leads to the classification of scripts based on the method they are obtained. A survey on the current methodologies and state-of-art methods used for processing and identification would prove beneficial for researchers. The aim of this article is to discuss the advancement in the techniques for script pre-processing and text recognition. In India there are twelve prominent Indic scripts, unlike the English language, these scripts have layers of characteristics. Complex characteristics such as similarity in text shape make them difficult to recognize and analyze, thus this requires advance preprocessing methods for their accurate recognition. A sincere attempt is made in this survey to provide a comparison between all algorithms. We hope that this survey would provide insight to a researcher working not only on Indic scripts but also other languages.

READ FULL TEXT

page 2

page 4

page 14

page 15

page 16

research
04/22/2018

Automatic Language Identification in Texts: A Survey

Language identification (LI) is the problem of determining the natural l...
research
03/17/2011

Identification of arabic word from bilingual text using character features

The identification of the language of the script is an important stage i...
research
01/07/2020

Artificial Intelligence for Social Good: A Survey

Artificial intelligence for social good (AI4SG) is a research theme that...
research
11/26/2022

A Survey of Text Representation Methods and Their Genealogy

In recent years, with the advent of highly scalable artificial-neural-ne...
research
09/30/2020

Towards Improved Model Design for Authorship Identification: A Survey on Writing Style Understanding

Authorship identification tasks, which rely heavily on linguistic styles...
research
11/21/2021

Capitalization and Punctuation Restoration: a Survey

Ensuring proper punctuation and letter casing is a key pre-processing st...
research
06/15/2022

A Survey : Neural Networks for AMR-to-Text

AMR-to-text is one of the key techniques in the NLP community that aims ...

Please sign up or login with your details

Forgot password? Click here to reset