Hybrid deep neural network for Bangla automated image descriptor

02/25/2021
by Md Asifuzzaman Jishan, et al.

Automated image-to-text generation is a computationally challenging computer vision task that requires sufficient comprehension of both the syntactic and semantic content of an image to generate a meaningful description. Until recently, it had been studied only to a limited extent, owing to the lack of visual-descriptor datasets and of functional models able to capture the intrinsic complexity of image features. In this study, we constructed a novel dataset of Bangla textual descriptions of images, called Bangla Natural Language Image to Text (BNLIT), which comprises 100 annotated classes. We also propose a deep neural network-based image captioning model to generate image descriptions. The model employs a Convolutional Neural Network (CNN) to classify the images in the dataset, while a Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) capture the sequential semantic representation of the text sentences and generate a pertinent description based on the complexity of each image. When tested on the new dataset, the model shows a significant improvement in performance on the image semantic retrieval task. For the experiments, we implemented a hybrid image captioning model, which achieved remarkable results on the new, self-built dataset, a task that is new in the Bangladeshi context. In brief, the paper reports benchmark precision in reconstructing characteristic Bangla syntax, together with a comprehensive numerical analysis of the model's results on the dataset.
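
To make the CNN-plus-recurrent-decoder pattern described above concrete, the following is a minimal sketch of how such a model might emit a caption one token at a time. It assumes greedy decoding, which the abstract does not specify. The names `greedy_caption` and `toy_decoder`, the token strings, and the scores are all hypothetical stand-ins for illustration, not the authors' model or data.

```python
from typing import Callable, Dict, List, Sequence

START, END = "<start>", "<end>"  # hypothetical sentence boundary markers


def greedy_caption(
    image_features: Sequence[float],
    next_token_scores: Callable[[Sequence[float], List[str]], Dict[str, float]],
    max_len: int = 20,
) -> List[str]:
    """Build a caption token by token, always taking the top-scoring token.

    `image_features` stands in for the CNN encoder output;
    `next_token_scores` stands in for one step of the recurrent decoder.
    """
    tokens = [START]
    for _ in range(max_len):
        scores = next_token_scores(image_features, tokens)
        best = max(scores, key=scores.get)  # greedy choice (an assumption)
        if best == END:
            break
        tokens.append(best)
    return tokens[1:]  # drop the start marker


def toy_decoder(image_features: Sequence[float], tokens: List[str]) -> Dict[str, float]:
    """Toy decoder: a fixed lookup on the last token, ignoring the image.

    A trained LSTM would condition on both the image features and the
    full token history; this table only makes the loop runnable.
    """
    table = {
        START: {"a": 0.9, "child": 0.1},
        "a": {"child": 0.8, END: 0.2},
        "child": {"plays": 0.7, END: 0.3},
        "plays": {END: 1.0},
    }
    return table[tokens[-1]]


caption = greedy_caption([0.0, 1.0], toy_decoder)
```

In a real system the lookup table would be replaced by a trained decoder, and beam search is a common alternative to the greedy choice made here.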

