Food Classification using Joint Representation of Visual and Textual Data

08/03/2023
by Prateek Mittal, et al.

Food classification is an important task in health care. In this work, we propose a multimodal classification framework that uses a modified version of EfficientNet with the Mish activation function for image classification and a traditional BERT transformer-based network for text classification. The proposed network and other state-of-the-art methods are evaluated on a large open-source dataset, UPMC Food-101. The experimental results show that the proposed network outperforms the other methods, with a significant difference of 11.57 when compared with the second-best performing method. We also compare performance in terms of accuracy, precision, and recall for text classification using both machine learning and deep learning-based models. The comparative analysis of the prediction results for both images and text demonstrates the efficiency and robustness of the proposed approach.
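The abstract names the Mish activation function without defining it. As a point of reference, Mish is commonly defined as mish(x) = x * tanh(softplus(x)), where softplus(x) = ln(1 + e^x). The following is a minimal scalar sketch of that standard definition only; the function names are illustrative, and it does not reproduce how the authors modified EfficientNet.

```python
import math


def softplus(x: float) -> float:
    """Numerically stable softplus: ln(1 + e^x)."""
    # max(x, 0) + log1p(exp(-|x|)) avoids overflow for large positive x.
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def mish(x: float) -> float:
    """Standard Mish activation: x * tanh(softplus(x))."""
    return x * math.tanh(softplus(x))


if __name__ == "__main__":
    # Print a few sample values to show the shape of the function.
    for value in (-5.0, -1.0, 0.0, 1.0, 5.0):
        print(f"mish({value:+.1f}) = {mish(value):+.6f}")
```

Mish behaves close to the identity for large positive inputs and approaches zero for large negative inputs, while staying smooth around zero. In a deep learning framework the same formula would typically be applied elementwise to tensors rather than to single floats.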


research
08/10/2023

Exploring Machine Learning and Transformer-based Approaches for Deceptive Text Classification: A Comparative Analysis

Deceptive text classification is a critical task in natural language pro...
research
05/30/2018

CuisineNet: Food Attributes Classification using Multi-scale Convolution Network

Diversity of food and its attributes represents the culinary habits of p...
research
11/09/2020

Bangla Text Classification using Transformers

Text classification has been one of the earliest problems in NLP. Over t...
research
05/23/2023

Connecting the Dots: What Graph-Based Text Representations Work Best for Text Classification using Graph Neural Networks?

Given the success of Graph Neural Networks (GNNs) for structure-aware ma...
research
03/22/2023

A Small-Scale Switch Transformer and NLP-based Model for Clinical Narratives Classification

In recent years, Transformer-based models such as the Switch Transformer...
research
06/05/2022

Performance Comparison of Simple Transformer and Res-CNN-BiLSTM for Cyberbullying Classification

The task of text classification using Bidirectional based LSTM architect...
research
11/08/2018

Doc2Im: document to image conversion through self-attentive embedding

Text classification is a fundamental task in NLP applications. Latest re...
