Nutrition5k: Towards Automatic Nutritional Understanding of Generic Food

03/04/2021
by Quin Thames, et al.
Understanding the nutritional content of food from visual data is a challenging computer vision problem, with the potential to have a positive and widespread impact on public health. Studies in this area are limited to existing datasets that lack the diversity or the labels required to train models capable of nutritional understanding. We introduce Nutrition5k, a novel dataset of 5k diverse, real-world food dishes with corresponding video streams, depth images, component weights, and high-accuracy nutritional content annotations. We demonstrate the potential of this dataset by training a computer vision algorithm capable of predicting the caloric and macronutrient values of a complex, real-world dish at an accuracy that outperforms professional nutritionists. Further, we present a baseline for incorporating depth sensor data to improve nutrition predictions. We will publicly release Nutrition5k in the hope that it will accelerate innovation in the space of nutritional understanding.
