The challenge of representation learning: Improved accuracy in deep vision models does not come with better predictions of perceptual similarity

03/13/2023
by   Fritz Günther, et al.
0

Over the last years, advancements in deep learning models for computer vision have led to a dramatic improvement in their image classification accuracy. However, models with a higher accuracy in the task they were trained on do not necessarily develop better image representations that allow them to also perform better in other tasks they were not trained on. In order to investigate the representation learning capabilities of prominent high-performing computer vision models, we investigated how well they capture various indices of perceptual similarity from large-scale behavioral datasets. We find that higher image classification accuracy rates are not associated with a better performance on these datasets, and in fact we observe no improvement in performance since GoogLeNet (released 2015) and VGG-M (released 2014). We speculate that more accurate classification may result from hyper-engineering towards very fine-grained distinctions between highly similar classes, which does not incentivize the models to capture overall perceptual similarities.

READ FULL TEXT
research
05/23/2018

Do Better ImageNet Models Transfer Better?

Transfer learning has become a cornerstone of computer vision with the a...
research
06/01/2022

Needle In A Haystack, Fast: Benchmarking Image Perceptual Similarity Metrics At Scale

The advent of the internet, followed shortly by the social media made it...
research
12/30/2022

Improving Visual Representation Learning through Perceptual Understanding

We present an extension to masked autoencoders (MAE) which improves on t...
research
01/12/2020

Bag of Tricks for Retail Product Image Classification

Retail Product Image Classification is an important Computer Vision and ...
research
05/26/2023

Image Quality Is Not All You Want: Task-Driven Lens Design for Image Classification

In computer vision, it has long been taken for granted that high-quality...
research
08/05/2021

Evaluating CLIP: Towards Characterization of Broader Capabilities and Downstream Implications

Recently, there have been breakthroughs in computer vision ("CV") models...
research
04/07/2017

Deep Unsupervised Similarity Learning using Partially Ordered Sets

Unsupervised learning of visual similarities is of paramount importance ...

Please sign up or login with your details

Forgot password? Click here to reset