Few-Shot Panoptic Segmentation With Foundation Models

09/19/2023
by   Markus Käppeler, et al.
0

Current state-of-the-art methods for panoptic segmentation require an immense amount of annotated training data that is both arduous and expensive to obtain posing a significant challenge for their widespread adoption. Concurrently, recent breakthroughs in visual representation learning have sparked a paradigm shift leading to the advent of large foundation models that can be trained with completely unlabeled images. In this work, we propose to leverage such task-agnostic image features to enable few-shot panoptic segmentation by presenting Segmenting Panoptic Information with Nearly 0 labels (SPINO). In detail, our method combines a DINOv2 backbone with lightweight network heads for semantic segmentation and boundary estimation. We show that our approach, albeit being trained with only ten annotated images, predicts high-quality pseudo-labels that can be used with any existing panoptic segmentation method. Notably, we demonstrate that SPINO achieves competitive results compared to fully supervised baselines while using less than 0.3 labels, paving the way for learning complex visual recognition tasks leveraging foundation models. To illustrate its general applicability, we further deploy SPINO on real-world robotic vision systems for both outdoor and indoor environments. To foster future research, we make the code and trained models publicly available at http://spino.cs.uni-freiburg.de.

READ FULL TEXT

page 1

page 3

page 5

research
03/18/2020

Semi-supervised few-shot learning for medical image segmentation

Recent years have witnessed the great progress of deep neural networks o...
research
04/20/2023

Text2Seg: Remote Sensing Image Semantic Segmentation via Text-Guided Visual Foundation Models

Recent advancements in foundation models (FMs), such as GPT-4 and LLaMA,...
research
07/24/2021

Personalized Image Semantic Segmentation

Semantic segmentation models trained on public datasets have achieved gr...
research
10/04/2021

Weak-shot Semantic Segmentation by Transferring Semantic Affinity and Boundary

Weakly-supervised semantic segmentation (WSSS) with image-level labels h...
research
06/18/2021

Towards Single Stage Weakly Supervised Semantic Segmentation

The costly process of obtaining semantic segmentation labels has driven ...
research
06/06/2023

Towards Label-free Scene Understanding by Vision Foundation Models

Vision foundation models such as Contrastive Vision-Language Pre-trainin...
research
09/22/2022

NamedMask: Distilling Segmenters from Complementary Foundation Models

The goal of this work is to segment and name regions of images without a...

Please sign up or login with your details

Forgot password? Click here to reset