Fully Convolutional Adaptation Networks for Semantic Segmentation

04/23/2018
by   Yiheng Zhang, et al.
0

The recent advances in deep neural networks have convincingly demonstrated high capability in learning vision models on large datasets. Nevertheless, collecting expert labeled datasets especially with pixel-level annotations is an extremely expensive process. An appealing alternative is to render synthetic data (e.g., computer games) and generate ground truth automatically. However, simply applying the models learnt on synthetic images may lead to high generalization error on real images due to domain shift. In this paper, we facilitate this issue from the perspectives of both visual appearance-level and representation-level domain adaptation. The former adapts source-domain images to appear as if drawn from the "style" in the target domain and the latter attempts to learn domain-invariant representations. Specifically, we present Fully Convolutional Adaptation Networks (FCAN), a novel deep architecture for semantic segmentation which combines Appearance Adaptation Networks (AAN) and Representation Adaptation Networks (RAN). AAN learns a transformation from one domain to the other in the pixel space and RAN is optimized in an adversarial learning manner to maximally fool the domain discriminator with the learnt source and target representations. Extensive experiments are conducted on the transfer from GTA5 (game videos) to Cityscapes (urban street scenes) on semantic segmentation and our proposal achieves superior results when comparing to state-of-the-art unsupervised adaptation techniques. More remarkably, we obtain a new record: mIoU of 47.5 unsupervised setting.

READ FULL TEXT

page 1

page 3

page 6

page 7

page 8

research
12/16/2016

Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks

Collecting well-annotated image datasets to train modern machine learnin...
research
04/19/2019

Weakly Supervised Adversarial Domain Adaptation for Semantic Segmentation in Urban Scenes

Semantic segmentation, a pixel-level vision task, is developed rapidly b...
research
06/11/2020

Transferring and Regularizing Prediction for Semantic Segmentation

Semantic segmentation often requires a large set of images with pixel-le...
research
12/08/2016

FCNs in the Wild: Pixel-level Adversarial and Constraint-based Adaptation

Fully convolutional models for dense prediction have proven successful f...
research
04/16/2018

DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation

Harvesting dense pixel-level annotations to train deep neural networks f...
research
09/04/2018

Penalizing Top Performers: Conservative Loss for Semantic Segmentation Adaptation

Due to the expensive and time-consuming annotations (e.g., segmentation)...
research
12/24/2018

A Curriculum Domain Adaptation Approach to the Semantic Segmentation of Urban Scenes

During the last half decade, convolutional neural networks (CNNs) have t...

Please sign up or login with your details

Forgot password? Click here to reset