A Multi-Stage Multi-Task Neural Network for Aerial Scene Interpretation and Geolocalization

04/04/2018
by   Alina Marcu, et al.

Semantic segmentation and vision-based geolocalization in aerial images are challenging tasks in computer vision. With the advent of deep convolutional networks and the availability of relatively low-cost UAVs, they are attracting growing attention in the field. We propose a novel multi-task, multi-stage neural network that handles both problems at the same time, in a single forward pass. The first stage of our network predicts pixelwise class labels, while the second stage provides a precise location using two branches: one is a regression network, and the other predicts a location map and is trained as a segmentation task. Structurally, our architecture uses encoder-decoder modules at each stage, re-using the same encoder structure. Furthermore, its size is limited so that it remains tractable on an embedded GPU. We achieve commercial GPS-level localization accuracy from satellite images with a spatial resolution of 1 square meter per pixel in a city-wide area of interest. On the task of semantic segmentation, we obtain state-of-the-art results on two challenging datasets, the Inria Aerial Image Labeling dataset and Massachusetts Buildings.
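To make the idea of a location map trained as a segmentation task more concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not say how the location map is built or decoded. The sketch assumes the target is a binary disk around the true position and that a position is recovered as the centroid of the map. The function names (`make_location_map`, `decode_location_map`, `localization_error_m`), the disk radius, and the 64 x 64 map size are all hypothetical. The 1 meter per pixel scale is taken to be the side length implied by the stated resolution.

```python
import math

def make_location_map(height, width, cx, cy, radius):
    """Build a binary map with a filled disk of `radius` pixels at (cx, cy).

    Assumption: a disk-shaped target; the source does not specify the shape.
    """
    return [
        [1 if (x - cx) ** 2 + (y - cy) ** 2 <= radius ** 2 else 0
         for x in range(width)]
        for y in range(height)
    ]

def decode_location_map(location_map):
    """Return the centroid (x, y) of the map weights, or None if the map is empty.

    Assumption: centroid decoding; other decoders (e.g. argmax) are possible.
    """
    total = sum_x = sum_y = 0.0
    for y, row in enumerate(location_map):
        for x, value in enumerate(row):
            total += value
            sum_x += value * x
            sum_y += value * y
    if total == 0:
        return None
    return sum_x / total, sum_y / total

def localization_error_m(pred_xy, true_xy, meters_per_pixel=1.0):
    """Euclidean distance between two pixel positions, converted to meters."""
    dx = pred_xy[0] - true_xy[0]
    dy = pred_xy[1] - true_xy[1]
    return math.hypot(dx, dy) * meters_per_pixel

# Hypothetical example: a 64 x 64 map with the true position at pixel (20, 40).
target = make_location_map(64, 64, cx=20, cy=40, radius=3)
estimate = decode_location_map(target)
error = localization_error_m(estimate, (20, 40))
```

In a trained network the decoded map would be a predicted probability map rather than the ground-truth target, and the regression branch would output coordinates directly; how the two branch outputs are combined is not described in the abstract and is left out here.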


research
11/15/2017

Squeeze-SegNet: A new fast Deep Convolutional Neural Network for Semantic Segmentation

The recent researches in Deep Convolutional Neural Network have focused ...
research
03/20/2021

A Novel Upsampling and Context Convolution for Image Semantic Segmentation

Semantic segmentation, which refers to pixel-wise classification of an i...
research
11/21/2020

Height Prediction and Refinement from Aerial Images with Semantic and Geometric Guidance

Deep learning provides a powerful new approach to many computer vision t...
research
04/21/2020

The 1st Agriculture-Vision Challenge: Methods and Results

The first Agriculture-Vision Challenge aims to encourage research in dev...
research
03/08/2022

Stage-Aware Feature Alignment Network for Real-Time Semantic Segmentation of Street Scenes

Over the past few years, deep convolutional neural network-based methods...
research
12/04/2018

PolyMapper: Extracting City Maps using Polygons

We propose a method to leapfrog pixel-wise, semantic segmentation of (ae...
research
08/23/2018

Deep multi-task learning for a geographically-regularized semantic segmentation of aerial images

When approaching the semantic segmentation of overhead imagery in the de...
