PixelNet: Towards a General Pixel-level Architecture

09/21/2016
by   Aayush Bansal, et al.
0

We explore architectures for general pixel-level prediction problems, from low-level edge detection to mid-level surface normal estimation to high-level semantic segmentation. Convolutional predictors, such as the fully-convolutional network (FCN), have achieved remarkable success by exploiting the spatial redundancy of neighboring pixels through convolutional processing. Though computationally efficient, we point out that such approaches are not statistically efficient during learning precisely because spatial redundancy limits the information learned from neighboring pixels. We demonstrate that (1) stratified sampling allows us to add diversity during batch updates and (2) sampled multi-scale features allow us to explore more nonlinear predictors (multiple fully-connected layers followed by ReLU) that improve overall accuracy. Finally, our objective is to show how a architecture can get performance better than (or comparable to) the architectures designed for a particular task. Interestingly, our single architecture produces state-of-the-art results for semantic segmentation on PASCAL-Context, surface normal estimation on NYUDv2 dataset, and edge detection on BSDS without contextual post-processing.

READ FULL TEXT

page 1

page 4

page 6

page 10

research
02/21/2017

PixelNet: Representation of the pixels, by the pixels, and for the pixels

We explore design principles for general pixel-level prediction problems...
research
04/23/2022

Class Balanced PixelNet for Neurological Image Segmentation

In this paper, we propose an automatic brain tumor segmentation approach...
research
09/11/2017

One-Shot Learning for Semantic Segmentation

Low-shot learning methods for image classification support learning from...
research
02/21/2018

Semantic Segmentation Refinement by Monte Carlo Region Growing of High Confidence Detections

Despite recent improvements using fully convolutional networks, in gener...
research
09/20/2019

Weakly Supervised Semantic Segmentation Using Constrained Dominant Sets

The availability of large-scale data sets is an essential pre-requisite ...
research
08/15/2019

A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning

Detecting scene text of arbitrary shapes has been a challenging task ove...
research
05/12/2020

Probabilistic Semantic Segmentation Refinement by Monte Carlo Region Growing

Semantic segmentation with fine-grained pixel-level accuracy is a fundam...

Please sign up or login with your details

Forgot password? Click here to reset