HiFormer: Hierarchical Multi-scale Representations Using Transformers for Medical Image Segmentation

07/18/2022
by   Moein Heidari, et al.
0

Convolutional neural networks (CNNs) have been the consensus for medical image segmentation tasks. However, they suffer from the limitation in modeling long-range dependencies and spatial correlations due to the nature of convolution operation. Although transformers were first developed to address this issue, they fail to capture low-level features. In contrast, it is demonstrated that both local and global features are crucial for dense prediction, such as segmenting in challenging contexts. In this paper, we propose HiFormer, a novel method that efficiently bridges a CNN and a transformer for medical image segmentation. Specifically, we design two multi-scale feature representations using the seminal Swin Transformer module and a CNN-based encoder. To secure a fine fusion of global and local features obtained from the two aforementioned representations, we propose a Double-Level Fusion (DLF) module in the skip connection of the encoder-decoder structure. Extensive experiments on various medical image segmentation datasets demonstrate the effectiveness of HiFormer over other CNN-based, transformer-based, and hybrid methods in terms of computational complexity, and quantitative and qualitative results. Our code is publicly available at: https://github.com/amirhossein-kz/HiFormer

READ FULL TEXT

page 3

page 6

page 8

research
07/19/2021

LeViT-UNet: Make Faster Encoders with Transformer for Medical Image Segmentation

Medical image segmentation plays an essential role in developing compute...
research
07/27/2022

TransNorm: Transformer Provides a Strong Spatial Normalization Mechanism for a Deep Segmentation Model

In the past few years, convolutional neural networks (CNNs), particularl...
research
06/29/2022

C2FTrans: Coarse-to-Fine Transformers for Medical Image Segmentation

Convolutional neural networks (CNN), the most prevailing architecture fo...
research
10/13/2022

ConvTransSeg: A Multi-resolution Convolution-Transformer Network for Medical Image Segmentation

Convolutional neural networks (CNNs) achieved the state-of-the-art perfo...
research
07/27/2023

MCPA: Multi-scale Cross Perceptron Attention Network for 2D Medical Image Segmentation

The UNet architecture, based on Convolutional Neural Networks (CNN), has...
research
03/09/2022

PHTrans: Parallelly Aggregating Global and Local Representations for Medical Image Segmentation

The success of Transformer in computer vision has attracted increasing a...
research
01/26/2022

Class-Aware Generative Adversarial Transformers for Medical Image Segmentation

Transformers have made remarkable progress towards modeling long-range d...

Please sign up or login with your details

Forgot password? Click here to reset