Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review

by Reza Azad, et al.

The remarkable performance of the Transformer architecture in natural language processing has recently triggered broad interest in computer vision as well. Among other merits, Transformers have been shown to be capable of learning long-range dependencies and spatial correlations, a clear advantage over convolutional neural networks (CNNs), which have so far been the de facto standard for computer vision problems. As a result, Transformers have become an integral part of modern medical image analysis. In this paper, we provide an encyclopedic review of the applications of Transformers in medical imaging. Specifically, we present a systematic and thorough review of relevant recent Transformer literature for different medical image analysis tasks, including classification, segmentation, detection, registration, synthesis, and clinical report generation. For each of these applications, we investigate the novelty, strengths, and weaknesses of the proposed strategies and develop taxonomies highlighting key properties and contributions. Further, where applicable, we outline current benchmarks on different datasets. Finally, we summarize key challenges and discuss future research directions. In addition, we have provided cited papers with their corresponding implementations in


Transformers in Medical Imaging: A Survey

Following unprecedented success on the natural language tasks, Transform...

Is attention all you need in medical image analysis? A review

Medical imaging is a key component in clinical diagnosis, treatment plan...

Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

Transformer, the latest technological advance of deep learning, has gain...

Recent Advances in the Applications of Convolutional Neural Networks to Medical Image Contour Detection

The fast growing deep learning technologies have become the main solutio...

Transformer-Based Visual Segmentation: A Survey

Visual segmentation seeks to partition images, video frames, or point cl...

Implicit Neural Representation in Medical Imaging: A Comparative Survey

Implicit neural representations (INRs) have gained prominence as a power...

A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges

Uncertainty quantification (UQ) plays a pivotal role in reduction of unc...
