Real-time automatic polyp detection in colonoscopy using feature enhancement module and spatiotemporal similarity correlation unit

01/25/2022
by   Jianwei Xu, et al.
9

Automatic detection of polyps is challenging because different polyps vary greatly, while the changes between polyps and their analogues are small. The state-of-the-art methods are based on convolutional neural networks (CNNs). However, they may fail due to lack of training data, resulting in high rates of missed detection and false positives (FPs). In order to solve these problems, our method combines the two-dimensional (2-D) CNN-based real-time object detector network with spatiotemporal information. Firstly, we use a 2-D detector network to detect static images and frames, and based on the detector network, we propose two feature enhancement modules-the FP Relearning Module (FPRM) to make the detector network learning more about the features of FPs for higher precision, and the Image Style Transfer Module (ISTM) to enhance the features of polyps for sensitivity improvement. In video detection, we integrate spatiotemporal information, which uses Structural Similarity (SSIM) to measure the similarity between video frames. Finally, we propose the Inter-frame Similarity Correlation Unit (ISCU) to combine the results obtained by the detector network and frame similarity to make the final decision. We verify our method on both private databases and publicly available databases. Experimental results show that these modules and units provide a performance improvement compared with the baseline method. Comparison with the state-of-the-art methods shows that the proposed method outperforms the existing ones which can meet real-time constraints. It's demonstrated that our method provides a performance improvement in sensitivity, precision and specificity, and has great potential to be applied in clinical colonoscopy.

READ FULL TEXT

page 5

page 8

page 9

page 10

page 12

page 20

page 23

research
03/10/2023

Accurate Real-time Polyp Detection in Videos from Concatenation of Latent Features Extracted from Consecutive Frames

An efficient deep learning model that can be implemented in real-time fo...
research
10/25/2022

End-to-end Transformer for Compressed Video Quality Enhancement

Convolutional neural networks have achieved excellent results in compres...
research
09/15/2017

Feature-Fused SSD: Fast Detection for Small Objects

Small objects detection is a challenging task in computer vision due to ...
research
02/26/2019

MFQE 2.0: A New Approach for Multi-frame Quality Enhancement on Compressed Video

The past few years have witnessed great success in applying deep learnin...
research
09/11/2018

CNN-Based Signal Detection for Banded Linear Systems

Banded linear systems arise in many communication scenarios, e.g., those...
research
08/13/2023

FastLLVE: Real-Time Low-Light Video Enhancement with Intensity-Aware Lookup Table

Low-Light Video Enhancement (LLVE) has received considerable attention i...
research
10/18/2021

A Lightweight and Accurate Recognition Framework for Signs of X-ray Weld Images

X-ray images are commonly used to ensure the security of devices in qual...

Please sign up or login with your details

Forgot password? Click here to reset