Rapid and robust endoscopic content area estimation: A lean GPU-based pipeline and curated benchmark dataset

by   Charlie Budd, et al.

Endoscopic content area refers to the informative area enclosed by the dark, non-informative, border regions present in most endoscopic footage. The estimation of the content area is a common task in endoscopic image processing and computer vision pipelines. Despite the apparent simplicity of the problem, several factors make reliable real-time estimation surprisingly challenging. The lack of rigorous investigation into the topic combined with the lack of a common benchmark dataset for this task has been a long-lasting issue in the field. In this paper, we propose two variants of a lean GPU-based computational pipeline combining edge detection and circle fitting. The two variants differ by relying on handcrafted features, and learned features respectively to extract content area edge point candidates. We also present a first-of-its-kind dataset of manually annotated and pseudo-labelled content areas across a range of surgical indications. To encourage further developments, the curated dataset, and an implementation of both algorithms, has been made public (https://doi.org/10.7303/syn32148000, https://github.com/charliebudd/torch-content-area). We compare our proposed algorithm with a state-of-the-art U-Net-based approach and demonstrate significant improvement in terms of both accuracy (Hausdorff distance: 6.3 px versus 118.1 px) and computational time (Average runtime per frame: 0.13 ms versus 11.2 ms).


page 2

page 4

page 6

page 13


A New Benchmark Dataset for Texture Image Analysis and Surface Defect Detection

Texture analysis plays an important role in many image processing applic...

Image Matching across Wide Baselines: From Paper to Practice

We introduce a comprehensive benchmark for local features and robust est...

SurgeonAssist-Net: Towards Context-Aware Head-Mounted Display-Based Augmented Reality for Surgical Guidance

We present SurgeonAssist-Net: a lightweight framework making action-and-...

Learning to Read Analog Gauges from Synthetic Data

Manually reading and logging gauge data is time inefficient, and the eff...

PL-VINS: Real-Time Monocular Visual-Inertial SLAM with Point and Line Features

Leveraging line features to improve localization accuracy of point-based...

The Filament Sensor for Near Real-Time Detection of Cytoskeletal Fiber Structures

A reliable extraction of filament data from microscopic images is of hig...

Please sign up or login with your details

Forgot password? Click here to reset