Rotation and Translation Invariant Representation Learning with Implicit Neural Representations

04/27/2023
by Sehyun Kwon, et al.

In many computer vision applications, images are acquired with arbitrary or random rotations and translations, and in such setups it is desirable to obtain semantic representations disentangled from the image orientation. Examples of such applications include semiconductor wafer defect inspection, plankton microscope images, and inference on single-particle cryo-electron microscopy (cryo-EM) micrographs. In this work, we propose Invariant Representation Learning with Implicit Neural Representation (IRL-INR), which uses an implicit neural representation (INR) with a hypernetwork to obtain semantic representations disentangled from the orientation of the image. We show that IRL-INR can effectively learn disentangled semantic representations on more complex images than those considered in prior work, and that these semantic representations synergize well with SCAN to produce state-of-the-art unsupervised clustering results.
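To make the idea of an INR driven by a hypernetwork more concrete, the following is a minimal sketch of one plausible reading of the abstract, not the authors' architecture. A semantic code z is mapped to the weights of a small coordinate network, and pose enters only through the coordinates at which that network is evaluated. The names HyperINR, make_grid, and transform, the linear hypernetwork, the sine activation, and all sizes are assumptions made for illustration; the encoder and training losses are omitted.

```python
import math
import random

def make_grid(n):
    """Pixel-centre coordinates in [-1, 1]^2 for an n x n image (row-major)."""
    axis = [-1 + (2 * i + 1) / n for i in range(n)]
    return [(x, y) for y in axis for x in axis]

def transform(coords, theta, tx, ty):
    """Rotate each coordinate by theta (radians), then translate by (tx, ty)."""
    c, s = math.cos(theta), math.sin(theta)
    return [(c * x - s * y + tx, s * x + c * y + ty) for x, y in coords]

class HyperINR:
    """Toy hypernetwork (assumed linear) that emits weights of a 2-hidden-1 coordinate MLP."""

    def __init__(self, z_dim, hidden, seed=0):
        rng = random.Random(seed)
        self.hidden = hidden
        n_params = 4 * hidden + 1  # W1 (hidden x 2), b1, W2, b2
        self.H = [[rng.gauss(0, 1) for _ in range(z_dim)] for _ in range(n_params)]

    def weights(self, z):
        """Map the semantic code z to the INR parameters."""
        flat = [sum(h * zi for h, zi in zip(row, z)) for row in self.H]
        h = self.hidden
        W1 = [(flat[2 * i], flat[2 * i + 1]) for i in range(h)]
        b1 = flat[2 * h:3 * h]
        W2 = flat[3 * h:4 * h]
        b2 = flat[4 * h]
        return W1, b1, W2, b2

    def render(self, z, coords):
        """Evaluate the INR defined by z at every coordinate."""
        W1, b1, W2, b2 = self.weights(z)
        out = []
        for x, y in coords:
            hid = [math.sin(wx * x + wy * y + b) for (wx, wy), b in zip(W1, b1)]
            out.append(sum(w * a for w, a in zip(W2, hid)) + b2)
        return out

grid = make_grid(8)
model = HyperINR(z_dim=4, hidden=16)
z = [0.3, -1.2, 0.5, 0.8]  # hypothetical semantic code

# Same content code, two poses: only the sampling coordinates change.
canonical = model.render(z, grid)
rotated = model.render(z, transform(grid, math.pi / 2, 0.0, 0.0))
```

In this sketch the rotated rendering contains the same pixel values as the canonical one, rearranged on the grid, because a quarter-turn maps the symmetric grid onto itself. The content lives in z while the pose lives in the coordinate transform, which is the kind of separation the abstract describes.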

research
10/24/2022

Unsupervised Object Representation Learning using Translation and Rotation Group Equivariant VAE

In many imaging modalities, objects of interest can occur in a variety o...
research
08/26/2020

Orientation-Disentangled Unsupervised Representation Learning for Computational Pathology

Unsupervised learning enables modeling complex images without the need f...
research
08/08/2018

Towards Learning Fine-Grained Disentangled Representations from Speech

Learning disentangled representations of high-dimensional data is curren...
research
08/26/2021

A Tutorial on Learning Disentangled Representations in the Imaging Domain

Disentangled representation learning has been proposed as an approach to...
research
11/19/2015

Binding via Reconstruction Clustering

Disentangled distributed representations of data are desirable for machi...
research
11/18/2019

Unsupervised Representation Learning by Discovering Reliable Image Relations

Learning robust representations that allow to reliably establish relatio...
research
03/14/2022

Disentangled Representation Learning for Text-Video Retrieval

Cross-modality interaction is a critical component in Text-Video Retriev...
