Deep Image Clustering with Tensor Kernels and Unsupervised Companion Objectives

01/20/2020
by Daniel J. Trosten, et al.

In this paper, we develop a new model for deep image clustering, using convolutional neural networks and tensor kernels. The proposed Deep Tensor Kernel Clustering (DTKC) model consists of a convolutional neural network (CNN) that is trained to reflect a common cluster structure at the outputs of its intermediate layers. Encouraging a consistent cluster structure throughout the network has the potential to guide it towards meaningful clusters, even though these clusters might appear to be nonlinear in the input space. The cluster structure is enforced through unsupervised companion objectives, where separate loss functions are attached to layers in the network. These unsupervised companion objectives are constructed based on a proposed generalization of the Cauchy-Schwarz (CS) divergence, from vectors to tensors of arbitrary rank. Generalizing the CS divergence to tensor-valued data is a crucial step, due to the tensorial nature of the intermediate representations in the CNN. Several experiments are conducted to thoroughly assess the performance of the proposed DTKC model. The results indicate that the model outperforms, or performs comparably to, a wide range of baseline algorithms. We also empirically demonstrate that our model does not suffer from objective function mismatch, which can be a problematic artifact in autoencoder-based clustering models.
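
To make the idea of a CS divergence on tensor-valued data more concrete, the following is a minimal sketch and not the authors' construction. It assumes the standard kernel (Parzen-window) estimate of the CS divergence between two sample sets, and it assumes a Gaussian kernel on the Frobenius distance between tensors as a stand-in tensor kernel. The function names `gaussian_tensor_kernel` and `cs_divergence` and the bandwidth `sigma` are hypothetical choices made for illustration only.

```python
import numpy as np


def gaussian_tensor_kernel(A, B, sigma=1.0):
    """Gaussian kernel matrix between two batches of tensors.

    A has shape (n, ...) and B has shape (m, ...), with matching trailing
    dimensions. The distance is the Frobenius norm of the difference,
    which is an assumption of this sketch, not the paper's kernel.
    """
    a = A.reshape(len(A), -1)
    b = B.reshape(len(B), -1)
    # Squared Frobenius distances via ||a||^2 + ||b||^2 - 2 <a, b>.
    sq = (a ** 2).sum(axis=1)[:, None] + (b ** 2).sum(axis=1)[None, :] - 2.0 * a @ b.T
    sq = np.maximum(sq, 0.0)  # guard against tiny negative round-off
    return np.exp(-sq / (2.0 * sigma ** 2))


def cs_divergence(A, B, sigma=1.0):
    """Kernel estimate of the CS divergence between two sample sets.

    D_CS = -log( mean(K_AB) / sqrt(mean(K_AA) * mean(K_BB)) )
    The Gaussian normalization constants cancel in the ratio.
    """
    k_ab = gaussian_tensor_kernel(A, B, sigma).mean()
    k_aa = gaussian_tensor_kernel(A, A, sigma).mean()
    k_bb = gaussian_tensor_kernel(B, B, sigma).mean()
    return -np.log(k_ab / np.sqrt(k_aa * k_bb))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Two hypothetical "clusters" of rank-3 feature maps (height, width, channels).
    cluster_a = rng.normal(size=(20, 4, 4, 3))
    cluster_b = rng.normal(size=(20, 4, 4, 3)) + 0.5
    print("same cluster:     ", cs_divergence(cluster_a, cluster_a, sigma=5.0))
    print("different cluster:", cs_divergence(cluster_a, cluster_b, sigma=5.0))
```

In a clustering setting one would want well-separated clusters to have a large divergence. With the Frobenius distance used here, the result is identical to flattening each tensor into a vector, so this sketch illustrates only the divergence estimate itself and not the benefit of a kernel designed for tensors of arbitrary rank.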

