Neural encoding and interpretation for high-level visual cortices based on fMRI using image caption features

03/26/2020
by   Kai Qiao, et al.
1

On basis of functional magnetic resonance imaging (fMRI), researchers are devoted to designing visual encoding models to predict the neuron activity of human in response to presented image stimuli and analyze inner mechanism of human visual cortices. Deep network structure composed of hierarchical processing layers forms deep network models by learning features of data on specific task through big dataset. Deep network models have powerful and hierarchical representation of data, and have brought about breakthroughs for visual encoding, while revealing hierarchical structural similarity with the manner of information processing in human visual cortices. However, previous studies almost used image features of those deep network models pre-trained on classification task to construct visual encoding models. Except for deep network structure, the task or corresponding big dataset is also important for deep network models, but neglected by previous studies. Because image classification is a relatively fundamental task, it is difficult to guide deep network models to master high-level semantic representations of data, which causes into that encoding performance for high-level visual cortices is limited. In this study, we introduced one higher-level vision task: image caption (IC) task and proposed the visual encoding model based on IC features (ICFVEM) to encode voxels of high-level visual cortices. Experiment demonstrated that ICFVEM obtained better encoding performance than previous deep network models pre-trained on classification task. In addition, the interpretation of voxels was realized to explore the detailed characteristics of voxels based on the visualization of semantic words, and comparative analysis implied that high-level visual cortices behaved the correlative representation of image content.

READ FULL TEXT

page 6

page 12

page 13

page 14

page 15

research
07/27/2019

Effective and efficient ROI-wise visual encoding using an end-to-end CNN regression model and selective optimization

Recently, visual encoding based on functional magnetic resonance imaging...
research
01/02/2018

Accurate reconstruction of image stimuli from human fMRI based on the decoding model with capsule network architecture

In neuroscience, all kinds of computation models were designed to answer...
research
02/23/2019

A visual encoding model based on deep neural networks and transfer learning

Background: Building visual encoding models to accurately predict visual...
research
03/24/2023

MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion

Reconstructing visual stimuli from measured functional magnetic resonanc...
research
10/01/2020

Neural encoding with visual attention

Visual perception is critically influenced by the focus of attention. Du...
research
11/17/2017

Learning a Robust Representation via a Deep Network on Symmetric Positive Definite Manifolds

Recent studies have shown that aggregating convolutional features of a p...
research
02/27/2015

Hybrid coding of visual content and local image features

Distributed visual analysis applications, such as mobile visual search o...

Please sign up or login with your details

Forgot password? Click here to reset