A Tree Architecture of LSTM Networks for Sequential Regression with Missing Data

05/22/2020
by   S. Onur Sahin, et al.
0

We investigate regression for variable length sequential data containing missing samples and introduce a novel tree architecture based on the Long Short-Term Memory (LSTM) networks. In our architecture, we employ a variable number of LSTM networks, which use only the existing inputs in the sequence, in a tree-like architecture without any statistical assumptions or imputations on the missing data, unlike all the previous approaches. In particular, we incorporate the missingness information by selecting a subset of these LSTM networks based on "presence-pattern" of a certain number of previous inputs. From the mixture of experts perspective, we train different LSTM networks as our experts for various missingness patterns and then combine their outputs to generate the final prediction. We also provide the computational complexity analysis of the proposed architecture, which is in the same order of the complexity of the conventional LSTM architectures for the sequence length. Our method can be readily extended to similar structures such as GRUs, RNNs as remarked in the paper. In the experiments, we achieve significant performance improvements with respect to the state-of-the-art methods for the well-known financial and real life datasets.

READ FULL TEXT
research
09/23/2022

Image Classification using Sequence of Pixels

This study compares sequential image classification methods based on rec...
research
10/22/2019

An Efficient EKF Based Algorithm For LSTM-Based Online Learning

We investigate online nonlinear regression with long short term memory (...
research
09/17/2018

Learning of Multi-Context Models for Autonomous Underwater Vehicles

Multi-context model learning is crucial for marine robotics where severa...
research
07/12/2016

Recurrent Highway Networks

Many sequential processing tasks require complex nonlinear transition fu...
research
03/25/2022

A Hybrid Framework for Sequential Data Prediction with End-to-End Optimization

We investigate nonlinear prediction in an online setting and introduce a...
research
01/23/2014

Predicting Nearly As Well As the Optimal Twice Differentiable Regressor

We study nonlinear regression of real valued data in an individual seque...
research
08/18/2022

Sequence Prediction Under Missing Data : An RNN Approach Without Imputation

Missing data scenarios are very common in ML applications in general and...

Please sign up or login with your details

Forgot password? Click here to reset