"The Squawk Bot": Joint Learning of Time Series and Text Data Modalities for Automated Financial Information Filtering

by   Xuan-Hong Dang, et al.

Multimodal analysis that uses numerical time series and textual corpora as input data sources is becoming a promising approach, especially in the financial industry. However, the main focus of such analysis has been on achieving high prediction accuracy while little effort has been spent on the important task of understanding the association between the two data modalities. Performance on the time series hence receives little explanation though human-understandable textual information is available. In this work, we address the problem of given a numerical time series, and a general corpus of textual stories collected in the same period of the time series, the task is to timely discover a succinct set of textual stories associated with that time series. Towards this goal, we propose a novel multi-modal neural model called MSIN that jointly learns both numerical time series and categorical text articles in order to unearth the association between them. Through multiple steps of data interrelation between the two data modalities, MSIN learns to focus on a small subset of text articles that best align with the performance in the time series. This succinct set is timely discovered and presented as recommended documents, acting as automated information filtering, for the given time series. We empirically evaluate the performance of our model on discovering relevant news articles for two stock time series from Apple and Google companies, along with the daily news articles collected from the Thomson Reuters over a period of seven consecutive years. The experimental results demonstrate that MSIN achieves up to 84.9 truth articles respectively to the two examined time series, far more superior to state-of-the-art algorithms that rely on conventional attention mechanism in deep learning.


page 1

page 2

page 3

page 4


A Stochastic Time Series Model for Predicting Financial Trends using NLP

Stock price forecasting is a highly complex and vitally important field ...

Human-like Time Series Summaries via Trend Utility Estimation

In many scenarios, humans prefer a text-based representation of quantita...

Textual Data for Time Series Forecasting

While ubiquitous, textual sources of information such as company reports...

Incorporating Pre-trained Model Prompting in Multimodal Stock Volume Movement Prediction

Multimodal stock trading volume movement prediction with stock-related n...

Upgrading the Newsroom: An Automated Image Selection System for News Articles

We propose an automated image selection system to assist photo editors i...

The R package sentometrics to compute, aggregate and predict with textual sentiment

We provide a hands-on introduction to optimized textual sentiment indexa...

History Playground: A Tool for Discovering Temporal Trends in Massive Textual Corpora

Recent studies have shown that macroscopic patterns of continuity and ch...

Please sign up or login with your details

Forgot password? Click here to reset