Linguistic Approach to Time Series Forecasting

by   Dmytro Lande, et al.

This paper proposes methods of predicting dynamic time series (including non-stationary ones) based on a linguistic approach, namely, the study of occurrences and repetition of so-called N-grams. This approach is used in computational linguistics to create statistical translators, detect plagiarism and duplicate documents. However, the scope of application can be extended beyond linguistics by taking into account the correlations of sequences of stable word combinations, as well as trends. The proposed methods do not require a preliminary study and determination of the characteristics of time series or complex tuning of the input parameters of the forecasting model. They allow, with a high level of automation, to carry out short-term and medium-term forecasts of time series, characterized by trends and cyclicality, in particular, series of publication dynamics in content monitoring systems. Also, the proposed methods can be used to predict the values of the parameters of a large complex system with the aim of monitoring its state, when the number of such parameters is significant, and therefore a high level of automation of the forecasting process is desirable. A significant advantage of the approach is the absence of requirements for time series stationarity and a small number of tuning parameters. Further research may focus on the study of various criteria for the similarity of time series fragments, the use of nonlinear similarity criteria, the search for ways to automatically determine the rational step of quantization of the time series.


page 1

page 2

page 3

page 4


Parallel Extraction of Long-term Trends and Short-term Fluctuation Framework for Multivariate Time Series Forecasting

Multivariate time series forecasting is widely used in various fields. R...

Review of automated time series forecasting pipelines

Time series forecasting is fundamental for various use cases in differen...

N-BEATS neural network for mid-term electricity load forecasting

We address the mid-term electricity load forecasting (MTLF) problem. Thi...

Critical states in Political Trends. How much reliable is a poll on Twitter? A study by means of the Potts Model

In recent years, Twitter data related to political trends have tentative...

Model Monitoring and Dynamic Model Selection in Travel Time-series Forecasting

Accurate travel products price forecasting is a highly desired feature t...

Pattern Similarity-based Machine Learning Methods for Mid-term Load Forecasting: A Comparative Study

Pattern similarity-based methods are widely used in classification and r...

Please sign up or login with your details

Forgot password? Click here to reset