SwinVRNN: A Data-Driven Ensemble Forecasting Model via Learned Distribution Perturbation

by   Yuan Hu, et al.

Data-driven approaches for medium-range weather forecasting are recently shown extraordinarily promising for ensemble forecasting for their fast inference speed compared to traditional numerical weather prediction (NWP) models, but their forecast accuracy can hardly match the state-of-the-art operational ECMWF Integrated Forecasting System (IFS) model. Previous data-driven attempts achieve ensemble forecast using some simple perturbation methods, like initial condition perturbation and Monte Carlo dropout. However, they mostly suffer unsatisfactory ensemble performance, which is arguably attributed to the sub-optimal ways of applying perturbation. We propose a Swin Transformer-based Variational Recurrent Neural Network (SwinVRNN), which is a stochastic weather forecasting model combining a SwinRNN predictor with a perturbation module. SwinRNN is designed as a Swin Transformer-based recurrent neural network, which predicts future states deterministically. Furthermore, to model the stochasticity in prediction, we design a perturbation module following the Variational Auto-Encoder paradigm to learn multivariate Gaussian distributions of a time-variant stochastic latent variable from data. Ensemble forecasting can be easily achieved by perturbing the model features leveraging noise sampled from the learned distribution. We also compare four categories of perturbation methods for ensemble forecasting, i.e. fixed distribution perturbation, learned distribution perturbation, MC dropout, and multi model ensemble. Comparisons on WeatherBench dataset show the learned distribution perturbation method using our SwinVRNN model achieves superior forecast accuracy and reasonable ensemble spread due to joint optimization of the two targets. More notably, SwinVRNN surpasses operational IFS on surface variables of 2-m temperature and 6-hourly total precipitation at all lead times up to five days.


page 12

page 15


SwinRDM: Integrate SwinRNN with Diffusion Model towards High-Resolution and High-Quality Weather Forecasting

Data-driven medium-range weather forecasting has attracted much attentio...

FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators

FourCastNet, short for Fourier Forecasting Neural Network, is a global d...

Variational Data Assimilation with a Learned Inverse Observation Operator

Variational data assimilation optimizes for an initial state of a dynami...

WeatherBench Probability: A benchmark dataset for probabilistic medium-range weather forecasting along with deep learning baseline models

WeatherBench is a benchmark dataset for medium-range weather forecasting...

FengWu: Pushing the Skillful Global Medium-range Weather Forecast beyond 10 Days Lead

We present FengWu, an advanced data-driven global medium-range weather f...

Statistical and machine learning ensemble modelling to forecast sea surface temperature

In situ and remotely sensed observations have huge potential to develop ...

Integrating Recurrent Neural Networks with Data Assimilation for Scalable Data-Driven State Estimation

Data assimilation (DA) is integrated with machine learning in order to p...

Please sign up or login with your details

Forgot password? Click here to reset