Fig. 17
From: Time series transformer for tourism demand forecasting

Visualization of the encoder–decoder attention weight matrix in each decoder layer demonstrates that the proposed Tsformer has learned rich features and that it uses the previous five days and seasonal features for 1-day-ahead forecasting.
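
As a rough illustration of what such a matrix contains, the sketch below computes scaled dot-product encoder–decoder attention weights with NumPy. It is not the authors' implementation: the function name `cross_attention_weights`, the 30-day input window, the model width of 16, and the random states are all assumptions chosen only to show the shape of the matrix and how the most heavily weighted input days could be read off it.

```python
import numpy as np

def cross_attention_weights(queries, keys):
    """Scaled dot-product attention weights.

    queries: (n_decoder_steps, d_k) decoder-side vectors
    keys:    (n_encoder_steps, d_k) encoder-side vectors
    Returns a (n_decoder_steps, n_encoder_steps) matrix whose rows sum to 1.
    """
    d_k = queries.shape[-1]
    scores = queries @ keys.T / np.sqrt(d_k)
    # Subtract the row maximum before exponentiating for numerical stability.
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    return weights / weights.sum(axis=-1, keepdims=True)

# Hypothetical sizes and random states, for illustration only.
rng = np.random.default_rng(0)
encoder_states = rng.normal(size=(30, 16))  # assumed: 30 past days, width 16
decoder_states = rng.normal(size=(1, 16))   # assumed: one query for a 1-day-ahead forecast

weights = cross_attention_weights(decoder_states, encoder_states)

# Indices of the five encoder positions that receive the largest weights.
top_days = np.argsort(weights[0])[::-1][:5]
```

Rendering `weights` as a heatmap, one per decoder layer, would give a plot of the same kind as this figure. With a trained model, the columns with the largest weights indicate which input days the forecast relies on.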