A hybrid container throughput forecasting approach using bi-directional hinterland data of port

Zeng, Fangli; Xu, Shuojiang

doi:10.1038/s41598-024-77376-9

Download PDF

Article
Open access
Published: 26 October 2024

A hybrid container throughput forecasting approach using bi-directional hinterland data of port

Fangli Zeng^1,2 &
Shuojiang Xu³

Scientific Reports volume 14, Article number: 25502 (2024) Cite this article

2589 Accesses
3 Citations
Metrics details

Subjects

Abstract

Accurate forecasting of port container throughput plays a crucial role in optimising port operations, resource allocation, supply chain management, etc. However, existing studies only focus on the impact of port hinterland economic development on container throughput, ignoring the impact of port foreland. This study proposed a container throughput forecasting model based on deep learning, which considers the impact of port hinterland and foreland on container throughput. Real-world experimental results showed that the proposed model with multiple data sources outperformed other forecasting methods, achieving significantly higher accuracy. The implications of this study are significant for port authorities, logistics companies, and policymakers.

Ports’ criticality in international trade and global supply-chains

Article Open access 27 July 2022

Enhancing global maritime traffic network forecasting with gravity-inspired deep learning models

Article Open access 19 July 2024

An approach to assessing tsunami risk to the global port network under rising sea levels

Article Open access 04 December 2024

Introduction

Ports play a crucial role in global trade, serving as vital gateways for the movement of goods and commodities¹. Efficient operations and planning within ports are essential for ensuring smooth supply chains and economic growth. Accurately forecasting container throughput is a key aspect of port management, enabling port authorities and stakeholders to optimise resource allocation, plan infrastructure development, and enhance overall operational efficiency².

Traditionally, container throughput forecasting has relied on statistical models based on historical data and expert knowledge³. However, these methods often struggle to capture the complex and dynamic nature of port operations, leading to limited accuracy in forecasting. Recognising the need for more robust and accurate forecasting, researchers have turned to advanced technologies such as deep learning^4,5.

Deep learning, a subset of machine learning, has shown remarkable success in various domains, including computer vision⁶, natural language processing⁷, and time series analysis⁸. Its ability to automatically learn and extract intricate patterns from vast amounts of data makes it an ideal candidate for improving container throughput forecasting in ports.

In addition, one concept that has gained significant attention in recent years is the notion of a port’s bi-directional hinterland. The bi-directional hinterland refers to the area surrounding a port that serves as both a source and destination for cargo movements, including hinterland and foreland⁹. Hinterland refers to the geographical area connected with the port in some mode of transport, which generates a source of goods for the port or consumes goods imported from the port. Foreland refers to other countries or regions that are connected to the seaport by maritime vessels. Understanding the size and scope of a port’s bi-directional hinterland is crucial for accurately forecasting container throughput¹⁰.

The main motivation of this study is to release the potential and power of port’s bi-directional hinterland data in the forecasting of port throughput. The forecasting accuracy can be improved by introducing data related to port’s bi-directional hinterland. This research proposed a Grey-CNN model that combines Grey Correlation Analysis (GCA) and Convolutional Neural Networks (CNN). GCA is employed to solve the overfitting problem caused by too many input variables. CNN is used to mine non-linear features in data. This paper is the first time that the Grey-CNN model has been used in the field of port throughput forecasting.

The first step is to collect candidate factors that can influence port container throughput. When selecting these factors, two aspects, hinterland, and foreland, need to be considered. The second step is to filter the most relevant factors based on the output of GCA. The third step is to train the CNN model with selected factors. The combined use of GCA and CNN enables the model to not only select the most relevant variables from a variety of data and improve the interpretability of the model, but also capture the complex relationships and dependencies that may exist in the data, so as to achieve more accurate forecasting results. By leveraging historical data, hinterland and foreland information, this approach will enhance the accuracy of container throughput forecasting.

The implications of the study can be significant for port management and logistics planning. By better understanding and utilising the data related to a port’s bi-directional hinterland, port authorities can make informed decisions regarding infrastructure investments, resource allocation, and trade facilitation measures. Ultimately, accurate container throughput forecasting can contribute to reducing congestion, optimising operations, and fostering sustainable growth within ports.

The remainder of this manuscript is organised as follows. Literature review provided a literature review. The methodology and forecasting framework were introduced in Methodology. Empirical study presented and discussed the results of empirical study. Finally, Conclusion concluded this study with practical implications, limitations, and future research.

Literature review

Bi-directional hinterland

Port’s hinterland and foreland are interconnected concepts that define the relationship between a port and its surrounding areas, both on land and at sea. The hinterland refers to the inland regions connected to a particular port through various transportation corridors such as roads, railways, or waterways¹¹. This expansive area extends beyond immediate proximity to the port into neighbouring territories. The hinterlands serve as an integral part of the logistics network supporting economic activities related to import/export operations facilitated by the port. They provide crucial links for transporting cargo between inland locations and the port facility itself.

On the other hand, the foreland serves as the ocean-ward mirror of the hinterland, encompassing ports and overseas markets linked by shipping services from a specific port¹². It represents a maritime space where commercial relationships thrive, particularly with overseas customers¹³. Within this realm, ports play a vital role in facilitating trade activities by establishing connections through shipping services to reach international markets¹⁴. These connections extend beyond physical infrastructure to include destinations worldwide where goods are imported from or exported to via ships departing from or arriving at the port¹⁵.

The symbiotic relationship between hinterland and foreland is essential for fostering economic development and attracting investment in today’s global economy¹⁶. Ports rely on their hinterlands for access to domestic markets and sources of goods for exportation abroad while providing critical infrastructure like terminals, berths, and storage facilities necessary for efficient handling of goods within their forelands’ maritime spaces. The integration between these two entities enables ports not only to benefit from efficient cargo exchanges with overseas markets but also support the economic growth of industries in their hinterlands by improving accessibility to global trade networks. Developing robust transportation infrastructure and optimising logistical operations within the hinterlands can enhance overall supply chain efficiency, reducing time costs in logistics operations, minimising inventory holding requirements, resulting in significant cost savings, and facilitating just-intime delivery processes¹⁷.

Factors affecting container throughput

In view of the changing international situation, the development of the maritime industry will also be affected by it, how to establish a model for the forecast of port container throughput becomes a difficult problem. Moreover, forecasting port throughput is a complicated process, which is closely related to the geographical location, economic conditions, traffic, and natural conditions of the port hinterland. The research shows that the export trade volume of a country is the key macroeconomic variable affecting the container throughput¹⁸. This indicates that the port container throughput is affected by the market conditions of the export destination. Moreover, Li et al. pointed out that factors affecting port throughput can be roughly divided into three categories: economic factors, port factors, and other factors: (1) economic factors include world economic level, foreign trade and domestic trade; (2) the port itself factors include port location, port infrastructure construction and port supply and demand relations; (3) other factors include weather, government policy, and competition between ports¹⁹.

Some of these factors are uncertain and some will change with time, but all of them affect the establishment of the forecasting model, so that the change of port throughput has two linear and nonlinear situations. Li Y et al. also used univariate linear regression, multiple linear regression, and generalised learning system to forecast the container throughput of Lianyun Port and compared the experimental results of single variable and combined two economic factors. The results showed that the port throughput could be more effectively forecasted by adding two economic factors²⁰. Dai employed GCA to analyse the relationship between Tangshan port logistics and regional economic development. The results showd that the tertiary industry and the fixed assets investment of the whole society are highly correlated with port logistics²¹.

Univariate container throughput forecasting

Container throughput forecasting is a popular research topic in the fields of forecasting and maritime research. In the past few decades, researchers proposed many innovative models to forecast the container throughput. From the perspective of data types used, these models can be divided into two categories. The first category is a forecasting model that only uses historical container throughput; the second category is a model that uses both historical container throughput and economic data of the port city.

For the first category, Niu et al. proposed a hybrid decomposition-ensemble model based on Variational Mode Decomposition (VMD) and Hybridising Grey Wolf Optimisation (HGWO). In their study, they collected monthly container throughput of the Port of Singapore (January 1995 - May 2016) and the Port of Shanghai (January 2001 to May 2016. Then the time series were decomposed into low and high frequency components by the VMD algorithm. Then the low frequency components were used to train ARIMA and high frequency components are used to train Support Vector Regression (SVR) using HGWO algorithm²². Mo et al. collected monthly container throughput of the Port of Xiamen and the Port of Shanghai (January 2001 - December 2015). Then the Seasonal Autoregressive Integrated Moving Average (SARIMA) was used to forecast the linear trend of the original time series. After that, the nonlinear residual series was analysed by the BP neural network, genetic programming and SVR, and the Group Method of Data Handling (GMDH) neural network was used to select the optimal complexity model. Then the results of linear and nonlinear parts are added together to get the final results²³. Similarly, Xie et al. collected monthly container throughput of the Port of Singapore and the port of Log Angeles (January 1995 - January 2017). Then they employed the decomposition method to analyse the original time series. Data characteristic analysis was conducted to select the suitable model for the decomposed parts. Finally, the results of the models were aggregated to get the forecasting result²⁴. Furthermore, Hassan et al. built a reinforcement learning framework to forecast freight demand using historical demand data. They first cluster historical demand time series into different groups according to their spatial-temporal characteristics. And then the committee of predictors of each cluster were constructed. Finally, the predictive component models were fed into reinforcement learning model to get the forecasting results. Their results showed that the proposed approach is able to generate accurate results²⁵. Moreover, Yang and Chang proposed a mixed-precision model based on CNN and LSTM to forecast container throughput²⁶. Xiao et al. employed attention mechanism to further improve the performance of VMD-CNN-LSTM model²⁷.

It can be found from the above research cases that for the research with only historical container throughput data, the usual method is to use decomposition algorithm to decompose the original time series, then analyse and forecast the decomposed parts, and finally aggregate all the parts together to get the final result^28,29,30. However, there is still something that can be improved in the above studies. That is, these studies are all based on the univariate forecasting model, while ignoring other factors around the research object which may have effect on it. Therefore, it is necessary to incorporate these kinds of data to build multivariate forecasting model. The next section is going to review previous studies that employed multivariate models to forecast target variable.

Multivariate container throughput

It is limited to forecast the future container throughput based only on the historical container throughput of the port. Historical container throughput can only provide partial information related to the target, but not all information related to container handling. Therefore, the introduction of more data related to container throughput has become an effective means to improve the forecasting accuracy, which is the second category of container throughput forecasting studies.

For instance, Geng et al. collected yearly port throughput data and corresponding socio-economic indicator data for the Port of Shanghai (1978–2013). Then the candidate input variables were analysed using Multivariable Adaptive Regression Splines (MARS) to determine the final input variables. When all input variables were determined, the authors used SVR with an optimisation algorithm to obtain the final forecasting model³¹. Intihar et al. employed dynamic factor analysis and Autoregressive Integrated Moving Average with Exogenous Variables (ARIMAX) to forecast the container throughput of the Port of Koper. The dynamic factor analysis method was used to analyse influential external macroeconomic indicators and ARIMAX was then used to forecast future container throughput³². Moreover, Rashed et al. proposed a combined approach to forecast container throughput demand. They first employed AutoregRessive Distributed Lag (ARDL) model to estimate the link between the economic the activity and container throughput. Then the future growth rates for exogenous variables were calculated using scenario analysis. The final step involved combining the outputs of the first two steps to forecast container throughput³³. Additionally, Tang et al. selected an optimal model based on multiple factors for container throughput forecasting. They collected five external factors, including the total retail sales of consumer goods, the gross domestic product of the local city, import and export trade volume, total output value of the second industry and total fixed asset investment. These factors were then used to train several forecasting methods. They determined that the back-propagation neural network generated the best results for the Port of Shanghai and the Port of Lianyungang³⁴.

It was noted that the usual process for container throughput forecasting involves selecting suitable variables from candidate economic data of the port city using analytic methods. Then, the forecasting model can be trained by the selected economic variables and historical container throughput. However, the existing studies only focus on the impact of economic development of the port hinterland on the container throughput, ignoring the impact of the port foreland. It is necessary to develop a method approach that considers the above two factors to forecast container throughput.

Methodology

Proposed Grey-CNN model

The proposed model is presented in Fig. 1. This section will provide a detailed explanation of each step.

Step 1. Collection of relevant data.

Port container throughput is affected by the development of hinterland and foreland, so data related to these two aspects should be collected.

For hinterland, this study collects influencing factors related to port construction and economic development. However, different cities may not provide exactly the same data. If this is the case, similar data of the same type can be selected as input.

For foreland, this study collects port throughput and Gross Domestic Product (GDP) data for several countries with which the study subjects have the closest trade relations. These two kinds of data can be regarded as a synthesis of other information, which can represent the changes of the entire foreland market⁵. Furthermore, for different types of ports, such as production-export ports (e.g., Shanghai Port, Ningbo Port), transshipment ports (e.g., Port of Singapore), when selecting foreland data, the selection of countries will not be affected by the different port types. Because we chose the countries with which they trade most closely.

In addition, the historical container throughput of the port is also used as input data to train the forecasting model.

Step 2. Grey correlation analysis.

When there are many candidate influencing factors, it is more beneficial to extract the factors that are strongly related to the port container throughput to avoid overfitting problems. Grey correlation analysis is employed to screen the influencing factors.

Step 3. Model training.

In this study, CNN is used as the basis for the forecasting model because it was found in previous studies to be more effective for mining multi-dimensional time series. Due to the limited amount of data used in this study, the cross-validation method cannot be used, so this study adopts the early stop strategy to solve the overfitting problem. To verify the performance of proposed Grey-CNN model, nine models are selected as the benchmark models, which are:

Naïve method: The naive method in time series forecasting assumes that the future value will be the same as the most recent observed value, making it a simple but limited approach that ignores patterns or trends in the data.
ARIMA: Autoregressive Integrated Moving Average (ARIMA) is a common time series analysis method, widely used for data with obvious trends and seasonality, and widely used in economics, finance, meteorology, and other fields.
ARIMAX: AutoRegressive Integrated Moving Average with Exogenous Variables (ARIMAX) is a forecasting model that combines the ARIMA model with additional exogenous variables. It incorporates the autoregressive and moving average components to capture the time series’ internal dynamics, while also considering the impact of external factors. By including exogenous variables, ARIMAX can provide more accurate and robust forecasting by accounting for factors beyond the time series itself, such as economic indicators or weather data.
u-LSTM: Univariate LSTM is a type of recurrent neural network (RNN) used for time series forecasting with a single variable. It can capture temporal dependencies and patterns in the data by utilising memory cells and gates. It is trained on historical data and can generate forecasting results for future time steps based solely on the input variable’s past values.
m-LSTM: Multi-variate LSTM is for time series forecasting with multiple input variables. It can capture complex relationships and dependencies between multiple variables, enabling more accurate forecasts by incorporating information from multiple sources in the input data.
u-GRU: Univariate GRU is another variant of recurrent neural networks (RNN) used for time series forecasting with a single variable. It employs gated mechanisms to capture long-term dependencies and patterns in the data, making it effective for modelling and forecasting sequential data with a single input variable.
m-GRU: Multi-variate GRU is similar to m-LSTM. It utilises gated mechanisms to capture dependencies and patterns between multiple variables, improving the accuracy of forecasts in multi-dimensional data.
u-CNN: Uni-variate CNN is a type of neural network commonly used for time series forecasting with a single input variable. It applies convolutional filters to capture local patterns and learns hierarchical representations, enabling effective feature extraction and forecasting in uni-dimensional data.
u-Transformer: Uni-variate Transformer is a variant of the Transformer model designed for time series forecasting with a single input variable. It utilises self-attention mechanisms to capture global dependencies and learns contextual representations, enabling accurate forecasts and handling long-range dependencies in uni-dimensional data.

Step 4. Comparison and analysis.

This step is to compare the assessment criteria of each model and validate the performance of the proposed Grey-CNN model for container throughput forecasting.

Grey correlation analysis

Grey correlation analysis, also known as Grey System Theory, is a statistical technique that measures the relationship between variables when data is limited or incomplete. It was developed as an extension of classical correlation analysis. Grey correlation analysis is particularly useful in situations where historical data or complete information is not available, making it applicable in various fields such as engineering, economics, and social sciences.

At the core of grey correlation analysis is the concept of grey numbers, which represent incomplete or uncertain information. Grey numbers consist of a definite value, an indeterminate value, and a grey relational coefficient. The grey relational coefficient quantifies the relationship between variables, indicating the degree of correlation between them.

The following steps present the main process of Grey Correlation Analysis.

Step 1

Suppose there is the reference sequence, in this study, is the time series of container throughput, and n is the number of observations. Then the candidate influence factors can be represented as is the number of candidate factors. The sequences of can be expressed as a matrix.

$$\:\left({\varvec{X}}_{1},\:{\varvec{X}}_{2},\:\dots\:,\:{\varvec{X}}_{\varvec{m}}\right)=\left[\begin{array}{ccc}{x}_{11}&\:{x}_{12}&\:\begin{array}{cc}\dots\:&\:{x}_{1 m}\end{array}\\\:{x}_{21}&\:{x}_{22}&\:\begin{array}{cc}\dots\:&\:{x}_{2 m}\end{array}\\\:\begin{array}{c}\dots\:\\\:{x}_{n1}\end{array}&\:\begin{array}{c}\dots\:\\\:{x}_{n2}\end{array}&\:\begin{array}{cc}\begin{array}{c}\dots\:\\\:\dots\:\end{array}&\:\begin{array}{c}\dots\:\\\:{x}_{nm}\end{array}\end{array}\end{array}\right]$$

Step 2: The reference sequence $\:\varvec{Y}$ and the sequences of the candidate factors $\:\left({\varvec{X}}_{1},\:{\varvec{X}}_{2},\:\dots\:,\:{\varvec{X}}_{\varvec{m}}\right)$ should be standardised using one of the following methods: (1) the initial value transform; (2) the average value transform; and (3) the polar difference transform. The transformed matrix would be:

$$\:{\varvec{Y}}^{\varvec{{\prime\:}}}=\left[\begin{array}{c}{y}_{1}^{{\prime\:}}\\\:{y}_{2}^{{\prime\:}}\\\:\begin{array}{c}\dots\:\\\:{y}_{n-1}^{{\prime\:}}\\\:{y}_{n}^{{\prime\:}}\end{array}\end{array}\right]$$

$$\:\left({\varvec{X}}_{1}^{\varvec{{\prime\:}}},\:{\varvec{X}}_{2}^{\varvec{{\prime\:}}},\:\dots\:,\:{\varvec{X}}_{\varvec{M}}^{\varvec{{\prime\:}}}\right)=\left[\begin{array}{ccc}{x}_{11}^{{\prime\:}}&\:{x}_{12}^{{\prime\:}}&\:\begin{array}{cc}\dots\:&\:{x}_{1 m}^{{\prime\:}}\end{array}\\\:{x}_{21}^{{\prime\:}}&\:{x}_{22}^{{\prime\:}}&\:\begin{array}{cc}\dots\:&\:{x}_{2 m}^{{\prime\:}}\end{array}\\\:\begin{array}{c}\dots\:\\\:{x}_{n1}^{{\prime\:}}\end{array}&\:\begin{array}{c}\dots\:\\\:{x}_{n2}^{{\prime\:}}\end{array}&\:\begin{array}{cc}\begin{array}{c}\dots\:\\\:\dots\:\end{array}&\:\begin{array}{c}\dots\:\\\:{x}_{nm}^{{\prime\:}}\end{array}\end{array}\end{array}\right]$$

Step 3

Compute the absolute difference value between reference sequence and the candidate factors.

$$\:{\varDelta\:}_{ti}=\left|{y}_{t}^{{\prime\:}}-{x}_{ti}^{{\prime\:}}\right|,\:t=\text{1,2},\dots\:,n,\:i=\text{1,2},\dots\:,m$$

Step 4

Calculate mm and MM.

$$\:mm={min}_{i}^{m}{min}_{t}^{n}{\varDelta\:}_{ti}$$

$$\:MM={max}_{i}^{m}{max}_{t}^{n}{\varDelta\:}_{ti}$$

Step 5

Calculate the relational coefficient matrix of the reference sequence and the candidate factors.

$$\:\left[\begin{array}{cccc}r({y}_{1}^{{\prime\:}},\:{x}_{11}^{{\prime\:}})&\:r({y}_{1}^{{\prime\:}},\:{x}_{12}^{{\prime\:}})&\:\dots&\:r({y}_{1}^{{\prime\:}},\:{x}_{1 m}^{{\prime\:}})\\\:r({y}_{2}^{{\prime\:}},\:{x}_{21}^{{\prime\:}})&\:r({y}_{2}^{{\prime\:}},\:{x}_{22}^{{\prime\:}})&\:\dots\:&\:r({y}_{2}^{{\prime\:}},\:{x}_{2 m}^{{\prime\:}})\\\:\dots\:&\:\dots\:&\:\dots\:&\:\dots\:\\\:r({y}_{n}^{{\prime\:}},\:{x}_{n1}^{{\prime\:}})&\:r({y}_{n}^{{\prime\:}},\:{x}_{n2}^{{\prime\:}})&\:\dots\:&\:r({y}_{n}^{{\prime\:}},\:{x}_{nm}^{{\prime\:}})\end{array}\right]$$

Where $\:r\left({y}_{t}^{{\prime\:}},\:{x}_{ti}^{{\prime\:}}\right)=\frac{mm+\rho\:\times\:MM}{{\varDelta\:}_{ti}+\rho\:\times\:MM},\:t=\text{1, 2},\dots\:,n,\:i=\text{1, 2},\dots\:,m.$ For most cases, the value of $\:\rho\:$ is 0.5.

Step 6

Calculate the grey relational degree.

$$\:r\left(Y,{X}_{i}\right)=\frac{1}{n}\sum_{t=1}^nr\left({y}_{t}^{{\prime\:}},\:{x}_{ti}^{{\prime\:}}\right),\:i=\text{1, 2},\dots\:,m$$

CNN

The CNN architecture comprises an input layer, convolution layer, pooling layer, fully connected layer, and output layer. In the input layer, the input data undergoes convolution using a convolution kernel to create the convolution layer. Subsequently, the pooling layer applies pooling methods like max pooling or average pooling to effectively reduce the size of the parameter matrix, thereby decreasing the number of parameters in the fully connected layer. This addition of the pooling layer not only accelerates computation but also helps prevent overfitting. Following the pooling process, the pooled data is forwarded to the fully connected layer, which can be considered as a conventional multi-layer perceptron. The input for the fully connected layer consists of features extracted from both the convolution layer and the pooling layer. The final output layer can utilise logistic regression, soft-max regression, or even support vector machine to generate the ultimate output. To enhance network accuracy, the network model employs the gradient descent method to minimise the loss function, enabling iterative adjustment of weight parameters across network layers during frequent training iterations.

The CNN was initially developed to address computer vision tasks, with the default input being an RGB image. This specific type of CNN is referred to as 3DCNN since the RGB image can be divided into three sub-images based on the RGB colours. However, when the input data consists of time series, the CNN is referred to as 1DCNN. Figure 2 illustrates the fundamental structure of a 1D-CNN.

Evaluation criteria

In order to compare the forecasting accuracy of different models, this study uses five evaluation criteria for comparison, and the smaller the value of these criteria, the higher the forecasting accuracy of the model. The five evaluation criteria are calculated as follows.

Mean absolute error (MAE):

$$\:MAE=\sqrt{\frac{1}{n}\sum_{i=1}^{n}\left|{A}_{i}-{F}_{i}\right|}$$

(5)

Root-mean-square error (RMSE):

$$\:RMSE=\sqrt{\frac{1}{n}\sum_{i=1}^{n}{({A}_{i}-{F}_{i})}^{2}}$$

(6)

Mean percentage error (MPE)

$$\:MPE=\frac{1}{n}\sum_{i=1}^{n}\frac{{A}_{i}-{F}_{i}}{{A}_{i}}$$

(7)

Mean absolute percentage error (MAPE)

$$\:MAPE=\frac{1}{n}\sum_{i=1}^{n}\left|\frac{{A}_{i}-{F}_{i}}{{A}_{i}}\right|$$

(8)

Symmetric mean absolute percentage error (SMAPE)

$$\:SMAPE=\frac{100}{n}\sum_{i=1}^{n}\frac{\left|{A}_{i}-{F}_{i}\right|}{(\left|{A}_{i}\right|+\left|{F}_{i}\right|)/2}$$

(9)

Where $\:n$ is the number of observations, $\:{A}_{i}$ is the actual value of the $\:{i}_{th}$ observation, $\:{F}_{i}$ is the forecast value of the $\:{i}_{th}$ observation.

Empirical study

Data collection and pre-processing

In this study, Shanghai Port and Ningbo Port are taken as experimental objects. Because these two ports are the largest container ports in Asia, the forecasting of them has strong practical significance and research value.

Shanghai Port is in the middle of the coastline of mainland China and the estuary of the Yangtze River, connecting the north and south coasts of China and the world ocean before, and then traversing the Yangtze River basin, Jiang-Zhe-Wan River basin and Taihu Lake basin. Considering that the port container throughput will be affected by a variety of factors, this study collects two types of influencing factors, which are shown in Table 1. All the data are annual data from 2000 to 2021. Please refer to Appendix Table A1 and Table A2 for detailed data. According to the research, 80/20 is a common ratio of training and testing sets³⁵. Therefore, the training set and the testing set contain 18 and 4 observations, respectively.

Ningbo port is a major container transportation hub in China. It boasts state-of-the-art facilities, efficient operations, and extensive connectivity, making it an ideal choice for global trade. With its strategic location, it offers seamless logistics solutions and plays a vital role in facilitating international container shipments. The influencing factors of Ningbo port are collected from 1990 to 2021. Please refer to Appendix Table A3 and Table A4 for detailed data. The name of the candidate variables are shown in Table 1. Therefore, the training set and the testing set contain 26 and 6 observations, respectively.

It can be seen that the number of data points used in this study is much smaller than that in other fields of research. But this does not affect the conclusion of this study and the forecasting power of the model. Although the deep learning model can better capture the patterns from of data when there is a large amount of data, as long as the model can converge, no matter the amount of data is large or small, it can form effective forecasting results.

Table 1 Candidate influencing factors.

Full size table

Results

According to the proposed model, the first step is to collect candidate influencing factors for Shanghai Port and Ningbo Port. This step has been done in Data collection and pre-processing. Then, this section will present the output of Grey Correlation Analysis and forecast result of the proposed model.

The grey correlation degree for each candidate factor is shown in Tables 2 and 3. It can be seen that all the degree are larger than 0.5. After comprehensive consideration of the calculation results, 0.77 is chosen as the cut-off point³⁶. Therefore, for Shanghai Port, 7 factors of hinterland and 2 factors of foreland are selected to train the model. For Ningbo Port, 10 factors of hinterland and 4 factors of foreland are selected.

By comparing the factors that affect the container throughput of Shanghai Port and Ningbo Port, this study finds that the influence of foreland on Shanghai port mainly comes from the GDP factors of Japan and the United States, which indicates that the import and export of goods from Shanghai Port is closely related to the economic development of these two countries. The container throughput of Ningbo Port is mainly affected by hinterland, which shows that the containers of Ningbo Port are mainly export goods, while Shanghai Port is mainly responsible for imported goods.

Table 2 The grey relational degree ranking of each index of Shanghai Port.

Full size table

Table 3 The grey relational degree ranking of each index of Ningbo Port.

Full size table

After sifting out the main influencing factors from the candidate factors, these selected factors are used to train the models mentioned in Proposed Grey-CNN model. The forecasting results of Shanghai Port and Ningbo Port are shown in Tables 4 and 5.

Table 4 Forecasting result of each model of Shanghai Port.

Full size table

Table 5 Forecasting result of each model of Ningbo Port.

Full size table

Discussion

By analysing the results in Tables 4 and 5, there are several findings.

First, the most notable observation is that, in most cases, the multivariate model outperforms the univariate model, with the exception of the ARIMAX model for Shanghai Port. This observation suggests the powerful ability of multivariate models to effectively leverage the additional information provided by multiple variables and potential challenges faced by univariate models in capturing the complexity and interdependencies present in the datasets. This observation also supports the study’s hypothesis that using both hinterland and foreland data can improve the accuracy of forecast results.

Second, for the exception case, the forecast accuracy of ARIMAX model is not as good as that of ARIMA for Shanghai Port. The possible reason is that ARIMAX is a linear model, which can only excavate the linear relationship between input variables and the target. However, in this study, the container throughput at Shanghai Port is affected by a number of nonlinear factors, so ARIMAX is unable to produce a more accurate result.

Third, among all the multivariate models, the model proposed in this study has the best performance. In the context of multi-variate time series forecasting, Convolutional Neural Networks (CNNs) have several advantages over models like GRU and LSTM, which are recurrent neural networks: (1) CNNs can perform parallel computation, allowing them to process multiple time steps simultaneously, which improves training and inference efficiency. In contrast, RNN-based models typically process time steps sequentially; (2) CNNs excel at extracting features from local regions, which is beneficial for recognising local patterns in time series data. Through convolution and pooling operations, CNNs can effectively capture local structures in the data; (3) the parameter sharing mechanism in CNNs reduces the model’s parameter count, enhancing generalisation and reducing the risk of overfitting. This parameter sharing feature enables CNNs to better utilise the correlations between variables in multi-variate time series data; (4) compared to LSTM, GRU and other models, CNNs are less prone to gradient vanishing issues during training, making them better at capturing long-term dependencies; (5) for certain time series data, local patterns and translation invariance may be crucial features. CNNs may be more effective in handling these types of data as they are naturally adept at capturing such features. While CNNs offer advantages in certain aspects, the choice of model for time series data should be based on the specific characteristics and requirements of the problem. Sometimes, RNN-based models like LSTM and GRU may be more suitable for capturing long-term dependencies in time series data. Therefore, researchers should carefully consider the strengths and weaknesses of different models based on the specific context and data characteristics when selecting the most appropriate model for accurate forecasting in multi-variate time series tasks. But in this study, the observations of time series were limited. LSTM and GRU were therefore unable to capture long-term dependencies, so CNN performed better than them.

Forth, comparing the performance of simpler models like Naive method with more complex models like LSTM, CNN, and Transformer, it is evident that increased model complexity does not always guarantee better forecasting accuracy. The proposed Grey-CNN model, despite being more complex than some models, outperforms others in terms of forecast accuracy. This highlights the importance of not just model sophistication but also the relevance of the architecture to the specific forecasting task at hand.

Fifth, while complex models like LSTM, GRU, and Transformer may offer high accuracy in forecasting, they often come at the cost of interpretability. On the other hand, simpler models like Naive forecasting or ARIMA are easier to interpret but may lack the forecasting power of more complex models. Balancing between model interpretability and forecasting performance is crucial, especially in scenarios where stakeholders require transparency in the decision-making process.

Sixth, the two ports selected in this study are both production-export ports. In addition, there are some important ports that serve other functions, such as the Port of Singapore. Therefore, when collecting the data, it is necessary to choose according to the types of different ports.

Managerial implications

Tables 4 and 5 illustrated that the proposed method can generate more accurate forecasting results than the other. It also showed that container throughput, as a key indicator in the shipping industry, has a close connection with economic growth. When the economy is growing, there is an increased demand for goods and services. Manufacturers produce more products, and consumers have higher purchasing power. This leads to a greater volume of goods being transported by containers. However, during economic recessions, demand for goods declines, and so does container shipping. For instance, during the 2008–2009 global financial crisis, many countries experienced a significant drop-in economic activity. This led to a decrease in consumer spending and a slowdown in industrial production. As a result, container throughput plummeted as there was less trade and fewer goods being shipped.

For the Shipping Market, a container throughput forecasting model has several managerial implications. Firstly, it helps with planning and resource allocation. Shipping companies can use it to determine the appropriate number of vessels for different routes, and port authorities can invest in infrastructure based on predicted demand. Logistics providers can also manage warehouse space and transportation fleets more effectively. Secondly, it aids in pricing and revenue management. Shipping lines and freight forwarders can set competitive prices by understanding market trends. They can identify peak and off-peak periods to adjust rates and attract business. Additionally, it enables risk management by allowing managers to anticipate disruptions and develop hedging strategies.

For economic growth, the forecasting model is also crucial for economic growth. For policymakers, it helps in formulating trade and transportation policies. They can identify areas for infrastructure investment to support the flow of goods and promote regional development. For investors, it provides a basis for making informed decisions about investing in the shipping industry and related sectors. A positive forecast can attract investment and encourage long-term planning, contributing to overall economic prosperity. In summary, the model is valuable for both the shipping market and economic growth.

Conclusion

The analysis of forecasting models for Shanghai and Ningbo Ports reveals the superiority of multivariate models in capturing complex interdependencies, with exceptions like the ARIMAX model’s limitations in handling nonlinear relationships. The Grey-CNN model’s success highlights CNN advantages in multi-variate time series forecasting. Despite model complexity, the Grey-CNN outperforms simpler and more complex models, emphasising the importance of model relevance to the forecasting task. The trade-off between interpretability and accuracy underscores the need for a balanced approach in model selection, ensuring transparency in decision-making processes for stakeholders.

Furthermore, this study highlights the importance of considering a port’s two-way hinterland concept and utilising deep learning techniques to enhance container throughput forecasting accuracy. The findings contribute to the advancement of port management practices and provide valuable insights for stakeholders involved in maritime logistics and trade.

Insufficient data is the main limitation of this study. In future studies, we will collect data over a longer period of time and input variables of more dimensions to further improve the accuracy of the forecasting model. In the following research, our team will focus on proposing more innovative forecasting models that are more suitable for this research field.

The implications of this research are significant for port authorities, logistics companies, and policymakers. Accurate container throughput forecasting enables better resource planning, improved efficiency, and enhanced decision-making in the maritime industry. Furthermore, a deeper understanding of a port’s two-way hinterland can facilitate the development of targeted strategies to promote economic growth and regional integration.

Data Availability

The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.

References

Gu, B. & Liu, J. Port resilience analysis based on the HHM-FCM approach under COVID-19. Ocean. Coast Manag. 243, 106741 (2023).
Article Google Scholar
Xu, S., Zou, S., Huang, J., Yang, W. & Zeng, F. Comparison of different approaches of machine learning methods with conventional approaches on container throughput forecasting. Appl. Sci. 12 (19), 9730 (2022).
Article CAS Google Scholar
Farhan, J. & Ong, G. P. Forecasting seasonal container throughput at international ports using SARIMA models. Marit Econ. Logist. 20, 131–148 (2018).
Article Google Scholar
Shankar, S., Ilavarasan, P. V., Punia, S. & Singh, S. P. Forecasting container throughput with long short-term memory networks. Ind. Manag Data Syst. 120 (3), 425–441 (2020).
Article Google Scholar
Shankar, S., Punia, S. & Ilavarasan, P. V. ‘Deep learning-based container throughput forecasting: a triple bottom line approach’, Ind. Manag. Data Syst., vol. 121, no. 10, pp. 2100–2117, Oct. doi: (2021). https://doi.org/10.1108/IMDS-12-2020-0704
Voulodimos, A., Doulamis, N., Doulamis, A. & Protopapadakis, E. ‘Deep learning for computer vision: A brief review’, Comput. Intell. Neurosci., vol. 2018. (2018).
Otter, D. W., Medina, J. R. & Kalita, J. K. A survey of the usages of deep learning for natural language processing. IEEE Trans. Neural Netw. Learn. Syst. 32 (2), 604–624 (2020).
Article MathSciNet Google Scholar
Gamboa, J. C. B. ‘Deep Learning for Time-Series Analysis’, Jan. 07, arXiv: arXiv:1701.01887. Accessed: Feb. 02, 2023. [Online]. Available: (2017). http://arxiv.org/abs/1701.01887
Robinson, R. The hinterland-foreland continuum: Concept and methodology. Prof. Geogr. 22 (6), 307–310 (1970).
Article Google Scholar
Behdani, B., Wiegmans, B., Roso, V. & Haralambides, H. Port-hinterland transport and logistics: emerging trends and frontier research. Marit Econ. Logist. 22, 1–25 (2020).
Article Google Scholar
Talley, W. K. & Ng, M. Hinterland transport chains: a behavioral examination approach. Transp. Res. Part. E Logist Transp. Rev. 113, 94–98 (2018).
Article Google Scholar
Jung, P. H. & Thill, J. C. Sea-land interdependence and delimitation of port hinterland-foreland structures in the international transportation system. J. Transp. Geogr. 99, 103297 (2022).
Article Google Scholar
Martínez-Moya, J. & Feo-Valero, M. Measuring foreland container port connectivity disaggregated by destination markets: an index for Short Sea Shipping services in Spanish ports. J. Transp. Geogr. 89, 102873 (2020).
Article Google Scholar
Martinez-Moya, J., Mestre-Alcover, A. & Sala-Garrido, R. ‘Connectivity and competitiveness of the major Mediterranean container ports using ‘Benefit-of-the-doubt’and common sets of weights methods in Data Envelopment Analysis’. Marit Econ. Logist. 26, 261-282 (2023).
Panahi, R., Ng, A. K., Afenyo, M. & Lau, Y. Reflecting on forty years contextual evolution of arctic port research: the past and now. Transp. Res. Part. Policy Pract. 144, 189–203 (2021).
Article Google Scholar
Gattuso, D., Cassone, G. C. & Pellicanò, D. S. ‘A methodology to identify the Hinterland for Freight ports by Transportation cost functions’. Open. Transp. J., 17, E187444782212301 (2023).
Sdoukopoulos, E. & Boile, M. Port-hinterland concept evolution: a critical review. J. Transp. Geogr. 86, 102775 (2020).
Article Google Scholar
Eskafi, M. et al. Mutual information analysis of the factors influencing port throughput. Marit Bus. Rev. 6 (2), 129–146 (2021).
Article Google Scholar
Li, Y. et al. ‘A Review of Research on Port Throughput Forecasting’, in., IEEE International Conference on Smart Internet of Things (SmartIoT), IEEE, 2019, pp. 449–453. (2019).
Li, Y., Li, T. & Zuo, Y. ‘Port Throughput Forecasting Based on Broad Learning System with Considering Influencing Factors’, in 2020 Chinese Control And Decision Conference (CCDC), IEEE, pp. 4129–4134. (2020).
Dai, P. ‘Grey Correlation Analysis of the Relationship Between Port Logistics and Regional Economic Development in Tangshan City’, J. World Econ., vol. 2, no. 1, Art. no. 1, Mar. (2023).
Niu, M., Hu, Y., Sun, S. & Liu, Y. A novel hybrid decomposition-ensemble model based on VMD and HGWO for container throughput forecasting. Appl. Math. Model. 57, 163–178. https://doi.org/10.1016/j.apm.2018.01.014 (May 2018).
Mo, L. et al. GMDH-based hybrid model for container throughput forecasting: selective combination forecasting in nonlinear subseries. Appl. Soft Comput. 62, 478–490. https://doi.org/10.1016/j.asoc.2017.10.033 (Jan. 2018).
Xie, G., Zhang, N. & Wang, S. ‘Data characteristic analysis and model selection for container throughput forecasting within a decomposition-ensemble methodology’, Transp. Res. Part E Logist. Transp. Rev., vol. 108, pp. 160–178, Dec. doi: (2017). https://doi.org/10.1016/j.tre.2017.08.015
Hassan, L. A. H., Mahmassani, H. S. & Chen, Y. Reinforcement learning framework for freight demand forecasting to support operational planning decisions. Transp. Res. Part. E Logist Transp. Rev. 137, 101926 (2020).
Article Google Scholar
Yang, C. H. & Chang, P. Y. ‘Forecasting the demand for container throughput using a mixed-precision neural architecture based on CNN–LSTM’, Mathematics, vol. 8, no. 10, p. 1784, (2020).
Xiao, Y., Xue, X., Hu, Y. & Yi, M. ‘Novel Decomposition and Ensemble Model with Attention Mechanism for Container Throughput Forecasting at Four Ports in Asia’, Transp. Res. Rec. J. Transp. Res. Board, vol. 2677, no. 6, pp. 530–547, Jun. doi: (2023). https://doi.org/10.1177/03611981221149434
Du, P., Wang, J., Yang, W. & Niu, T. Container throughput forecasting using a novel hybrid learning method with error correction strategy. Knowl. -Based Syst. 182, 104853. https://doi.org/10.1016/j.knosys.2019.07.024 (Oct. 2019).
Ho, C. H., Chang, P. T., Hung, K. C. & Lin, K. P. Developing intuitionistic fuzzy seasonality regression with particle swarm optimization for air pollution forecasting. Ind. Manag Data Syst. 119 (3), 561–577 (2019).
Article Google Scholar
Huang, A., Lai, K., Li, Y. & Wang, S. Forecasting container throughput of Qingdao port with a hybrid model. J. Syst. Sci. Complex. 28 (1), 105–121. https://doi.org/10.1007/s11424-014-3188-4 (Feb. 2015).
Geng, J., Li, M. W., Dong, Z. H. & Liao, Y. S. ‘Port throughput forecasting by MARS-RSVR with chaotic simulated annealing particle swarm optimization algorithm’, Neurocomputing, vol. 147, pp. 239–250, Jan. doi: (2015). https://doi.org/10.1016/j.neucom.2014.06.070
Intihar, M., Kramberger, T. & Dragan, D. ‘Container Throughput Forecasting Using Dynamic Factor Analysis and ARIMAX Model’, PROMET - TrafficTransportation, vol. 29, no. 5, pp. 529–542, Nov. doi: (2017). https://doi.org/10.7307/ptt.v29i5.2334
Rashed, Y., Meersman, H., Sys, C., Van de Voorde, E. & Vanelslander, T. A combined approach to forecast container throughput demand: scenarios for the Hamburg-Le Havre range of ports. Transp. Res. Part. Policy Pract. 117, 127–141. https://doi.org/10.1016/j.tra.2018.08.010 (Nov. 2018).
Tang, S., Xu, S. & Gao, J. An optimal model based on multifactors for container throughput forecasting. KSCE J. Civ. Eng. 23 (9), 4124–4131 (2019).
Article Google Scholar
Al-Musaylh, M. S., Deo, R. C., Adamowski, J. F. & Li, Y. Short-term electricity demand forecasting with MARS, SVR and ARIMA models using aggregated demand data in Queensland, Australia. Adv. Eng. Inf. 35, 1–16. https://doi.org/10.1016/j.aei.2017.11.002 (Jan. 2018).
Li, Q. & Hong, X. Analysis and prediction of correlation factors of container throughput in Dalian Port based on grey theory. Logist Technol. 32 (23), 192–195 (2013).
CAS Google Scholar

Download references

Acknowledgements

The authors acknowledge the financial support from the Guangxi Science and Technology Planning Project funded by Science and Technology Department of Guangxi Zhuang Autonomous Region (reference number: AD23026153 / 2022AC21158).

Author information

Authors and Affiliations

Logistics and E-Commerce College, Zhejiang Wanli University, Ningbo, 315104, China
Fangli Zeng
The Key Research Center of Philosophy and Social Science of Zhejiang Province – Modern Port Service Industry and Creative Culture Research Center, Zhejiang, China
Fangli Zeng
School of Artificial Intelligence, Guangxi Colleges and University Key Laboratory of AI Algorithm Engineering, Guilin University of Electronic Technology, Jinji Road, Guilin, 541004, China
Shuojiang Xu

Authors

Fangli Zeng
View author publications
Search author on:PubMed Google Scholar
Shuojiang Xu
View author publications
Search author on:PubMed Google Scholar

Contributions

F.Zeng. and S.Xu. wrote the main manuscript text and F.Zeng. prepared all Tables. S.Xu provided forecasting models and F.Zeng conducted the experiments. All authors reviewed the manuscript.

Corresponding author

Correspondence to Shuojiang Xu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Zeng, F., Xu, S. A hybrid container throughput forecasting approach using bi-directional hinterland data of port. Sci Rep 14, 25502 (2024). https://doi.org/10.1038/s41598-024-77376-9

Download citation

Received: 22 February 2024
Accepted: 22 October 2024
Published: 26 October 2024
DOI: https://doi.org/10.1038/s41598-024-77376-9

Subjects

Abstract

Similar content being viewed by others

Ports’ criticality in international trade and global supply-chains

Enhancing global maritime traffic network forecasting with gravity-inspired deep learning models

An approach to assessing tsunami risk to the global port network under rising sea levels

Introduction

Literature review

Bi-directional hinterland

Factors affecting container throughput

Univariate container throughput forecasting

Multivariate container throughput

Methodology

Proposed Grey-CNN model

Grey correlation analysis

Step 1

Step 3

Step 4

Step 5

Step 6

CNN

Evaluation criteria

Empirical study

Data collection and pre-processing

Results

Discussion

Managerial implications

Conclusion

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Electronic supplementary material

Supplementary Material 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links