Introduction

The world’s largest economy, the United States, has tended to implement trade protectionism since Donald Trump took office as the 45th president1. In recent years, the protectionist trade policies have caused the trade relations between China and the United States increasingly nervous and many disputes also keep occurring. The trade policy differences of major economies may give rise to a new wave of protectionism, which has a negative impact on the economic globalization and global welfare. In this context, how to eliminate divergences in trade policies and promote trade cooperation is an important issue.

Game theory is an important method to study interactive behavior and applies to various fields2,3. In 1944, game theory was applied to the field of economics by John von Neumann & Oskar. As a result of this study, game theory was extended to study a wide range of economic and business problems4,5,6. International trade is the typically economic interaction behavior of different countries, and trade cooperation is influenced by the different trade policy and interests. International trade game is usually portrayed by the Prisoner’s Dilemma model in which all countries have a common game payoff from the free trade policy, but the trade protection policy often occurs for the development of domestic industries. Evolutionary games can effectively reveal the emergence mechanism of cooperative behavior in international trade7,8. Grossman et al.9 studied noncooperative and cooperative trade game models in which domestic politics determine the trade objectives. The results show that the game can reach an equilibrium of taxes, subsidies and political for each trade country. Yin et al.10 analyzed the game strategies and the game outcome of U.S.A and China during their trade war. Namaki et al.11 built the sequential chicken trade game model to simulate the non-cooperative game of the trade war. The optimal strategy is that both players of the game choose either free trade policy or protectionism policy at the same time. MccGwire et al.12 studied the Nash equilibrium of the international trade game model and shown that the ‘Tariff’ strategy is chosen as the optimal strategy for the both game countries even though the ‘Free Trade’ strategy can obtain more game payoffs. Krapohl et al.13 acknowledged that trade cooperation is not a stable equilibrium and absolute trade liberalization can be exploited by protectionist trade policies.

However, the above studies on the evolutionary game of international trade suppose that the players have two game strategies and the game relations are fixed and uniformly interactive. In fact, the game is dynamic competition between multiple strategies and the game relations are also dynamic and non-uniform interactive. If countries are considered as nodes and trade transactions between countries are considered as connected edges, complex networks can effectively express the non-uniform international trade relationship. Complex networks have been developed over two decades and many important results14,15,16 have been presented. Complex networks theory promotes the research boom of international trade network theory17,18. The study of international trade networks focused on three perspectives. Firstly, the topological characteristics and structural indicators of trade networks are studied from a holistic or local perspective19,20. In addition, the characteristics of trade networks focus on special area, such as mineral resources, crop trade, mineral resources, manufacturing and so on21,22. Moreover, international trade network dynamic evolution models are built, which can effectively analyze the final evolutionary state of trade networks structures23,24. These studies of trade networks focused on the network topology and structure. However, the dynamical processes of the international trade networks, especially game dynamics25,26,27,28, rarely been addressed.

Fair trade is an important basis for international trade transactions, which can be traced back to the‘Fair Trade Movement’of the British textile industry in the late 19th century. We believe that absolute fairness is difficult to achieve and the trade cooperation for fair policy requires a certain tolerance for the absolutely equitable benefits. In fact, there are different policies or concepts for international trade due to different countries inconsistencies in development stages, conditions and political demands. In this paper, we propose an evolutionary game model of international trade in which game relationships are portrayed by complex networks, players have multiple strategies and the game relationship is dynamic adjustment. We use theoretical analysis and computer simulation to study the influence of trade policy differences and the tolerance of fair policy on the game payoffs of countries and the overall interests of trade network. Meanwhile, we study the evolutionary game process and the emergent mechanisms of trade cooperation.

The game models

In fact, the different of cultural backgrounds, development stages, political demands and other national conditions for different countries can form the divergence of the trade policy which could be an important cause of protectionism and trade cooperation. We suppose that there are three trade policies for different countries: fair-trade (FT), opportunism (OP) and win-win (WW). In this paper, we regard the trade policy differences as the different game strategies which give rise to different demands for trade benefits. We can explain the economic and practical meaning of these three game strategies in terms of trade benefits requirements.

  1. (i)

    FT strategies are the fair trade policies. Its require that the benefits of trade be distributed equally, but the absolute fairness is difficult to achieve. Therefore, the game payoffs of FT strategies need to have a certain degree of deviation for the absolutely equitable benefits. In this paper, the deviation of the game payoffs for FT strategies can be called the degree of tolerance “t”. In fact, countries with fair trade policies have a limited tolerance for the degree of deviation, so we suppose the degree of tolerance satisfies

    $$\begin{aligned} 0\le t< 0.5 \end{aligned}$$
    (1)

    When the trade benefits of FT strategies \(\pi _{\textrm{FT}}\) exceed an upper limit, the countries will share benefits with their trade partners to maintain long-term trade cooperation. However, the trade benefits below the lower bound, the countries will resort to the trade protection and the trade cooperation fails. In order to maintain trade cooperation and relative fairness, we suppose that the desired trade benefits of FT strategies satisfy

    $$\begin{aligned} 0.5-t\le \pi _{\textrm{FT}}\le 0.5+t. \end{aligned}$$
    (2)
  2. (ii)

    OP strategies are the opportunism trade policies which are the antithesis of the FT strategies. In practice, the less developed country in economy or the countries with location advantages expect to benefit from unilateral preferences or assistance in international trade. Therefore, these countries with OP strategies expect to obtain a larger share of trade benefits and the desired trade benefits satisfy

    $$\begin{aligned} \pi _{\textrm{op}} > 0.5. \end{aligned}$$
    (3)
  3. (iii)

    WW strategies are the win-win trade policies. WW strategies try to find all opportunities to achieve the trade cooperation for the trade benefits. Suppose that the strategies not presuppose the relative shares of trade benefits and only require that trade benefits growth through trade cooperation. The desired trade benefits of WW strategies satisfy

    $$\begin{aligned} \pi _{\textrm{ww}}> 0. \end{aligned}$$
    (4)

Therefore, this strategy is more inclusive to trade partners and more flexible in its demands for trade benefits.

The two countries i and j with trade relations in a game round can obtain the trade benefits \(\pi _{i}\) and \(\pi _{j}\), respectively. The trade benefits of \(\pi _{i}\) and \(\pi _{j}\) are randomly and obey uniformly distributed in the interval [0, 1]. The total benefits of trade cooperation for the two countries satisfy

$$\begin{aligned} \pi _{i}+\pi _{j}=1. \end{aligned}$$
(5)

In each game round, if the values of \(\pi _{i}\) and \(1-\pi _{i}\) satisfy the desired trade benefits (Eqs. 2 to 4) in different trade strategies, the trade cooperation can be achieved. Otherwise, the trade transactions could not be carried out and the trade benefits of the two countries are zero.

When \(0< \pi _{i}< 0.5-t\), the trade benefits of opponent j satisfy \(0.5+t\le \pi _{j}=1-\pi _{i}< 1\). If player i is the OP or FT trade strategies, the trade benefits of player i unable to meet the conditions \(0.5-t\le \pi _{\textrm{FT}}\le 0.5+t\) and \(\pi _{\textrm{OP}}> 0.5\). Therefore, no matter what the strategies chosen by the opponent j, the trade cooperation cannot be achieved and trade benefits of the both players are 0. If player i is the WW trade strategy which means that player i is willing to accept a relatively small trade benefit \(\pi _{i}> 0\), there are three scenarios for player j.

  1. (i)

    The opponent j chooses the OP strategy and \(\pi _{j}=1-\pi _{i}> 0.5\) which satisfy the benefits demands of \(\pi _{\textrm{OP}}> 0.5\).

  2. (ii)

    The opponent j chooses the FT strategy and \(0.5+t< \pi _{j}=1-\pi _{i}< 1\) which satisfy the lower bound of \(0.5-t\le \pi _{\textrm{FT}}\), but it exceeds the upper bound of \(\pi _{\textrm{FT}}\le 0.5+t\). For the consideration of fair trade and maintenance of long-term cooperative relationship between the two player, the opponent j will adjust the trade benefits and the trade benefits of the two sides are \(0.5-t\) and \(0.5+t\), respectively.

  3. (iii)

    The opponent j chooses the WW strategy, \(\pi _{j}=1-\pi _{i}> 0\) which satisfy \(\pi _{\textrm{WW}}> 0\) and trade cooperation can be reached.

Therefore, the game matrix of the trade benefits for \(0< \pi _{i}< 0.5-t\) is shown in Table 1.

Table 1 The game matrix for \(0< \pi _{i}< 0.5-t\).

Similarly, we discuss the different values of \(\pi _{i}\) and the game matrix of trade benefits are shown in Tables 2, 3, and 4.

Table 2 The game matrix for \(0.5-t< \pi _{i}< 0.5\).
Table 3 The game matrix for \(0.5< \pi _{i}< 0.5+t\).
Table 4 The game matrix for \(0.5+t< \pi _{i}< 1\).

On the basis of the above one-shot game matrix, we discuss the repeated game payoffs of different strategy for the different intervals of values of \(\pi _{i}\).

  1. (i)

    The player with FT strategy may encounter opponents with FT, OP and WW strategies, respectively. When strategy combinations are FT–OP, it can be seen that the only way to reach trade cooperation is that \(0.5-t\le \pi _{\textrm{FT}}\le 0.5\) and the benefits distribution can be accepted by the two sides according to the game matrix in Table 1, 2, 3, and 4. Therefore, the expected payoffs of FT strategy are

    $$\begin{aligned} \pi _{\textrm{FT}}^\mathrm {FT-OP}=\int _{0.5-t}^{0.5}\pi \textrm{d}\pi =(t-t^{2})/2. \end{aligned}$$
    (6)

    Similarly, in the strategy combinations of FT–FT and FT–WW, the expected payoffs of FT strategy can be express as Eqs. (7) and (8).

    $$\begin{aligned} \pi _{\textrm{FT}}^\mathrm {FT-FT}&=\int _{0.5-t}^{0.5+t}\pi \textrm{d}\pi =t \end{aligned}$$
    (7)
    $$\begin{aligned} \pi _{\textrm{FT}}^\mathrm {FT-WW}&=\int _{0.5-t}^{0.5+t}\pi \textrm{d}\pi +\int _{0.5+t}^{1}(0.5+t) \textrm{d}\pi =-t^{2}+t+0.25 \end{aligned}$$
    (8)
  2. (ii)

    The player with the strategy of OP in different strategy combination can obtain the expected payoffs

    $$\begin{aligned} \pi _{\textrm{OP}}^\mathrm {OP-OP}&=0 \end{aligned}$$
    (9)
    $$\begin{aligned} \pi _{\textrm{OP}}^\mathrm {OP-FT}&=\int _{0.5}^{0.5+t}\pi \textrm{d}\pi =(t^{2}+t)/2\end{aligned}$$
    (10)
    $$\begin{aligned} \pi _{\textrm{OP}}^\mathrm {OP-WW}&=\int _{0.5}^{1}\pi \textrm{d}\pi =0.375 \end{aligned}$$
    (11)
  3. (iii)

    The player with the strategy of WW in different strategy combination can obtain the expected payoffs

    $$\begin{aligned} \pi _{\textrm{WW}}^{WW-OP}&=\int _{0}^{0.5}\pi \textrm{d}\pi =0.125\end{aligned}$$
    (12)
    $$\begin{aligned} \pi _{\textrm{WW}}^{WW-FT}&=\int _{0}^{0.5-t}(0.5-t) \textrm{d}\pi +\int _{0.5-t}^{0.5+t}\pi \textrm{d}\pi =t^{2}+0.25\end{aligned}$$
    (13)
    $$\begin{aligned} \pi _{\textrm{WW}}^{WW-WW}&=\int _{0}^{1}\pi \textrm{d}\pi =0.5 \end{aligned}$$
    (14)

According to Eqs. (6)–(14), we can obtain expected payoffs matrix of the repeated game as is shown in Table 5. From Table 5, we know that the OP strategy cannot cooperate with other strategies, and the expected payoffs of the OP strategy in the repeated game does not exceed 0.375. The expected payoffs of FT strategy in the repeated game are affected by the degree of tolerance “t”. According to the range of values of t, we know that the expected payoffs of FT strategy are in the interval of [0, 0.5). It’s worth mentioning that the maximum expected payoffs of repeated games for different strategies are 0.5 which is the result of the repeated games between the strategy combinations of win-win.

Table 5 The long term expected payoffs matrix.

The game of international trade

The theoretical analysis

In the trade network, the trad countries are regard as nodes and trade transactions between countries are regard as connected edges. In the game, the game payoffs of countries are affected not only by their own trade strategies, but also by the distribution of various strategies in the trade network. In addition, the responsible countries in the international trade should consider their own payoffs and the maximization of the overall payoffs from the international trade cooperation.

Suppose that the total number of nodes is m and the strategies of OP, FT and WW are randomly and uniformly distributed in the trade network. The proportion of the three strategies are \(p_{\textrm{OP}}\), \(p_{\textrm{FT}}\), \(p_{\textrm{WW}}\) and satisfy \(p_{\textrm{OP}}+p_{\textrm{FT}}+p_{\textrm{WW}}=1\). For n rounds game, the expected cumulative payoffs of FT strategies in the fully connected network satisfies

$$\begin{aligned} u_{\textrm{FT}}=nmp_{\textrm{OP}}\pi _{\textrm{FT}}^\mathrm {FT-OP}+n(mp_{\textrm{FT}}-1)\pi _{\textrm{FT}}^\mathrm {FT-FT}+nmp_{\textrm{WW}}\pi _{\textrm{FT}}^\mathrm {FT-WW} \end{aligned}$$
(15)

If the proportion of FT strategies in the trade network is large enough, we have \(mp_{\textrm{FT}}-1\approx mp_{\textrm{FT}}\). Substituting Eqs. (6)–(8) into the Eq. (15), we have

$$\begin{aligned} u_{\textrm{FT}}=nm\left[ (-t^{2}+t+0.25) p_{\textrm{WW}}+tp_{\textrm{FT}}+\frac{(-t^{2}+t)}{2}p_{\textrm{OP}}\right] . \end{aligned}$$
(16)

Similarly, the cumulative expected payoff of the OP strategies in the network game satisfies

$$\begin{aligned} u_{\textrm{OP}}=nm\left( 0.375p_{\textrm{WW}}+\frac{t^{2}+t}{2}p_{\textrm{FT}}\right) . \end{aligned}$$
(17)

The cumulative expected payoff of the WW strategies in the network game satisfies

$$\begin{aligned} u_{\textrm{WW}}=nm[0.5p_{\textrm{WW}}+(t^{2}+0.25) p_{\textrm{FT}}+0.125p_{\textrm{OP}}] \end{aligned}$$
(18)

From Eqs. (17)–(18), we know that the cumulative expected payoffs of all strategies are affected by the degree of tolerance “t”. Especially, Eq. (17) shows that the cumulative expected payoffs of OP strategies independent of the proportion of OP strategies.

Now we discuss maximizing the overall payoffs of trade networks. The overall payoffs \(U_{\textrm{G}}\) for the n-round game are

$$\begin{aligned} U_{\textrm{G}}=m(p_{\textrm{OP}}u_{\textrm{OP}}+p_{\textrm{FT}}u_{\textrm{FT}}+p_{\textrm{WW}}u_{\textrm{WW}}) \end{aligned}$$
(19)

Substituting Eqs. (16)–(18) into (19) yields

$$\begin{aligned} U_{\textrm{G}}&=nm^{2}p_{\textrm{OP}}\left( 0.375p_{\textrm{WW}}+\frac{t^{2}+t}{2}p_{\textrm{FT}}\right) +nm^{2}p_{\textrm{FT}}\left[ (-t^{2}+t+0.25) p_{\textrm{WW}}+tp_{\textrm{FT}}+\frac{-t^{2}+t}{2}p_{\textrm{OP}}\right] \\&\quad +nm^{2}p_{\textrm{WW}}[0.5p_{\textrm{WW}}+(t^{2}+0.25) p_{\textrm{FT}}+0.125p_{\textrm{OP}}] . \end{aligned}$$

This can be organized to yield

$$\begin{aligned} U_{\textrm{G}}=\frac{1}{2}nm^{2}(p_{\textrm{WW}}+2tp_{\textrm{FT}})(p_{\textrm{FT}}+p_{\textrm{OP}}+p_{\textrm{WW}}). \end{aligned}$$
(20)

Since \(p_{\textrm{FT}}+p_{\textrm{OP}}+p_{\textrm{WW}}=1\), we can obtain that

$$\begin{aligned} U_{\textrm{G}}=\frac{1}{2}nm^{2}(p_{\textrm{WW}}+2tp_{\textrm{FT}}). \end{aligned}$$
(21)

In the Eq. (21), parameters n and m are constants. According to the constraints, we know

$$\begin{aligned} {\left\{ \begin{array}{ll} 0\le p_{\textrm{FT}}\le 1\\ 0\le p_{\textrm{WW}}\le 1\\ 0\le p_{\textrm{WW}}+p_{\textrm{FT}}\le 1\\ 0\le t< 0.5 \end{array}\right. }. \end{aligned}$$

When \(p_{\textrm{WW}}+p_{\textrm{FT}}=1\) and \(t\rightarrow 0.5\), we can obtain that

$$\begin{aligned} \max (U_{\textrm{G}})=\frac{1}{2}nm^{2}. \end{aligned}$$
(22)

Equation (22) shows that the overall payoffs of trade networks are maximized when opportunism strategy is eliminated from the trade network and fair trade strategy is shown to be very inclusive.

The simulation of the network game

In this section, we simulate the evolution game behavior in the uncoupled international trade network. We conducted a series of experiments to change the parameters such as the strategies probability, tolerance and strategy update, which only affect the time for the game to reach equilibrium, but do not affect the evolutionary trend, and therefore do not affect the experimental conclusions of this paper. The empirical study29 showed that the international trade network is heterogeneous, but not a typical scale-free network. We generate a regular lattice network with 225 nodes which represent over 220 countries and regions around the world, and then make this network moderately heterogeneous by breaking edges and reconnecting nodes. The network game model in this paper can be described as

  1. (i)

    The initial state of the nodes. Suppose that, at game round t, the frequencies of the three strategies are \(\textrm{F}_{\textrm{OP}}(t), \textrm{F}_{\textrm{FT}}(t)\) and \(\textrm{F}_{\textrm{WW}}(t)\), respectively. In the initial state of the game, each node randomly selects the initial strategy from the three strategies.

  2. (ii)

    The game payoffs. Each game round, node i obtains the payoffs from his/her neighbors j in terms of the expected payoff matrix (Table 5). The normalized game payoff of node i can be defined as

    $$\begin{aligned} U_{i}=\frac{\sum _{j=\Omega _{i}}\pi (s_{i}, s_{j})}{k_{i}} \end{aligned}$$
    (23)

    where \(\Omega _{i}\) denotes the set of neighbors of node i, \(\pi (s_{i}, s_{j})\) is the strategies combination of node i and j. \(\sum _{j=\Omega _{i}}\pi (s_{i}, s_{j})\) denotes the cumulative payoffs obtained by node i game with neighbor j, and \(k_{i}\) denotes the number of connected edges of node i.

  3. (iii)

    The update rule of the game relations. Considering the relative stability of trade relations, we assume that node i reselects their neighbor nodes every 5 rounds of the game. In order to improve the benefit of trade game, node i randomly selects 10% of the neighbors whose average payoffs below that of the median of all neighbors, and breaks the connecting edges of the selected neighbors which contribute the least to node i’s payoffs. At the same time, node chooses the node with the highest cumulative payoff in the network to re-establish the new edges.

  4. (iv)

    The update rule of the game strategies. Considering the long-term nature of trade strategy, we assumed that 25 (or 200) rounds of the game are played before the strategy adjustment is made. The 20% of nodes whose average payoffs below that of the median of all neighbors are randomly selected, and these nodes learn the strategies from their neighborhood nodes with highest cumulative payoffs.

Firstly, we simulate the game of strategies dynamic evolution scenario. In the game model, the three strategies locate on the network and the game relationships are fixed. We study the influence of the decline of the tolerance “t” of the FT strategy on the payoffs of the three strategies and the whole network. Based on the realistic background, we suppose that the frequencies of the three strategies satisfies that \(\textrm{F}_{\textrm{OP}}(0)=0.4\), \(\textrm{F}_{\textrm{FT}}(0)=0.4\) and \(\textrm{F}_{\textrm{WW}}(0)=0.2\) and the initial value of the tolerance t = 0.3. In the evolutionary game, the value of the tolerance “t” decreases by 0.05 every 200 game rounds, the results after 1000 game rounds as is shown in Fig. 1. The results show that the decreases of the tolerance “t” can reduce the average payoffs of all strategies. Especially, the average payoffs decline of the \(\textrm{FT}\) strategy and the overall trade network are more significant. During the decreases of the tolerance “t”, the WW strategy replaces the FT strategy and the WW strategy becomes the largest contributor to the payoffs of the overall trade network.

Figure 1
Figure 1
Full size image

The game of strategies dynamic evolution scenario. (a) The changes of the average payoffs of the three strategies in the evolution game process. (b) The three strategies contribute to the payoffs of the overall trade network in the evolution game process.

In addition, we simulate the network structure dynamic evolution scenario. In reality, countries spontaneously adjust trade relations for higher trad payoffs or the political purposes, which makes the trade network structure dynamic evolution in the game. Suppose that, in the initial state, the frequencies of the three strategies satisfy that \(\textrm{F}_{\textrm{OP}}(0)=0.4\), \(\textrm{F}_{\textrm{FT}}(0)=0.4\) and \(\textrm{F}_{\textrm{WW}}(0)=0.2\) and the value of the tolerance \(t=0.1\). The changes of average degree and the average payoffs of the three strategies are recorded in Fig. 2. The results show that evolutionary game can promote the adjustment of trade network relationships and the differentiation of the node’s degree. In particular, the node degree differentiation is more significant at earlier game time. The network relationship adjustment is conducive to the WW strategy access to more trading partners and improve the average payoffs of all strategies. Comparison of Figs. 1 and 2, we know that, for \(t=0.1\), the average payoffs of the three strategies and the payoffs of overall trade network have significant increase. The result shows that the dynamic evolution of the network structure somewhat offsets the negative impact of the decrease in tolerance “t”.

In summary, the decrease of the tolerance “t” in the network with fixed structure led to a decline in the payoffs of all nodes, and the payoffs of FT strategies are damaged more obviously. However, the dynamic adjustment of trade relations can effectively enhance the payoffs of all node and the overall trade network and promote the differentiation of the number of node’s trade partners.

Figure 2
Figure 2
Full size image

The network structure dynamic evolution scenario. (a) The changes of average degree for different strategies in the evolution game process. (b) The changes of average payoffs of different strategies and the payoffs of overall trade network in the evolution game process.

Figure 3
Figure 3
Full size image

The network structure and the game strategies coevolution game scenario. (a) The overall payoffs of the trade networks in the coevolution game process. (b) The changes of the clustering coefficient of the network in the process of coevolution game.

Finally, we simulate the network structure and the game strategies coevolution game. In the long run, the trade strategies and the trade relations will be gradually updated through the learning mechanism, which form the self-organizing process of the trade network system. Supposing that the tolerance \(t = 0.1\), the trade relations are adjusted every 5 game rounds and the trade strategies are updated every 25 game rounds. The results of the coevolution game after 800 periods are shown in Figs. 3 and 4. Figure 3 shows that the growth of overall payoffs presents a similar “S” shape curve in the process of self-organization of the trade network. The overall trade payoffs grow faster and more volatile in the initial stage of the game and reach the state of convergence with payoffs-maximizing equilibrium eventually. At the same time, the clustering coefficient of the network is rapidly falling in the initial stage and then sharply rising, which corresponds to the self-organization process of deconstruction, reconstruction and gradual stabilization of the trade relations. The result can explain the overall payoffs of the trade network in the form of an “S” shape.

Figure 4a shows that the competitive relationships and the average payoffs of the three strategies in the coevolution game. At the initial game time, The WW strategy is at a quantitative disadvantage. However, after three stages of accumulation, rapid growth and stabilization, the WW strategy becomes the dominant strategy in the trade network. Figure 4b shows that the average payoffs of nodes are rising for all strategies during the accumulation, rapid growth stages of the WW strategy. The dominance of WW strategy leads to the presence of very few FT and OP strategies in the network, and the average payoffs of the two strategies decreases significantly. It is worth noting that the average payoffs of the WW strategy stabilize at the value of 0.5, which indicates that the fair distribution of trade payoffs is largely achieved.

Figure 4
Figure 4
Full size image

The network structure and the game strategies coevolution game scenario. (a) The number of nodes for different strategies in the coevolution game process. (b) The average payoffs of nodes for different strategies in the process of coevolution game.

Conclusion

The trade interests of various countries are dynamic changes and are difficult to measure them uniformly. The conceptual differences in the distribution of trade benefits are an important reason for the continuous fermentation of global trade friction in recent years. In practice, can fair trade promote trade cooperation among countries, enhance the benefits of global trade and achieve the equitable distribution of trade benefits? To answer this question, this paper establishes an evolutionary game model of international trade network based on trade policy differences. We draw the main conclusions through mathematical analysis and computer simulation.

The developed countries emphasize the concept of fair trade to uphold the established order of benefit distribution and their own status. The reduction of the tolerance “t” of fair trade hardly eliminates opportunism, but instead results in the loss of countries’ game payoffs and the overall payoffs of trade networks. Especially, the decline of game payoffs of the fair trade strategy is more significant. In the long term, only when the win-win trade strategy dominates the network can maximize the overall benefits of the trade network and achieve the balanced distribution of trade benefits. The results indicate that only by exploring more opportunities for trade cooperation, optimizing global resource allocation, and implementing the win-win trade strategy can we achieve truly fair trade.

The policy implication of above findings are as follows. In the current international environment where certain developed powers are exhibiting a declining inclusiveness in trade fairness, all countries need to participate in the reconstruction of global supply chains to seek improvement of trade benefits. A responsible global power should uphold the concept of openness and win-win cooperation, strengthen the capacity of all-round and full-value chain cooperation, and cooperate with countries with different endowments and stages of development, so as to create greater trade benefits sharing with the world.

Looking ahead, further research could focus on how a large number of developing countries can optimize trade partnerships by weighing immediate benefits against long-term benefits, and through the accumulation of long-term trade benefits and the evolution of trade concepts, grow from a “free-rider” role in global trade to a contributor to the benefits of global trade, and thus influence the pattern of global trade.