Introduction

Artificial intelligence (AI) serves as the core driving force behind the new wave of the Industrial Revolution and is expected to become a new engine for enhancing innovation efficiency (Li et al., 2023). The exceptional activities in global innovation within AI technology indicate that AI has become the focus of the attention and development of countries worldwide. Many countries have formulated development policies for the AI industry. For instance, the Select Committee on AI was established in the U.S. to accelerate the development of AI in 2018.Footnote 1 The 2021 Coordinated Plan on Artificial Intelligence aimed to promote the investment and policy adjustment of AI in the European Union.Footnote 2 In 2017, the Chinese government issued the New Generation of Artificial Intelligence Development Plan, pointing out that the competitiveness of the AI industry should reach the highest level in the world by 2030.Footnote 3 Further, clusters are important sources of regional competitive advantage (Porter, 2000). Since the knowledge externalities or spillovers caused by the geographical agglomeration of innovation are the reasons for the existence and success of clusters (Baptista and Swann, 1998), innovation has been regarded as a pivotal factor driving the progress of cluster development. AI innovation has developed pronounced clustering characteristics (e.g., Ho et al., 2021). Exploring ways to accelerate the development of AI clusters and enhance their innovation levels is highly important for driving industrial transformation and upgrading, as well as improving international competitive advantages.

Cluster innovation is considered a kind of network innovation (e.g., Kong et al., 2020). The global cluster network lens asserts the imperative of exploring interregional connections on a global scale beyond the singular local perspective (Bathelt and Li, 2014). Therefore, an AI cluster is a geographical innovation area formed through the local and translocal interactions of AI enterprises and related institutions. Moreover, a variety of methods, including location quotient (LQ) (e.g., Tian et al., 2020) and social network analysis (SNA) (e.g., Luo et al., 2023), have been applied to identify industrial clusters. The scientific and effective identification of clusters is thus a hot topic.

Innovation clusters, which serve as important drivers of national competitiveness and regional economic growth, have received extensive attention from policymakers globally (Kowalski, 2020). Under the mechanism of perfect competition, the phenomenon of market failure is inevitably encountered in the development of clusters. A well-positioned policymaker plays an indispensable role in compensating for market failures (Tian, 2020). Industry policies (IP) can generate innovative behaviours and enhance innovation capabilities (e.g., Zhao et al., 2019). Innovation thus serves as a crucial criterion for assessing the effectiveness of IP (Mallinson, 2021). However, policies may also lead to distortions in the allocation of innovation resources (Huang et al., 2024), exerting a negative impact on innovative activities. Therefore, when studying technological innovation (TI) within AI clusters, evaluating policy effectiveness is necessary, which provides a reference for the government in formulating cluster development plans.

A cluster is characterised as a complex network composed of firms, research institutions, and universities (Leydesdorff and Fritsch, 2006). As regions where knowledge production and knowledge spillovers coexist, AI clusters serve as effective spatial units for studying innovation network. Network centrality (NC) is a crucial metric that reflects the structure of a network and describes the extent to which a participant is closely connected to other participants (Chen et al., 2023). Based on social network theory, NC provides nodes with an advantage in acquiring knowledge and resources (Lubik et al., 2012), but maintaining this advantageous position necessitates continuous investment (Powell, 1998). Furthermore, the resources of a cluster must be allocated both vertically (intraregion) and horizontally (interregion). The current research on innovation networks among clusters is not comprehensive. Therefore, it is imperative to consider how TI is affected by IP under the influence of NC.

We propose the following research questions according to the above discussion: (1) Where are the AI clusters located? (2) How does IP affect TI within AI clusters? (3) What role does NC play in the aforementioned influence? The market scale of the AI industry in China will reach 1.73 trillion yuan ($240.4 billion) by 2035, which will account for 30.6% of the global total.Footnote 4 Therefore, we select China as an intriguing case with the aim of providing a reference for other countries. To be specific, by using patents from the AI industry in China from 2011 to 2020, this paper adopts LQ and SNA to identify AI clusters, and uses the dynamic panel system generalised method of moments (System-GMM) model to explore the influence of IP on TI and the moderating effect of NC.

This paper makes three contributions. First, scholars have identified clusters via LQ or SNA. However, few studies have concurrently combined these two methods to identify clusters. Based on the global cluster network lens, this paper identifies AI clusters from the perspectives of spatial agglomeration and innovation connection. This identification method simultaneously examines innovation linkages within and across regions. Second, previous research has paid attention to the impact of IP on industrial innovation. A unified conclusion remains elusive with regard to whether IP has a beneficial or adverse influence on innovation. In fact, efficient resource allocation can be achieved in clusters due to the integration of business processes among members (Gattorna, 2016). Clusters are more innovative than other regions (Porter, 2000). Clusters are therefore representative of the policy-innovation research framework. We uncover the positive effect of IP on TI from the perspective of clusters rather than that of the whole industry. Third, based on the establishment of interregional networks, this paper reveals that NC can negatively moderate the IP–TI link. Given the scarcity of NC as a moderator in the policy research field, our research contributes to a novel theoretical lens that elucidates the influence of social networks on the function of IP in clusters.

The rest of this paper is organised as follows: Section 2 “Theoretical foundation and mechanistic analysis” imparts the theoretical foundation and mechanistic analysis. The methodology, which includes the identification of clusters, variables, and an empirical approach, is given in Section 3 “Methodology”. The empirical results are shown in Section 4 “Empirical results”. Section 5 “Discussion” presents the discussion. The implications, limitations, and directions for future research are given in Section 6 “Implications, limitations and future research”. The conclusions are summarised in Section 7 “Conclusions”.

Theoretical foundation and mechanistic analysis

Theoretical foundation

Global cluster network lens

For high-tech industries, the network model has become a typical development pattern (Chen et al., 2023). Both intraregional and interregional collaboration can have a decisive impact on regional innovation (Sun and Cao, 2015). It is essential to establish and maintain an innovation network. A cluster is a complex network composed of firms, research institutions, and universities (Leydesdorff and Fritsch, 2006). Many scholars have emphasised the importance of interregional connections, arguing that understanding how to sustain competitiveness by transcending the boundaries of clusters is of paramount significance (e.g., Fitjar and Rodríguez-Pose, 2011). The global cluster network lens asserts the imperative of exploring interregional connections on a global scale beyond the singular local perspective (Bathelt and Li, 2014). This shift from a single cluster to multiple clusters is a deepening of “from local buzzing to global pipeline”.Footnote 5 This paper thereby defines an AI cluster as an innovation-agglomerated area formed through the local and translocal interactions of enterprises and related institutions in the AI sector.

Government intervention theory

Market failures are inevitable in the process of industrial development and evolution. It is hard for regions to foster industrial innovation solely by depending on market mechanisms, thus necessitating government intervention. The objective of government intervention is to enhance the effectiveness of the market mechanism in allocating resources. According to government intervention theory, there are two opposing viewpoints regarding the influence of government intervention on innovation. From a validity viewpoint, government intervention can alleviate the polarization of innovation resources generated by market failures, thus achieving the efficient allocation of innovative resources (Wonglimpiyarat, 2011). A number of current studies have confirmed the positive effect of government intervention on innovation in some high-tech industries. Lin and Luan (2020) and Li et al. (2022) reported the positive impact of government subsidies on innovation in the photovoltaic industry and the new energy vehicle industry, respectively. Lin and Chen (2023) noted that the feed-in tariff policy facilitated innovation in the wind power industry. On the contrary, the invalidity viewpoint suggests that market mechanisms may be disrupted by government intervention, leading to misallocations of innovative resources (Acemoglu et al., 2018). In particular, mismatches in innovative resource allocation are pronounced in high-tech industries, which are subject to government intervention (Zhang et al., 2021). Hong et al. (2016) contended that innovation efficiency could be weakened by government grants in high-tech industries. This conclusion was verified in the integrated circuit industry (e.g., Song and Wen, 2023). Although the IP–TI relationship is noteworthy, scholars have focused on the whole industry. The impact of IP on TI in the AI industry has not been discussed from a cluster perspective. Hence, it is imperative to explore how IP affects TI in AI clusters.

Social network theory

Social network theory proposes that the connections among interdependent social actors have a substantial influence on each actor in a network. In the high-tech sector, the number of various types of cooperative relationships between organizations is increasing (Vermeer and Thomas, 2020), forming intricate innovation networks, such as the innovation network of the integrated circuit industry (e.g., Chen et al., 2023). NC is a crucial metric that reflects the structure of a network, describing the extent to which a participant is closely connected to other participants (Chen et al., 2023). As higher NC facilitates the acquisition of information and knowledge from network neighbours (Yao et al., 2020), the NC of a cluster determines the extent and level of cooperation with other regions. Moreover, the central position requires significant investment to maintain (Powell, 1998). Network members need to continually invest in maintaining cooperative relationships from the perspective of the social costs involved in constructing a social network. Specifically, nodes with high centrality require long-term investments of capital and labour to maintain cooperation with other nodes (Wang et al., 2024). Hence, when studying the TI of AI clusters, the significant role of NC cannot be overlooked.

Above all, the TI of AI clusters cannot be separated from IP and NC, but whether IP and NC have beneficial or adverse influences on innovation remains puzzling. In the next section, we further explore the internal mechanism of the impact of IP on TI and the moderating role of NC.

Mechanistic analysis

Innovative activity in clusters is a complex process characterised by accumulation, continuity, long-term cycles, and high risk. IP influences resource allocation through both capital and labour, thereby promoting cluster innovation. In terms of capital, IP can directly impact innovation by filling funding gaps (e.g., Almus and Czarnitzki, 2003). Moreover, IP can convey economic signals to the market (Yan et al., 2022). More investors can be attracted by such signals to fund fields favoured by policies (Kleer, 2010; Wu, 2017), providing capital support for TI. In terms of labour, human resources serve as a critical factor for fostering innovation (Sanz-Valle and Jiménez-Jiménez, 2018). At present, the AI industry is confronted with a substantial talent deficit. In particular, there will be a shortage of four million people for AI roles in China by 2030.Footnote 6 On the one hand, talent introduction can inject outstanding professionals into the AI industry, thereby stimulating innovative vitality. On the other hand, incentive subsidies provide substantial compensation to talent, motivating individuals to accelerate the pace of innovation.

However, IP can lead to a misallocation of capital and labour, which is a main barrier to advancing the quality of TI (Huang et al., 2024). Government intervention hampers the free flow of capital from less productive enterprises to more productive ones, thereby allowing some less productive enterprises to remain profitable and survive at lower capital prices, which impedes the effective allocation of innovation capital in a cross-section, referred to as “connotation misallocation” (Banerjee and Moll, 2010). Policy bias can lead to “extension misallocation” due to the influence of the “survival of the fittest” process, which is characterised by a loss in innovative efficiency and a decline in innovative capability (Zhang et al., 2021). Moreover, policy intervention in labour mobility results in the distortion of labour allocation. Policy restrictions on the free flow of talent impede the conduct of innovative activities. Stringent labour market control diminishes the impetus for enterprise innovation by escalating the cost of talent. In particular, the negative effect of a distorted labour market is more pronounced in developing countries than in developed ones (Zhang et al., 2021).

In accordance with social network theory, NC provides nodes with an advantage in acquiring knowledge and resources (Lubik et al., 2012), but maintaining this advantageous position necessitates continuous investment (Powell, 1998). Capital and labour are two important inputs, and their levels are closely associated with IP (Fu and Mu, 2014). Hence, NC can influence the role of IP in fostering innovation. Social capital is divided into internal social capital (ISC) and external social capital (ESC). ISC refers to the resources provided by internal social relationships, whereas the connections of external relationships provide ESC. A well-developed external network is able to provide R&D resources for the internal network (Helble and Chong, 2004). That is, ESC can bring about more resources than ISC. Clusters with high centrality have more channels to acquire ESC, which directly facilitates the filling of funding gaps and the introduction of talent, while amplifying the economic signalling and talent incentive roles of IP. TI can thus be continually achieved. Nevertheless, clusters need to trade off the allocation of resources between intraregional and interregional activities. Interregional cooperation may crowd out local resources, resulting in a shortage of funds and a loss of talent within clusters. That is, clusters with high NC may not suffice to balance interregional collaboration, weakening the positive influence of IP on TI. To sum up, the moderating direction of NC depends on the interplay between the aforementioned two forces.

The mechanism of our paper is summarised in Fig. 1.

Fig. 1: Mechanism map.
Fig. 1: Mechanism map.
Full size image

The figure is created using Microsoft Visio.

Methodology

Identification of clusters

Space unit selection

There are no clear criteria for defining the geographical boundaries of industrial clusters. Using spatial data based on administrative units to delineate the scope of clusters is an effective measure for quantitatively identifying industrial clusters (e.g., Brenner, 2006). In China, administrative regions at or above the prefecture level are commonly used as cluster boundaries.Footnote 7 Therefore, we select administrative regions at the prefecture level and above in the Chinese mainland as the spatial units.Footnote 8

Data source

Patents stand as a critical representation of innovation (Wang et al., 2024). According to the definition of AI clusters in Section “Global cluster network lens”, this article uses patents as the data for identifying AI clusters. The steps for acquiring data are given as follows:

Step 1: The Relationship Table of Strategic Emerging Industry Classification and International Patent Classification (2021) (Trial), released by the China National Intellectual Property Administration (CNIPA), lists the patent codes for the AI industry.Footnote 9 We retrieve patent applications on the patent platform of the CNIPA according to the patent codes.Footnote 10 Step 2: Since individuals do not contain address information, patents involving individual applicants are excluded. Step 3: To ensure the availability of data for regression analysis, we select patentees located in the Chinese mainland.

Moreover, the Decision of the State Council on Accelerating the Cultivation and Development of Strategic Emerging Industries, issued in October 2010, repeatedly emphasised smart manufacturingFootnote 11, which enhanced the development of the AI industry. Furthermore, there is a lag between the application and publication of patents. We thereby set the study period from 2011 to 2020. Ultimately, we obtain a total of 600,811Footnote 12 patents, of which 555,062 are independent patents and 45,749 are cooperative patents.Footnote 13,Footnote 14

Steps for identification

A broad range of research has identified and analysed clusters via different methods. Some scholars have used quantitative methods to identify clusters, such as LQ (e.g., Tian et al., 2020). Further, since the focus of research on innovation has switched from aggregate-level diffusion to social network (Muller and Peres, 2019), SNA has been applied to cluster research (e.g., Luo et al., 2023). In fact, agglomeration and network are two important dimensions for industrial clusters. The essence of clusters is an innovation network based on specialised agglomeration. Therefore, the identification of AI clusters should be considered comprehensively from the aspects of spatial agglomeration and industrial linkage. Referring to the study of Zhou et al. (2019), we use LQ and SNA to identify AI clusters. To be specific, there are three steps for identification.

  • Step 1. Delimiting the spatial agglomeration areas.

    The LQ of innovation output can be used to determine whether there is innovation agglomeration in a region. The formula of LQ is presented as follows:

    $${{\rm{LQ}}}_{{\rm{i}}}=\frac{\frac{{{\rm{A}}}_{{\rm{i}}}}{{{\rm{P}}}_{{\rm{i}}}}}{\frac{{\rm{A}}}{{\rm{P}}}}$$
    (1)

    where \({\rm{i}}\) represents region. \({{\rm{LQ}}}_{{\rm{i}}}\) is the location quotient of region \({\rm{i}}\). The number of patent applications of the AI industry in region \({\rm{i}}\) is denoted by \({{\rm{A}}}_{{\rm{i}}}\). \({{\rm{P}}}_{{\rm{i}}}\) is the number of patent applications in region \({\rm{i}}\). \({\rm{A}}\) represents the number of patent applications of the AI industry. \({\rm{P}}\) is the number of patent applications in the nation as a whole. Generally, a region with a value of LQ greater than 1 is considered to be specialised in a specific industry (Kowalewksi, 2015). The LQ of a region is measured through the mean number of patent applications from 2011 to 2020, which can avoid large fluctuations in data from a single year that discredit the results.

  • Step 2. Identifying the potential clusters.

    The AI industry in some regions has a small scale and low innovation output. However, compared with other industries, the AI industry accounts for a larger proportion in these regions, resulting in a high value of LQ. These regions may be misperceived as potential clusters. Galliano et al. (2015) believed that potential clusters could be identified by setting a specialization threshold. Drawing on the research of (Zhou et al., 2019), we set the 80th percentile of patent applications across all regions as the lower threshold limit. Spatial agglomeration areas that do not reach the threshold are excluded, and the remaining regions are considered potential clusters.

  • Step 3. Measuring the innovation linkages of potential clusters.

Collaborative patents can share, overflow, and transfer technological knowledge, which is an open innovation of technological knowledge flow and innovation resource integration based on relational embeddedness. We match the addresses of the organizations that filed the joint patent applications with the space units (see Section “Space unit selection”). Networks between innovation organizations are transformed into intraregional and interregional networks containing potential clusters. This paper takes potential clusters that simultaneously have intraregional and interregional innovation linkages as AI clusters.

Variables

Dependent variable

Patents are categorised into three types in China: invention, utility model, and design (Chen et al., 2022). Invention patents pursue radical changes. Patents granted have to undergo specific processes of examination and approval, resulting in longer time delays (Wang and Wang, 2024). Comparatively, patent applications are more representative of innovative vitality, as they are more proximate to the time period during which innovation activities took place (Ai et al., 2024). TI is thus measured by the number of invention patent applications in this paper. Furthermore, to accurately quantify the level of TI, independent patents are counted directly by number. With respect to cooperative patents, the number of patents is divided equally according to the patentees. For instance, if there are three patentees for one patent, the TI of the cluster corresponding to each patentee is recorded as 1/3.

Independent variable

This paper uses the number of incentive policies enacted by local governments to measure IP (e.g., Wang et al., 2024). We search for policies in the Pkulaw database by using “Artificial Intelligence” as the keyword in the title and full text.Footnote 15 Policies in China are issued mostly in the form of local regulations and local government rules rather than laws. Therefore, local regulations and rules, as well as local working documents, are chosen as policy texts. We obtain 6758 policy texts from 2011 to 2020. Then, we match each policy to its corresponding cluster based on the document-making organ of the policy. For clusters that are not municipalities directly governed by the central government, the number of policy texts at the provincial level is summed up to the prefecture-level level.

Moderating variable

NC is selected as the moderating variable in this research. Closeness centrality, degree centrality, and betweenness centrality are three indicators used to evaluate NC. Although these three indicators can illustrate the role of nodes in a social network, their definitions and measurements are different. Closeness centrality assesses a node’s accessibility to all other nodes via the shortest paths (Crucitti et al., 2006). That is, the closeness centrality of a node indicates how quickly it can access other nodes. Moreover, the effective information acquisition and competitive advantage of a node can be achieved via closeness centrality (Kim, 2019). Therefore, closeness centrality is adopted to assess NC. The equation for closeness centrality is shown as follows (Chen et al., 2023):

$${{\rm{NC}}}_{{\rm{i}}}=\frac{{\rm{n}}-1}{{\sum }_{{\rm{j}}=1}^{{\rm{n}}}{{\rm{d}}}_{{\rm{ij}}}}$$
(2)

where \({{\rm{NC}}}_{{\rm{i}}}\) stands for the closeness centrality of region \({\rm{i}}\). \({\rm{n}}\) is the overall region count. The shortest length connecting regions \({\rm{i}}\) and j is represented by \({{\rm{d}}}_{{\rm{ij}}}\). Regions with higher NC are more accessible to additional nodes.

Control variables

To circumvent the endogeneity concerns caused by city characteristics, this paper adds economic development (ED), infrastructure construction (IC), and innovation environment (IE) as control variables. The per capita gross domestic product can be used to measure ED (e.g., Gao and Yuan, 2022). The per capita road area is the metric for IC (e.g., Wang et al., 2022). Financial expenditures on science and technology act as positive elements that influence IE (Guo et al., 2024); thus, this research uses fiscal expenditures on science to express IE. The data for the control variables are derived from the “China Regional Economic Database” and the “China Urban Database” in the Express Professional Superior (EPS). The “China Urban Statistical Yearbook” and the “China Statistical Yearbook” help supplement missing data.

All the variables are logarithmically transformed before regression. Notably, some variables cannot carry out this transition directly since they contain at least one zero value. We add 1 to all of the values of the corresponding variables before logarithmic transformation (e.g., Chen and Wang, 2022b).

Empirical approach

To alleviate the estimation bias caused by endogeneity, this research utilizes the System-GMM method proposed by Blundell and Bond (1998) to investigate the impact of IP on TI and to identify the moderating role of NC in the IP–TI link. We construct the model shown in Eq. (3).

$${{\rm{TI}}}_{{\rm{it}}}={{\rm{\alpha }}}_{0}+{{\rm{\alpha }}}_{1}{{\rm{TI}}}_{{\rm{it}}-1}+{{\rm{\alpha }}}_{2}{{\rm{IP}}}_{{\rm{it}}}+{{\rm{\alpha }}}_{3}{{\rm{NC}}}_{{\rm{it}}}+{{\rm{\alpha }}}_{4}{{\rm{IP}}}_{{\rm{it}}}* {{\rm{NC}}}_{{\rm{it}}}+{{\rm{\beta }}{\rm{Control}}}_{{\rm{it}}}+{{\rm{\eta }}}_{{\rm{i}}}+{{\rm{\varepsilon }}}_{{\rm{it}}}$$
(3)

where the cluster is denoted by \({\rm{i}}\). \({\rm{t}}\) is the year. \({{\rm{TI}}}_{{\rm{it}}}\) represents TI, and \({{\rm{TI}}}_{{\rm{it}}-1}\) is the lagged one-period term of TI. \({{\rm{IP}}}_{{\rm{it}}}\) is IP. \({{\rm{NC}}}_{{\rm{it}}}\) refers to NC. The control variables are represented as Controlit. α0 is the constant term. \({{\rm{\eta }}}_{{\rm{i}}}\) denotes a fixed effect of cluster. The random error term is represented by \({{\rm{\varepsilon }}}_{{\rm{it}}}\).

Strictly exogenous instruments are not necessary when selecting lagged variables as instruments; thus the System-GMM method can address endogeneity issues (Chen and Wang, 2022b). Specifically, the lagged one-period term of the dependent variable is treated as the instrumental variable (IV) (e.g., Chen and Wang, 2022a). Referring to the practice of Ye and Liu (2020), we regard the lagged one-period term of the dependent variable and independent variable as endogenous variables, and control variables as exogenous variables. Two-step (System-GMM) methodology is adopted to estimate the coefficient values.

Furthermore, when the model does not include the moderating variable, the panel data are balanced. In the case of considering the moderating variable, since noncluster nodes are included in the innovation networks, we only put the data of cluster nodes into the regression after calculating the NC of each node within the networks. Panel data thus becomes unbalanced.Footnote 16 The interaction term of IP and NC (i.e., \({{\rm{IP}}}_{{\rm{it}}}* {{\rm{NC}}}_{{\rm{it}}}\)) is centralised before regression to ensure the estimation accuracy of the moderating effect (e.g., He et al., 2022).

Empirical results

Cluster identification

This research calculates the LQ of innovation output across the administrative regions in the AI industry. The results show that there are thirty-three cities whose LQ is greater than 1.Footnote 17 Four regions (Fuxin, Jilin, Sanya, and Shanwei) that do not meet the threshold of the innovation scale are excluded.Footnote 18 The twenty-nine retained regions (Beijing, Shenzhen, Nanjing, Hangzhou, Xi’an, Shanghai, Wuhan, Harbin, Zhuhai, Guangzhou, Guilin, Jinan, Chengdu, Hefei, Changsha, Nanchang, Qinhuangdao, Shenyang, Changchun, Fuzhou, Xiamen, Mianyang, Kunming, Dalian, Dongguan, Chongqing, Tianjin, Taiyuan, and Zhengzhou) are regarded as potential AI clusters. The potential AI clusters are illustrated in Fig. 2.

Fig. 2: Potential AI clusters.
Fig. 2: Potential AI clusters.
Full size image

The figure is drawn using Microsoft Excel.

To investigate the local and external innovation connections of potential AI clusters, we calculate the total volume of intraregional and interregional collaborations for these clusters, respectively. The outcomes are depicted in Table 1. The high total volume of intraregional collaborations in clusters such as Beijing, Shenzhen, Nanjing, Guangzhou, Shanghai, Hangzhou, Zhuhai, Wuhan, Chengdu, Tianjin, Jinan, and others indicates their focus on internal innovation activities. The high total volume of interregional collaborations among clusters such as Beijing, Nanjing, Shenzhen, Hangzhou, Shanghai, Guangzhou, Wuhan, Chengdu, Jinan, Tianjin, Nanchang, Hefei, and others indicates their emphasis on external resource acquisition. In conclusion, all twenty-nine potential AI clusters have both intraregional and interregional innovation collaborations, satisfying the criteria for cluster identification.

Table 1 Local and external innovation connections.

Because the total volume of interregional collaborations exceeds that of intraregional collaborations in each AI cluster, we can conclude that interregional cooperation is the main form of collaboration for AI clusters. Further, the innovation network of interregional cooperation is constructed in this research. To be specific, we match each region involved in interregional cooperation patents with other regions.Footnote 19 Connections containing the AI clusters are regarded as network edges. Using the total collaboration volume as the weight of network nodes and the number of collaborations as the weight of network edges, we depict the innovation network for the AI clusters in Fig. 3. We find that the innovation network containing the AI clusters is highly dense. Beijing occupies an absolutely central position. The roles of Nanjing, Hangzhou, and Shanghai as significant cities in Eastern China are prominently highlighted. Shenzhen and Guangzhou, two major cities in Southern China, serve as pivotal hubs. Wuhan and Chengdu, representing Central and Western China, respectively, actively engage in innovation collaborations with other areas.

Fig. 3: Innovation network containing the AI clusters.
Fig. 3: Innovation network containing the AI clusters.
Full size image

The figure is generated in VOSviewer software.

Descriptive statistics

This study performs descriptive statistics for all the variables. The outcomes are displayed in Table 2. The mean values of TI, IP, and NC are 5.898, 2.006, and 2.682, respectively. Moreover, the standard deviations of TI and IP are relatively high, which elucidates that gaps in both TI and IP exist across different AI clusters.

Table 2 Results of descriptive statistics.

Correlation analysis

We observe a positive correlation between IP and TI from Table 3, which indicates that IP may enhance TI in the AI clusters. The correlation coefficients between each variable are less than 0.800. In addition, this research conducts a variance inflation factor (VIF) test. The maximum VIF of 3.410 is lower than 10, illustrating that there is no severe concern about multicollinearity in this study (e.g., Chen et al., 2024).

Table 3 Results of the Pearson correlation matrix.

Regression analysis

The results of regression analysis of IP and TI are reported in Table 4. All p-values of AR(1) are less than 0.1, while those of AR(2) are greater than 0.1, indicating the presence of first-order autocorrelation but the absence of second-order autocorrelation in the random disturbance term. We thus accept the null hypothesis of “no autocorrelation in the random disturbance term”. Furthermore, the p-values of the Sargan test are all greater than 0.1, which illustrates that the problem of overidentification of instrumental variables does not exist, thereby allowing for estimation using System-GMM.

Table 4 Results of the System-GMM.

Model l contains only the control variables. The results displayed in Model 2 indicate that the coefficient of IP on TI is 0.037 and significant. IP can promote TI in the AI clusters. That is, the positive effects of IP on TI dominate over the negative ones. Moving on to Model 3, the interaction between IP and NC (i.e., IP*NC) is significantly negative, which illustrates that the high NC of clusters diminishes the positive impact of IP on TI in the AI clusters. Further, this paper visualizes the moderating effect of NC on the IP–TI connection. As shown in Fig. 4, IP responds to less TI with a high value of NC.

Fig. 4: Diagram of the moderating effect of NC.
Fig. 4: Diagram of the moderating effect of NC.
Full size image

The figure is produced using Microsoft Excel.

Robustness checks

Sensitivity analysis

Quartiles are a scientifically and widely used classification method. In this section, we use the third quartile (i.e., the 75th percentile) instead of the 80th percentile for reidentifying potential clusters. The 75th percentile of patent applications for all regions is 386. The four regions (Fuxin, Jilin, Sanya, and Shanwei) are still excluded (see Supplementary Table B), which is consistent with the conclusion in Section “Cluster identification”. In addition, to avoid overlapping with the 75th percentile, we allow a fluctuation of ±4 percentage points around the 80th percentile and conduct multiple tests between the 76th and 84th percentiles. The results are shown in Fig. 5. The threshold for patent applications falls between 374 and 969 when a certain degree of error exists, and the aforementioned four regions remain excluded. Therefore, the conclusion of the twenty-nine AI clusters identified in this paper is robust.

Fig. 5: Diagram of sensitivity analysis.
Fig. 5: Diagram of sensitivity analysis.
Full size image

The figure is produced using Microsoft Excel.

Placebo test

Conducting a placebo test serves to rule out the possibility that random factors unrelated to IP account for the observed effects, thereby guaranteeing a more reliable estimation. Drawing on the study of Zeng et al. (2024), we randomly shuffle IP across all clusters and then estimate the impact of randomly assigned IP on TI using System-GMM regression. This procedure is repeated 1000 times. If the correlation between IP and TI still exists, the previous conclusions are due to randomness rather than IP. The kernel density of the estimated coefficients plotted in Fig. 6 indicates that the coefficient of IP is 0.002, which is far from the real IP coefficient of 0.037. Therefore, the improvement of TI is driven by IP, not by random factors.

Fig. 6: Results of placebo test.
Fig. 6: Results of placebo test.
Full size image

The red solid line represents the random IP coefficient, whereas the red dotted line denotes the real IP coefficient. The figure is produced using Stata software.

Policy penetration

Measuring IP by the number of incentive policies may not reflect implementation rigor. Enterprises act as the principal agents of innovation (Lei and Xie, 2023). To assess the impact of IP more accurately, we examine whether IP effectively reaches enterprises in practice. To be specific, the penetration process for IP is achieved in the following sequence: the release of IP documents, the government support (GS) obtained by enterprises, and the generation of TI. Drawing on the study of Xu et al. (2023), this article constructs the following mediation effect model:

$${{\rm{TI}}^{\wedge}}_{\rm{jt}}={\varphi }_{1}+{\lambda }_{1}{{{\rm{TI}}}^{\wedge}}_{{\rm{jt}-1}}+{{\rm{cIP}}}_{{\rm{jt}}}+{\gamma }_{11}{{\rm{Control}}}_{{\rm{jt}}}+{\eta }_{{\rm{j}}}+{\varepsilon }_{{\rm{jt}}}$$
(4)
$${\rm{G}}{S}_{{\rm{jt}}}={\varphi }_{2}+{\lambda }_{2}{\rm{G}}{S}_{{\rm{jt}}-1}+{{\rm{aIP}}}_{{\rm{jt}}}+{\gamma }_{21}{{\rm{Control}}}_{{\rm{jt}}}+{\eta }_{{\rm{j}}}+{\varepsilon }_{{\rm{jt}}}$$
(5)
$${{\rm{TI}}^{\wedge}}_{\rm{jt}}={\varphi }_{3}+{\lambda }_{3}{{{\rm{TI}}}^{\wedge}}_{{\rm{jt}-1}}+{{\rm{c}}^{\prime} {\rm{IP}}}_{{\rm{jt}}}+{\rm{bG}}{S}_{{\rm{jt}}}+{\gamma }_{31}{{\rm{Control}}}_{{\rm{jt}}}+{\eta }_{{\rm{j}}}+{\varepsilon }_{{\rm{jt}}}$$
(6)

where \({{{\rm{TI}}}^{\wedge}}_{\rm{jt}}\) represents the number of invention patent applications of enterprise \({\rm{j}}\) in year \({\rm{t}}\). \({{\rm{IP}}}_{{\rm{jt}}}\) is the number of incentive policies in the cluster where enterprise \({\rm{j}}\) is located. \({{\rm{GS}}}_{{\rm{jt}}}\) is measured by the government subsidies of enterprise \({\rm{j}}\) (e.g., Chen and Wang, 2022b). \({\text{Control}}_{\text{jt}}\) represents control variables, including enterprise profitability (EP), capital structure (CS), and operating capacity (OC). EP is calculated by the return on total assets (e.g., Chen and Wang, 2022a). The asset‒liability ratio stands for CS (e.g., Peng and Tao, 2022). OC is represented by the asset turnover ratio (e.g., Xu and Chen, 2020). The other symbols serve the same purpose as Eq. (3).

The mediation mechanism is plotted in Fig. 7. If the coefficients c, a, and b are all significant, GS has a mediating effect (Zhong and Zhang, 2024).

Fig. 7: Mediation effect of GS.
Fig. 7: Mediation effect of GS.
Full size image

The figure is created using Microsoft Visio.

We acquire an AI enterprise list from the China Stock Market and Accounting Research (CSMAR) database.Footnote 20 Based on geographical location, we select 125 firms within clusters as our research samples.Footnote 21 The enterprise data come from the CSMAR database. System-GMM estimation is applied to investigate the mediating impact of GS. As displayed in Table 5, all p-values for AR(1) are less than 0.1, while those for AR(2) and the Sargan test exceed 0.1; thus, the adoption of System-GMM is appropriate. The results show that the coefficients for IP → TI^, IP → GS, and GS → TI^ are all significantly positive at the 1% level. Hence, IP promotes TI^ by strengthening GS. The conclusion verifies the penetration effect of IP.

Table 5 Results of the mediating effect of GS.

Changing the regression model

Poisson regression is usually used for models where the dependent variable is count data. In this study, the dependent variable is “excessively dispersed”, i.e., the mean is less than the variance.Footnote 22 We thereby adopt negative binomial regression. The results displayed in Table 6 indicate that the coefficient of IP is significantly positive and that the coefficient of interaction between IP and NC is significantly negative. Therefore, the empirical outcomes of this research are reliable.

Table 6 Results of negative binomial regression.

Discussion

First, considering that SNA is based on the global cluster network lens, this paper recognizes twenty-nine AI clusters from the perspectives of spatial agglomeration and innovation connections. In fact, the clusters identified in this paper show some alignment with the AI regions currently prioritised for development in China. For instance, the National New Generation AI Innovation and Development Pilot Zones and the National AI Innovation and Application Pilot Zones mainly take the clusters identified as the core bearing areas.Footnote 23,Footnote 24 Nearly half of China’s listed AI companies are concentrated in the twenty-nine AI clusters. Moreover, we discover that interregional innovation collaborations outnumber intraregional ones for every AI cluster. Unlike traditional cluster theory’s emphasis on geographically proximate intraregional collaboration, our findings highlight the critical role of interregional linkages in forming clusters. An investigation into the innovation performance of southwestern Norway provides support for our conclusion (Fitjar and Rodríguez-Pose, 2011). Hence, the validity of the global cluster network lens and the rationality of the joint identification method are elucidated.

Second, this paper reveals that IP can promote TI in the AI clusters. This positive relationship was verified in the photovoltaic industry (e.g., Lin and Luan, 2020), the new energy vehicle industry (e.g., Li et al., 2022), and the wind power industry (e.g., Lin and Chen, 2023). In contrast, financial subsidies inhibited TI in the integrated circuit industry (e.g., Song and Wen, 2023). As each industry displays its own specificity in terms of innovative development, the impact of IP on TI exhibits industry heterogeneity (Wang et al., 2024). Specifically, encompassing frontier fields such as machine learning, deep learning, and natural language processing, the AI industry is characterised by great difficulty and uncertainty. Experiences from various nations (e.g., Canada, South Korea, and the UK) indicate that the AI industry requires robust policy support to ensure adequate provision of capital and labour (Lauterbach, 2019), which provides clues for understanding the mechanism between IP and TI. Further, we examine the penetration effect of IP (see Section “Policy penetration”), confirming our findings from the perspective of enterprises.

Third, this paper takes NC into the research framework of IP and TI. As discussed in Section “Mechanistic analysis”, the moderating direction of NC is uncertain. Some studies have shown that NC is conducive to the development of innovation by exerting a moderating effect (e.g., Shi and Zhang, 2019). However, our study reaches the opposite conclusion, suggesting that high NC is not always advantageous, which is consistent with the viewpoint of Lyu et al. (2020) and Lu et al. (2023). In fact, cultivating innovation in the AI sector requires substantial capital and highly skilled labour. On the one hand, clusters with high NC are predominantly situated in relatively developed regions. Procedures for obtaining ESC are more complex in these regions, such as multiple approval steps for capital absorption and rigorous talent qualification standards, resulting in diminished efficiency in acquiring ESC. On the other hand, regions collaborating with high-NC clusters are comparatively less developed. These regions absorb ESC by lowering resource access thresholds and simplifying procedures, which accelerates capital shortage and brain drain in high-NC clusters. Therefore, the crowding-out effect in resource allocation caused by interregional cooperation exceeds the power of ESC acquisition, manifesting as a negative moderation effect of NC.

Implications, limitations and future research

Theoretical implications

Several theoretical implications are presented as follows: First, this paper identifies AI clusters from the perspectives of spatial agglomeration and innovation connections. This approach moves beyond two strands of literature: (1) agglomeration-based cluster research (e.g., Tian et al., 2020) and (2) SNA for cluster studies (e.g., Luo et al., 2023). This synthesis probes the domain of innovation linkages among clusters to validate the global cluster network lens. Second, our concern for AI clusters unquestionably aids in further understanding this pioneering industry. This paper extends beyond the current research on the IP–TI link, which focuses on the entire industry (e.g., Lin and Luan, 2020; Li et al., 2022; Lin and Chen, 2023; Song and Wen, 2023). By identifying the positive IP–TI nexus from a cluster perspective, this research expands government intervention theory in the field of AI clusters. Third, this paper highlights the significance of NC in the IP–TI link. Given the scarcity of NC as a moderator in the policy research field, the findings contribute to a novel theoretical lens that elucidates the influence of social networks on the role that IP plays in TI for clusters, which deepens the conceptual grasp of social network theory.

Practical implications

Some practical implications can be drawn. On the one hand, the positive IP–TI connection has been verified in this paper; thus, government power should be injected into the development of AI clusters. Regarding capital, government-led AI industry funds should be established to finance strategic R&D initiatives in core technologies. With respect to labour, the government should attract talent to the AI industry through measures such as targeted talent subsidies while enhancing career advancement mechanisms and recognition systems to stimulate innovative vitality. Moreover, AI clusters should seek the support of IP. For example, organizations within clusters could establish a regular communication mechanism to accurately convey industrial demands to government departments. Enterprises should leverage their collective technological strengths to actively secure industry-specific development funds. On the other hand, since NC is not conducive to the beneficial effect of IP on TI, the government should encourage innovation cooperation within AI clusters by providing special subsidies for joint R&D projects, establishing industry-university-research cooperation platforms, and instituting structured knowledge-sharing forums. Governments should conduct regular analysis of intra- and interregional innovation cooperation and dynamically adjust IP according to the imbalance of cooperation. In addition, AI clusters must reasonably trade off the allocation of resources between intraregional and interregional activities. Organizations within clusters should focus on fostering internal innovation cooperation. If interregional cooperation is necessary, evaluating and seeking potential partners based on R&D alignment and technological complementarity can reduce the costs of innovation.

Limitations and future research

This research has several limitations, and future research can be improved in the following aspects: First, owing to data availability constraints, we measure TI solely based on invention patent applications. However, TI in the AI industry is inherently multifaceted. Future research could collect primary data to comprehensively measure TI by supplementing with indicators, such as R&D outputs, commercial performance, and practical technological applications. Second, this study identifies only AI industry clusters. We encourage future research to extend the research framework to other high-tech industries, such as nanotechnology and 3D printing. Third, owing to space limitations, this paper detects only the moderating effect of NC in the IP–TI link. Future research could further investigate other moderating factors, such as structural holes, which also serves as an important indicator reflecting network characteristics. Fourth, IP possesses the distinctive features of China in this paper. Whether the research conclusions are applicable to other countries and regions remains uncertain. For comparison with the findings of our study, further research requires utilising datasets from other countries and regions to obtain additional empirical evidence. Finally, this study focuses on investigating whether IP can influence TI. We do not consider the humanistic implications of technological advancements. To advance AI innovation, researchers are encouraged to examine human-machine interaction—a key frontier in AI research (e.g., Schleidgen et al., 2023; Ho and Vuong, 2024).

Conclusions

This paper starts from a global concern about how to accelerate the development of AI clusters and enhance their innovation levels. The formation and persistence of regional innovation cannot be separated from the government. The innovation behaviour of clusters is considered a kind of network innovation, and the resources of a cluster must be allocated both vertically (intraregion) and horizontally (interregion). We aim to detect where the AI clusters are located, whether and how IP affects TI, and how NC moderates this relationship. By using patents from the AI industry in China from 2011 to 2020, this paper adopts the LQ and SNA to identify industrial clusters. We then apply System-GMM estimation to explore the influence of IP on TI and the moderating role of NC in the IP–TI connection. The findings are as follows: First, twenty-nine AI clusters are identified. Interregional cooperation is the main form of collaboration for AI clusters. Second, IP can effectively increase the level of TI in the AI clusters. Third, the high NC of clusters diminishes the positive impact of IP on TI in the AI clusters. Our study makes three contributions. First, based on the global cluster network lens, this paper identifies AI clusters from the perspectives of spatial agglomeration and innovation connection, which simultaneously examines innovation linkages within and across regions. Second, according to government intervention theory, we uncover the positive effect of IP on TI from the perspective of clusters rather than that of the whole industry and address the gap in knowledge about the IP–TI connection in the AI industry. Third, given the scarcity of NC as a moderator in the policy research field, our research offers a novel theoretical lens that elucidates the influence of social networks on the function of IP in clusters.