Introduction

Digital technology has gradually integrated into the real economy over the past two decades. As a major player in digital technology (Korinek and Stiglitz, 2021), China has witnessed a significant expansion in its digital economy. According to the statistics from the Chinese government, from 2008 to 2022, China’s digital economy has witnessed remarkable growth, expanding from 4.8 trillion yuan to 50.2 trillion yuan, with its share in GDP rising from 15.2% to 41.5%. During this period, the revenue of Chinese SOEs has continued to increase, with the proportion of SOEs revenue to GDP reaching 68.3% in 2022. However, SOEs’ market dynamics and motivation to engage in digitalisation are weak (Wu et al., 2021). The digitalisation level of SOEs is significantly lower than that of non-state-owned enterprises (as shown in Appendix Fig. A1). Many studies have confirmed that digitalisation enables high-quality development of enterprises (Acemoglu and Restrepo, 2018; Ghasemaghaei and Calic, 2019; Barrett et al., 2015). Given the importance of SOEs in developing countries such as China and the current state of digitalisation among SOEs, studying the motivations behind the digitalisation of SOEs is both representative and of practical significance.

The mixed ownership reform is a crucial aspect of the reform for Chinese SOEs. Previous studies have shown that the mixed ownership reform has significantly impacted firm financial performance (Gupta, 2005; Opoku-Mensah et al., 2022). However, the impact of mixed ownership reform on enterprise digitalisation has not been adequately explored. We construct a theoretical framework for the influence of the mixed ownership reform on digitalisation from the perspectives of its impact on compensation incentive mechanisms and risk attitude. Firstly, the mixed ownership reform improves the pay-performance sensitivities of management (Gupta, 2005). Digitalisation is key to enhancing firm performance and achieving high-quality development (Loebbecke and Picot, 2015). Management is incentivized to promote digitalisation and expects higher future compensations. From the risk perspective, the mixed ownership reform enables higher corporate risk-taking. digitalisation, as a risky investment practice that requires substantial resources (Scholz et al., 2020; Svahn et al., 2017), becomes more compelling when companies have higher risk-taking capabilities and tendencies. Therefore, the mixed ownership reform of SOEs may have a promoting effect on digitalisation.

Drawing upon the aforementioned theoretical framework, we utilized the word2vec word embedding tool to construct the measure of digitalisation and examine the impact of mixed ownership reform on it. The study finds that the mixed ownership reform of SOEs significantly promotes their digitalisation. Mechanism tests show that mixed ownership reform promotes digitalisation mainly by influencing the pay-performance sensitivities of management and corporate risk-taking. Heterogeneity analysis reveals that the effect is more significant in competitive industries and among enterprise decision-makers who did not experience the Great Chinese Famine during childhood.

In addition to the practical significance, the findings of our study offer significant contributions to the literature on digitalisation. Specifically, this study provides new perspectives and tools for measuring digitalisation. Existing literature has adopted various approaches to measuring it, such as surveys (Commander et al., 2010), investments in digital assets (Müller et al., 2018), and traditional text analysis (Chen and Srinivasan, 2024). However, these methods have limitations, including low accuracy, ostentatious investment, and subjective bias (Cycyota and Harrison, 2006; Triplett, 1999; Li et al., 2021a). Our study employs the Word2vec word embedding model to extract digitalisation-related information from unstructured data to alleviate these issues. The Word2vec model, a deep learning algorithm in the machine learning domain, converts words into vectors based on contextual semantics, enabling synonym searching with vector algorithms. This approach not only overcomes the subjectivity inherent in manually constructed indicators in traditional text analysis but also ensures that the extracted key information on digitalisation originates from the research subjects, thereby enhancing the accuracy and validity of the indicator.

Besides, our study offers a novel perspective on the motivation of digitalisation. The drivers behind firms’ digitalisation are multifaceted. First, economic motivations, such as the pursuit of monetary profit and improvements in financial performance (Verhoef et al., 2019; Müller et al., 2018), play a critical role. Second, managerial characteristics are also influential, including collaborative relationships, power allocation, and social capital (Matt et al., 2015; Marabelli and Galliers, 2017; Li et al., 2018). Third, policy and institutional factors, such as government investments in digital technology, tax incentives, and innovation subsidies, effectively promote digitalisation (Hess et al., 2016; Bharadwaj et al., 2013). While previous studies have predominantly focused on direct digitalisation support policies, few have examined the effects of indirect institutional arrangements, such as mixed ownership reform, on digitalisation. Our research concerns this gap.

Furthermore, our study contributes to the literature on mixed ownership reform. Prior research on mixed ownership reform has concentrated mainly on equity structure adjustments, managerial governance, and their effects on firm performance (Megginson et al., 1994; Megginson and Netter, 2001; Gupta, 2005; Opoku-Mensah et al., 2022), with limited attention to technological transformation. Our study extends the scope of mixed ownership reform research by examining its role in driving firms’ digitalisation. Specifically, we uncover mechanisms through which such reform influences digitalisation processes by adjusting pay-performance sensitivities and enhancing firms’ risk-taking capacity. Mixed ownership reform introduces more excellent market-oriented elements, significantly strengthening the correlation between managerial compensation and firm performance (Cai et al., 2018). Enhanced compensation incentives encourage managers to adopt long-term strategies, including digitalisation. This finding aligns with existing literature on the relationship between managerial incentives and corporate innovation (Lin et al., 2011). Moreover, mixed ownership reform optimizes equity structures and increases risk-taking, making firms more inclined to adopt emerging technologies and digitalisation. Our study validates these mechanisms, offering new insights into how mixed ownership reform influences technological transformation in firms.

The subsequent sections of the paper are structured as follows: Section "Literature review and research hypothesis" concisely reviews the relevant literature and presents the research hypotheses. Section "Data, variables, and descriptive statistics" details the data sources, variable construction process, and descriptive statistics. Section "Empirical Results and Analysis" presents the empirical results and analyses, including the empirical models, baseline regressions, robustness, endogeneity, and mechanism tests. Finally, we offer concluding remarks and implications of our findings.

Literature review and research hypothesis

The background of mixed ownership reform

For a long time, the deficiencies of “lack of ownership” and “insider control” in Chinese SOEs have affected their operational efficiency and led to unfair distribution and rent-seeking corruption. These problems are expected to be alleviated with the mixed ownership reform by introducing non-state property rights (Boardman and Vining, 1989; La Porta et al., 1999). The entry of private capital or foreign equity participation into SOEs can promote the flexibility of SOEs to switch their operating mechanisms and enhance the level of incentive compatibility of rights, responsibilities, and benefits internally, thus bringing about Pareto efficient outcomes. Furthermore, the mixed ownership shareholding structure introduces external shareholder supervision, which is helpful to make up for the previous lack of supervision and can effectively reduce the monitoring cost of the principal-agent relationships and curb the moral hazard behaviour of SOEs executives.

As a pillar of the Chinese economy, the government attaches great importance to reforming mixed ownership in SOEs. The work reports of the Chinese central government in 2021 and 2022 point to the need to deepen the reform of mixed ownership of SOEs. The 14th Five-Year Plan proposes that the mixed ownership reform will remain the direction of SOEs reform in the future (2021–2025). According to the public data, by the end of 2021, Chinese SOEs had introduced more than 2.5 trillion yuan of various types of private capital. The number of mixed ownership enterprises in central and local SOEs exceeds 70% and 54%, respectively, and most SOEs have carried out the mixed ownership reform.

The impact of mixed ownership reform on digitalisation of the enterprise

To begin with, concerning management incentives, the inefficiency observed in SOEs can be attributed to the absence of efficient monitoring and incentive mechanisms for managers, as discussed by Laffont and Tirole (1993). The causal relationship between the operational performance of SOEs and managerial behaviour is ambiguous, and factors such as government regulation constrain the incentive provisions in compensation contracts, leading to severe agency problems within the organization (Bardhan, 1997). As such, managers are not adequately motivated to strive for the development of the enterprise. On the other hand, non-state shareholders generally prioritize profit maximization as their business objective, thus having a strong motivation to enhance executive compensation. Firth et al. (2006) suggest that the relative change in managerial wealth compared to shareholder wealth can be used to measure the pay-performance sensitivities of the management team. Due to compensation constraints and alternative incentives such as political promotion, the pay-performance sensitivities in SOEs are relatively lower than in private enterprises (Kato and Long, 2006). Increasing the pay-performance sensitivities can promote the alignment of interests between principals and agents, alleviating the agency problem (Jensen and Murphy, 1990). This means that agents can achieve personal benefits while improving the operational performance of the enterprise.

The digitalisation of enterprise refers to the significant improvement of business models through integrating and utilizing digital technologies (Vial, 2019; Warner and Wäger, 2019). It leads to enhanced operational performance by fostering innovation, reducing transaction costs, improving internal governance, and increasing total factor productivity (Ghasemaghaei and Calic, 2019; Dana and Orlov, 2014; Rosenblat and Stark, 2016; Commander et al., 2010; Loebbecke and Picot, 2015). Acemoglu and Restrepo (2018) found that firms’ adoption of digital technologies promotes automation and intelligence in production operations, leading to increased productivity. Digitalisation also facilitates information dissemination, thereby reducing transaction search costs and contract costs (Dana and Orlov, 2014). Further mixed ownership reform and digitalisation enable optimized corporate governance through data mining and data-driven intelligent decision-making, leading to more sophisticated and scientific enterprise management. Collecting customer interaction data in products and services creates new business value (Porter and Heppelmann, 2014). Li and Yu (2015) discovered that the involvement of non-state shareholders in governance could lead to enhancements in the compensation incentive policies for senior executives within SOEs, motivating managers to work diligently and promoting increased investment in technology activities. We believe that the mixed ownership reform can effectively enhance the pay-performance sensitivities of the management team. Since digitalisation can improve operational performance and profitability, the increased pay-performance sensitivities induced by the mixed ownership reform will further motivate managers to promote digitalisation. Based on our analysis, we put forward the following hypothesis:

H1: The mixed ownership reform of SOEs promotes digitalisation by increasing the pay-performance sensitivities of the management team.

Second, in terms of risk-taking, as digitalisation requires companies to reinvent their business models, the process will expose companies to uncertainty shocks and disruptive risks (Scholz et al., 2020). Path dependencies such as corporate inertia, available resources, and capabilities are difficulties that companies need to overcome for digitalisation (Svahn et al., 2017). For example, having highly optimized but rigid traditional production processes makes resources not easily reallocated (Kohli and Johnson, 2011), and managers and employees may show antagonistic attitudes when disruptive technologies are introduced into the enterprise (Singh and Hess, 2017). If firms are less willing to take risks, according to prior research by John et al. (2008), they prefer conservative investment strategies. In other words, firms may be reluctant to promote digitalisation when they are less willing to take risks. The SOEs typically carry a heavy policy burden to undertake many social functions for the government (Lin et al., 1998). They usually have low risk-taking levels because excessive risk-taking is not conducive to achieving the government’s political goals and social stability functions. Therefore, firms subject to government intervention prefer conservative investment strategies (Fogel et al., 2008). Shleifer and Vishny (1994) show that the reform of firm ownership from state-owned to private makes it more costly for politicians to intervene in firms. The political pressure exerted by the government on firms to achieve political goals is reduced. This may cause firms to shift their conservative business strategies. At the same time, privatization alleviates the agency problem of SOEs and gives firms a greater incentive to improve their corporate governance mechanisms, raising their risk-taking level (Boubakri et al., 2013). Therefore, the increased risk-taking level of SOEs following the mixed ownership reform will improve the undertaking of digitalisation activities. In summary, we posit the following hypothesis:

H2: The mixed ownership reform of SOEs promotes digitalisation by increasing corporate risk-taking.

We conduct empirical tests using data from Chinese A-share listed SOEs to validate the theoretical analysis and research hypotheses. To enhance the accuracy of the research findings, we employ a word vector model to refine the digitalisation measure and characterize mixed ownership reform in SOEs from two dimensions: equity structure and top governance structure. To ensure the robustness of the conclusions, we supplement the baseline regression with a series of robustness tests, endogeneity tests, and heterogeneity analyses. Furthermore, we address the proposed hypotheses through mechanism tests. The following sections provide a detailed discussion of the data sources, measurement construction, and empirical results.

Data, variables, and descriptive statistics

Data

For our research sample, we choose Chinese A-share listed SOEs spanning from 2010 to 2020 and proceed with the data processing as follows: (1) We exclude financial and insurance, ST and ST* listed companiesFootnote 1; (2) exclude the units with missing observations; The final sample has 10445 observations. The financial data used in this study are sourced from the China Stock Market & Accounting Research Database (CSMAR), and the data of related companies’ annual reports are obtained from the official websites of the Shenzhen Stock Exchange and Shanghai Stock Exchange.

Variable construction

The mixed ownership reform

We assess the level of mixed ownership reform through two distinct dimensions: equity structure and top governance structure. To quantify the equity structure, we utilize the ratio of non-state shareholder equity among the top ten shareholders, denoted as NonSOEs_Equ. Additionally, to assess the top governance structure, we calculate the proportion of directors appointed by non-state shareholders to the total number of directors on the company’s board, denoted as NonSOEs_Dir. The shareholder-appointed directors are determined here following Cheng and Wei (2013): In instances where a shareholder is a natural person, we consider them as having been appointed as a director if they currently hold a directorship position within the listed company. Regarding non-state-owned enterprise shareholders, the judgment standard is whether the directors of the listed company hold positions in the non-state-owned shareholdersFootnote 2. We can obtain more robust and prosperous conclusions by measuring the degree of mixed ownership reform from the above two dimensions.

The digitalisation of enterprise

Loughran and McDonald (2011) showed that quantifying and analyzing key information through word frequency can reflect business characteristics. With the advancement of natural language processing technology, using text analysis to construct the enterprise digitalisation metric is increasingly becoming a major approach (Chen and Srinivasan, 2024; Chen et al., 2024; Wu and Li, 2024; Wu et al., 2021; Yuan et al. 2021; Zhao et al. 2021). For example, Chen and Srinivasan (2024) use textual descriptions of digitalisation in business discussion texts in annual reports (10-K reports) as measurement; Wu et al. (2021) refer to the digital transformation-related literature, essential policy documents, and research reports to compile a library of keywords for digital transformation, and measure enterprises’ digitalisation based on the number of occurrences of the words in texts of listed companies. However, there are four main problems in the current study. First, there is no consistent definition of keyword selection in the digital transformation lexicon, and the study by Wu et al. (2021) is a pioneering and influential study that uses the lexicon method to measure digital transformation in the context of Chinese annual reports. The study summarizes the keywords of digital transformation from the perspectives of “underlying technology use” and “practical application of technology”, but the above categorization concepts are vague, with low precision and insufficient completeness. For example, “Business Intelligence” is included in the application of “underlying technology”, while “Intelligent Marketing” is included in the “practical application of technology”. The Internet of Things is included in cloud computing. Considering digital transformation only as an application based on cloud computing, big data, artificial intelligence, and blockchain technologies, Vial (2019) and Bharadwaj et al. (2013) reveal that the application of a wide range of technologies such as information technology, computing, communication, and connectivity is the foundation of digital transformation. In addition, Yuan et al. (2021) used the descriptions of digital transformation in China’s policy documents at a specific time to construct an enterprise key dictionary. Zhao et al. (2021) took some samples of enterprises’ annual reports to find the key information and build the dictionary in specific areas, namely, the application of digital technology, Internet business models, intelligent manufacturing, and modern information systems. The keyword dictionaries and indicators obtained in various ways in the past are different. The keyword sources, dictionary construction ideas, and choices are relatively subjective and lack theoretical support. Given that the application of digital technology is the foundation of enterprise digital transformation (Verhoef et al., 2019; Singh and Hess, 2017), we start with the definition of digital transformation and extract the key textual information of digital transformation from three perspectives. Second, the completeness of keywords is insufficient. It is difficult to exhaust digital transformation keywords manually. In the case of a long textual period, the variety of ways textual content is expressed in different temporal contexts makes it more challenging to exhaust keywords. Third is the problem of subjective bias. Previous research methods relied entirely on manual techniques to select keywords, and such a process cannot avoid bias caused by individual capabilities and subjective factors. Fourth, there is a lack of keyword validity. The keywords selected by the study may not appear or appear too infrequently in the context of annual reports. Some studies select keywords through external literature, working reports, and policy texts. However, Jiang et al. (2021) found that the expression of words in annual reports has more complex characteristics compared to the written expression of words in literature and policy texts. That is, the enterprise digital transformation lexicon constructed based on literature and policy texts may not effectively reflect the expression characteristics of digital transformation in the context of the annual report. In a subsequent study, we show many expressions related to digitalisation words based on specialization, English, and even acronyms based on the context of the enterprise’s annual reportFootnote 3. In contrast, relatively few policy texts allow the above expressions. To summarize, the dictionary method based on traditional text analysis faces problems such as lack of completeness, subjective bias, significant keyword variation, and lack of validity. These problems affect the accuracy of the results of digital transformation measurement and quantitative analysis.

We refer to previous studies (Li et al., 2021a; 2021b) to improve the traditional manual keyword method with the help of the unsupervised machine learning model of Word2vec, a landmark technology in the field of similar word expansion (LeCun et al., 2015). The Word2vec model is based on the well-recognized concept in linguistics that words occurring in similar contexts have similar meanings. It converts words into vectors based on their contextual contexts so that word similarity can be computed based on the vector similarity and thus learns to predict all of its approximate words. In addition, the Word2vce technique requires specific text to train the model, and according to the research needs, we choose the content of managerial discussion and analysis (MD&A) disclosed in the annual reports of listed companies as the research text. This text reflects the review of business conditions and the outlook for future development of the enterprise during the reporting period, and it has been widely recognized as the research object (Loughran and McDonald, 2011).

Combining previous studies and definitions of enterprise digitalisation, we look for expressions of enterprise digitalisation in three dimensions: first, broad and traditional digitalisation seed words, referring to studies such as Vial (2019), we use “Internet”, “Information technology” and “Communication technology” as seed words; second, cutting-edge digitalisation seed words. In the definition of digitalisation, one strand of literature highlights that enterprise digitalisation is based on cutting-edge digital technologies. For example, Singh and Hess (2017) argue that new digital technologies refer to social media, mobile connectivity, analytics or embedded devices, etc.; Warner and Wäger (2019) argue that new technologies refer to artificial intelligence, cloud, blockchain, and IoT. Combined with the context of Chinese expression, we put “Cloud computing”, “Big data”, “Artificial intelligence”, “Blockchain”, “Internet of Things”, and “Mobile Internet” as the seed words; thirdly, we refer to the use of digital technology seed words in a general way, and we use the words “Digital transformation”, “Digitalisation” and “Digital technology” as seed words. Finally, the above three parts of words are aggregated to get the enterprise digitalisation seed words. Given the characteristics of digitalisation definition, we also use the seed words based on the above frontier digital technologies to construct a dictionary and indicator of digitalisation based on frontier technologies, which are used for robustness testing.

After obtaining the seed words, we use the Word2vec model to output similar words in the MD&A text of listed companies. Following the methodology of Li et al. (2021a, 2021b), we employ the CBOW (Continuous Bag of Words) model within Word2vec for training, setting the vector dimension to 300 and outputting the top 100 similar words for each seed word based on similarity scores. This process involves two key steps: converting words into vectors and estimating the similarity between the word vectors. We use the CBOW model to estimate the word vectors. The core idea of the CBOW model is to predict the probability of the current word based on its context, and by maximizing the likelihood function, the corresponding vectors for each word are obtained. Mathematically, this can be expressed as follows:

$${Vector}{{\_}}{word}=\max \mathop{\sum}\limits_{w\in T}{\mathrm{log}}\,p(w\left|{Text}\right.(w))$$
(1)

Note: T represents the corpus, w represents a word, and Text(w) denotes the context.

Second, the similarity between word vectors is measured using cosine similarity. Based on the word vectors derived from the CBOW model, we calculate the cosine similarity between vectors to determine their similarity. In other words, cosine similarity allows us to identify words similar to the seed words. For two given word vectors X and Y, the cosine similarity is computed as follows:

$${Simlarity}=\cos (\theta )=\frac{X\times Y}{\left|X\right.\left|\times \right.\left|Y\right|}=\frac{\mathop{\sum }\nolimits_{i=1}^{n}{X}_{i}\times {Y}_{i}}{\sqrt{\mathop{\sum }\nolimits_{i=1}^{n}{{X}_{i}}^{2}}\times \sqrt{\mathop{\sum }\nolimits_{i=1}^{n}{{Y}_{i}}^{2}}}$$
(2)

We output each seed word’s top 100 similar words based on their cosine similarity scores. For instance, using the seed word “Digital Technology”, Table 1 presents the top 10 most similar words in the Chinese context and their similarity scores.

Table 1 Example of similar words for “Digital Technology” output by Word2vec.

The results in Table 1 indicate that most of the words output by the word vector model are highly relevant to “Digital technology”, demonstrating the validity of the research method adopted and highlighting the efficiency and accuracy of using machine learning to identify similar words compared to traditional manual methods. However, there may still be a small number of irrelevant words among the outputs. Following Li et al. (2021a, 2021b), we manually verify the machine learning results and exclude a small number of words unrelated to digitalisation (e.g., “Modern Technology”, “Cultural Media”, “Multimedia Technology”, as shown in Table 1). Ultimately, through this process, we developed a digitalisation dictionary based on the MD&A text from annual reports spanning 2010 to 2020.

We extracted the aggregate frequency of keywords in the MD&A text of listed companies’ annual reports through the dictionary. Concerning previous studies (Wu et al., 2021), we logarithmized it to get the enterprise digitalisation indicator (Digital). Figure 1 shows the process of building the digitalisation measure using machine learning.

Fig. 1: The process of building the digitalisation measure.
figure 1

This figure illustrates the steps involved in constructing a measure for digitalisation. The arrow symbols represent processes.

Validation of digitalisation of enterprise measure

Geographical Characteristics

We calculate the digitalisation mean value of all listed companies in any city during the sample period on a city-by-city basis and obtain the digitalisation geographic distribution map of listed companies in China in Fig. 2. The map presents the following two distinctive features. First, the digitalisation level of enterprises in the eastern region, especially in the coastal region, is significantly higher than that in the central, western, and northeastern regions. Second, within the same provincial area, the digitalisation level of provincial capital cities is generally higher than in other regions. The main reason is that the digital infrastructure and economic development of the eastern region and provincial capital cities are higher than those of their western, northeastern, and non-provincial capital regions, which is conducive to the digitalisation of enterprises in the regions mentioned above. In this paper, the constructed indicators reflect the actual situation regarding the geographical characteristics of enterprise digitalisation.

Fig. 2: The geographical characteristics of digitalisation in China.
figure 2

This figure calculates the mean digitalisation level of all Chinese listed firms in each city during the sample period. Darker colors indicate a higher level of digitalisation among listed firms within the city. It reflects the higher level of digitization in economically developed regions.

The time trend

The two curves in Fig. 3 represent our constructed graph of the time trend of the annual mean value of digitalisation of listed companies and the size of China’s digital economy as a share of GDP. To make the graph more intuitive, we multiply the proportion of China’s digital economy size to GDP by 10. The graph reflects two features: first, enterprise digitalisation as a whole shows a growing trend, and from 2010 to 2020, the proportion of China’s digital economy size to GDP grew from 18.6% to 38.6%, and the mean value of enterprise digitalisation during the same period increases from 0.942 roses to 2.713, and the above indicators all show a year-on-year increasing trend, indicating that the growth of enterprise digitalisation is consistent with the overall trend of China’s digital economy development; secondly, enterprise digitalisation showed a more apparent upward leap in 2015, a finding that is in line with the viewpoints of Peng et al. (2023). The main reason is that the Chinese government stepped up its efforts to promote the digitalisation of the real economy in that year. 2015 was the first year that Chinese leaders proposed the construction of “Digital China”. Also, the State Council of China proposed the implementation of the National Big Data Strategy and issued documents such as the “Action Plan for the Promotion of Big Data Development” and “Made in China 2025” to promote the rapid development of digitalisation. The measure reflects the actual situation from the time trend of digitalisation.

Fig. 3: The time trend of digitalization.
figure 3

In this figure, the horizontal axis represents the year, and the vertical axis shows the annual mean of digitalisation and the value of the proportion of China’s digital economy scale to GDP multiplied by 10. The gray solid line (Digital Economy Scale/GDP) reflects the scale of the digital economy, while the gray dashed line (mean of digitalisation) represents the level of corporate digitalisation. This figure indicates a significant increase in digitalisation in 2015, which can be attributed to a series of concentrated and large-scale policies introduced by the Chinese government to support corporate digitalisation during that year, such as the construction of “Digital China”, the implementation of the national big data strategy, the publication of the “Action Plan for the Promotion of Big Data Development” and “Made in China 2025”.

The definition of variables

Based on previous studies, we specify several variables of firm and management characteristics as control variables. Table 2 presents the names and corresponding definitions of each variable used in the study.

Table 2 The definition of variables.

Descriptive statistics

Table 3 displays the descriptive statistics. The mean value of digitalisation level (Digital) of SOEs is 1.659; the variance is 1.328, the maximum value is 6.275, and the minimum value 0. This indicates significant differences in the digitalisation levels of different enterprises. For the indicators of SOEs’ mixed ownership reform, the average proportion of directors appointed by non-state shareholders is only 3.8%, which indicates the overall low level of non-state shareholders’ participation in governance. The proportion of non-state shareholder equity among the top ten shareholders is 13.9%, reflecting the overall low shareholding ratio of non-state shareholders at the level of key shareholders.

Table 3 Descriptive statistics.

Empirical Results and Analysis

The empirical model is as follows:

$$\begin{array}{l}{{Digital}}_{i,t}={\beta }_{0}+{\beta }_{1}{{{NonSOEs}{{\_}}{Dir}}_{i,t}/{NonSOEs}{{\_}}{Equ}}_{i,t}\\\qquad\qquad\quad+\,{\beta }_{2}{{Controls}}_{i,t}+\sum {Industry}+\sum {Year}+{\varepsilon }_{i,t}\end{array}$$
(3)

Digitali,t is the dependent variable that captures the digitalisation of firm i in year t. The independent variable NonSOEs_Diri,t and NonSOEs_Equi,t assessed the mixed ownership reform from the perspectives of top governance structure and equity structure, respectively. Controlsi,t represents control variables. We also control for industry and year fixed effects.

Baseline regression

Table 4 shows the baseline regression results. Columns 1 and 3 do not include control variables and the only control for time and industry fixed effects and show that the impact of the proportion of non-state shareholder directors and the proportion of non-state shareholder ownership among the top ten shareholders on the digitalisation of enterprise is significantly positive at the 1% level. Columns 2 and 4 show that these two mixed ownership reform variables remain significant after including control variables. This implies that the mixed ownership reform of SOEs will significantly promote the digitalisation of enterprises.

Table 4 Baseline regression results.

Robustness tests

Estimating with alternative independent variables and regression models

First, we use the logarithm of the number of directors appointed by non-state shareholders in the board of directors (LnNonSOEs_Dir). The regression results reported in column 1 of Table 5 shows that the alternative mixed reform indicators continue to contribute positively and significantly to digitalisation. In addition, due to the presence of zero values in the dependent variable Digital, we further employ the Tobit model for the regression, and the results reported in columns 2 and 3 show that our basic conclusions remain robust.

Table 5 Changing the mixed ownership reform measurement and using the Tobit model.

Re-specifying the dependent variable

We attempt to reconstruct the dictionary of enterprise digitalisation by considering the definition of digital technology. Some studies suggest that the digital technology employed for digitalisation is an advanced form of technology, distinct from traditional information and communication technologies (Warner and Wäger, 2019). Given this, we reconstruct the digitalisation dictionary and indicator (Digital_RE) as a robustness test. In this dictionary, keywords such as “internet technology”, “information technology”, and “web technology”, which refer to traditional digital technologies, are discarded. The findings in columns 1 and 2 of Table 6 demonstrate the robustness of the mixed ownership reform in fostering the digitalisation of enterprises.

Table 6 Changing digitalisation measurement.

Propensity score matching (PSM)

we also utilize the PSM method for robustness tests. The entire sample is grouped according to the presence or absence of non-state shareholder-appointed directors (NonSOEs_Dummy). The variables of Size, Age, Leverage, Growth, TBQ, ROA, ROA, Indep, Board, and Dual are used as covariates to identify a control group with similar characteristics to the treatment group, using the nearest neighbour matching, radius matching, and kernel matching methods, respectively. The balance tests were performed on all covariates before PSM regression estimation was executed. The differences in characteristics between firms with or without directors appointed by non-state shareholders were effectively eliminated after PSM. The post-matching estimation results presented in Table 7 remain consistent with the baseline regression results, demonstrating that the study findings remain robust after addressing the problem of sample selection bias.

Table 7 The PSM test.

Endogeneity test

Referring to Cai et al. (2018) and Fan et al. (2013), we utilize the historical concessions in China as the instrumental variable of mixed ownership reform, selected for its relevance, exogeneity, and exclusivity. And we give the following reasons:

Before the First Opium War, China’s Qing Dynasty adopted a policy of isolation, severing interactions with other countries. Following the war, a series of agreements and treaties required the Chinese government to open specific trading ports and establish concessions, allowing foreign capital to invest, establish factories, conduct missionary work, and promote economic and cultural exchange. Similar to colonisation, establishing concessions may have long-term effects on developing local markets and legal institutions (Acemoglu et al., 2001). Concession areas were characterised by an emphasis on private enterprise, property rights, and market economies, resulting in a more developed institutional foundation for SOEs reforms in these regions, thus satisfying the relevance requirement for instrumental variables.

However, the existence of concessions is a historical legacy, making this variable highly exogenous. Its influence is mainly reflected in shaping local market mechanisms (Fan et al., 2013). Digitalisation, on the other hand, is largely a product of modern information technology development. This disconnect makes it unlikely that the history of concessions directly affects contemporary corporate digitalisation. Digitalisation relies more on modern information infrastructure, corporate economic incentives, and investment capacity (Vial, 2019; Bharadwaj et al., 2013), which are not directly related to the historical presence of concessions in the firm’s location. Thus, while concessions do not directly drive modern corporate digitalisation, they may exert an indirect effect through their impact on mixed ownership reform. This satisfies the exclusion restriction requirement for instrumental variables. Finally, we construct an instrumental variable (Concession), which takes the value of 1 if the listed company is located in a province that has historically established a concession area and 0 otherwise.

Table 8 reports the instrumental variable regression results. Column 1 indicates that the instrumental variable has a significant positive effect on the proportion of directors appointed by non-state shareholders (NonSOEs_Dir) and passed the weak instrumental variables test and rejection of the under-identification hypothesis. The two-stage regression results in column 2 indicate that the impact of the proportion of directors appointed by non-state shareholders on the digitalisation of enterprise remains positively significant. Similarly, column 3 indicates that the instrumental variable has a significant positive effect on the proportion of non-state shareholder equity among the top ten shareholders (NonSOEs_Equ), and it passes the weak instrumental variable test as well as the under-identification hypothesis. The two-stage regression results in column 4 indicate that the effect of the proportion of non-state shareholder equity among the top ten shareholders on digitalisation of enterprise is still positively significant.

Table 8 Instrumental variable tests.

Mechanism tests

The literature shows that mixed ownership reform can enhance management pay-performance sensitivities and form a performance-oriented incentive mechanism. The digitalisation of enterprise is an essential means to promote corporate performance, and driving digitalisation will give management higher incentives and performance expectations in the future. In China, monetary compensation is the most critical component of managerial compensation incentives, and the assessment of the relationship between managerial compensation and performance in major companies focuses on the relationship between monetary compensation and performance (Cai et al., 2018). Referring to Firth et al. (2006) and Cai et al. (2018), we measured managerial compensation by taking the natural logarithm of the average compensation of the top three management personnel disclosed by the company. We used the ratio of management compensation to the natural logarithm of company net profit to measure the management pay-performance sensitivities (Lnpay). The regression results in columns 1 and 2 of Table 9 show that NonSOEs_Dir and NonSOEs_Equ significantly increase the pay-performance sensitivities of management, indicating that the pay-performance sensitivities are an essential mechanism for the mixed ownership reform to drive digitalisation. Thus, Hypothesis H1 is confirmed.

Table 9 Mechanism tests: the mixed ownership reform, pay-performance sensitivities, and corporate risk-taking.

Second, the literature analysis also shows that digitalisation is a risky activity, and Chinese SOEs tend to have low risk-taking levels due to their political objectives and social stability functions. The mixed ownership reform can reduce the government’s influence on corporate investment decisions, and firms will reorient toward the goal of maximizing corporate value and choosing more aggressive business strategies, undertaking a higher level of corporate risk-taking, which in turn promotes digitalisation. We refer to the research design of John et al. (2008) and use the level of firm earnings volatility (Risk_taking) over a specific period to measure the extent of firm risk-taking. We adopt a 3-year observation period (from t years to t + 2 years) due to the typical tenure of executives in Chinese listed firms, which is generally 3 years. The procedure of Risk_taking calculation is shown in Eqs. (4) and (5), and the results shown in columns 3 and 4 of Table 9 indicate that the appointment of directors by non-SOE shareholders and the proportion of non-SOE shareholders equity among the top ten shareholders of the firm significantly enhance the firm’s risk-taking. The results indicate that corporate risk-taking is an important mechanism through which SOEs’ mixed ownership reform promotes the digitalisation of enterprises. Thus, Hypothesis H2 is confirmed.

$${{\rm{Risk}}{\_{\rm{taking}}}1}_{{\rm{i}},{\rm{j}}}=\sqrt{\frac{1}{{\rm{n}}-1}\mathop{\sum }\limits_{{\rm{j}}=1}^{{\rm{n}}}{\left({{\rm{Adj}}\_}{{\rm{Roa}}}_{{\rm{i}},{\rm{j}}}-\frac{1}{{\rm{n}}}\mathop{\sum }\limits_{{\rm{j}}=1}^{{\rm{n}}}{{\rm{A}}{\rm{dj}}\_{\rm{Roa}}}_{{\rm{i}},{\rm{j}}}\right)}^{2}},{\rm{n}}=3$$
(4)
$${{\rm{Adj\_Roa}}}_{{\rm{i}},{\rm{j}}}=\frac{{{\rm{EBIT}}}_{{\rm{i}},{\rm{j}}}}{{{\rm{ASSET}}}_{{\rm{i}},{\rm{j}}}}-\frac{1}{{{\rm{m}}}_{{\rm{j}}}}\mathop{\sum }\limits_{{\rm{k}}=1}^{{\rm{m}}}\frac{{{\rm{EBIT}}}_{{\rm{k}},{\rm{i}}}}{{{\rm{ASSET}}}_{{\rm{k}},{\rm{j}}}}$$
(5)

Note: Roa is the return on assets, Adj_Roa is the return on assets adjusted by industry mean, EBIT is Earnings before interest and tax, ASSET is total company assets; i denotes the i company, j denotes the j year of the forward-looking 3 years; \({m}_{j}\) is the total number of companies in industry m in the forward-looking j year, and k is the k company in industry m.

Heterogeneity analysis

Jiang (2022) argues that the most crucial role of heterogeneity analysis lies in further testing the mechanisms. Based on this rationale, we examine the mechanisms of the pay-performance sensitivities of management and risk-taking through industry competition levels and decision-makers’ early experiences, respectively.

Degree of industry competition

Competitive SOEs face significant competitive pressure, necessitating the design of more reasonable compensation contracts to maximize shareholder interests and enhance overall corporate performance. Compared to monopolistic SOEs, mixed ownership reform is expected to have a more pronounced effect on improving the pay-performance sensitivities of management in competitive SOEs. To test this hypothesis, we refer to Ke et al. (2017) and divide the sample into regulated and competitive industries. The variable Compete is assigned a value of 1 for firms in competitive industries and 0 otherwise, and a grouped regression analysis is performed. As shown in Columns (1) and (3) of Table 10, the coefficients for NonSOEs_Dir and NonSOEs_Equ are significantly positive in competitive industries. Conversely, in regulated industries, the significance and magnitude of the NonSOEs_Dir coefficient decrease in Column (2), while the coefficient for NonSOEs_Equ in Column (4) is insignificant. An inter-group correlation test confirms that the differences between these coefficients are statistically significant. These findings indicate that the positive effects of mixed ownership reform on digitalisation are more pronounced in competitive industries. This may be attributed to the stronger impact of mixed ownership reform on enhancing the pay-performance sensitivities of management in these industries.

Table 10 Heterogeneity test: degree of industry competition.

Corporate decision-makers who experienced famine during childhood

Previous research has found that corporate decision-makers’ childhood experiences of famine can shape deeply rooted and unchangeable risk perceptions (Wang et al., 2022). If mixed ownership reform influences corporate digitalisation by affecting firms’ risk-taking, the effect should be more pronounced in firms that are more susceptible to such influences. In China, the chairman of the board significantly impacts the firm’s operational decisions (Feng and Johansson, 2018). We examine the risk-taking mechanism based on the historical event of the Great Chinese Famine and its impact on shaping the risk perceptions of the chairman. Referring to the study by Xu and Li (2016), the variable Famine is assigned a value of 1 if the chairman’s birth year falls between 1947 and 1961 (indicating that they experienced famine during childhood), and 0 otherwise, and a grouped regression analysis is conducted. The regression results in Columns (2) and (4) of Table 11 show that in the group where Famine = 0, the coefficients for NonSOEs_Dir and NonSOEs_Equ are significantly positive, whereas in the group where Famine = 1, the coefficients are not significant. An inter-group correlation test confirms that the differences between these coefficients are statistically significant. These findings suggest that mixed ownership reform has a more pronounced effect on promoting digitalisation in firms led by decision-makers without childhood famine experiences. This may be related to the fact that mixed ownership reform more effectively enhances risk-taking in such firms.

Table 11 Heterogeneity test: corporate decision-makers who experienced the Great Chinese Famine in childhood.

Conclusion

we construct a measure of digitalisation through a machine learning approach and find that the mixed ownership reform of SOEs can significantly promote digitalisation. The conclusion holds after various robustness checks, including using alternative independent and dependent variables, PSM, and endogeneity tests. The mechanism analysis reveals that mixed ownership reform facilitates digitalisation by influencing pay-performance sensitivities and risk-taking. This effect is more significant among competitive industries and corporate decision-makers who did not experience famine during childhood. Our study explores digitalisation measures, enriches the literature on the impact of mixed ownership reform on business decisions, and extends the research on the motivation of digitalisation.

The policy suggestions and managerial implications are as follows: First, it is essential to recognise the significant role of mixed ownership reform in SOEs in driving digitalisation. Promoting mixed ownership reform improves not only internal governance, enhances enterprise vitality, and increases operational efficiency but also facilitates high-quality development through the advancement of digitalisation. In implementing digital strategies for SOEs, governments need to concern institutional factors, with particular attention to the role of mixed ownership reform. Second, enterprises should leverage the influence of mixed ownership reform on pay-performance sensitivities and risk-taking attitudes. Using mixed ownership reform as an opportunity, enterprises should align their operations with value creation principles, optimise performance-based compensation mechanisms, and enhance reasonable levels of risk-taking. These measures will effectively promote both market-oriented strategies and digitalisation. Third, attention should be paid to the heterogeneous effects of mixed ownership reform on digitalisation across different types of SOEs. The influence of decision-makers’ childhood experiences highlight the need for governments and enterprises to consider the personal backgrounds of executives when promoting digitalisation or technological innovation through mixed ownership reform. In competitive industries, the pace of reform can be accelerated to achieve rapid improvements in digitalisation. In non-competitive industries, modern management tools can be introduced to enhance the effect of reform on the pay-performance sensitivities of management, enabling reforms to play a greater role in driving digitalisation in non-competitive enterprises.

Our current research still has limitations: Although natural language processing (NLP) technologies have been widely applied to extract specific information from unstructured data such as text, achieving substantial progress, our measure may still have certain biases. For instance, variations in the writing style and information transparency of annual reports across different firms might lead to some firms exaggerating or obscuring their actual digitalisation efforts in their reports. This may cause the inaccuracy of the measure. In the future, we could integrate methods such as event studies to more accurately analyse and understand the role of mixed ownership reform in digitalisation.

Our research can also be further expanded: On one hand, future studies could expand to the domain of non-state-owned enterprises. To date, over 60% of privately listed firms in China have introduced state-owned shareholders through “reverse” mixed ownership reform. Further research could examine the impact of such “reverse” mixed ownership reform on corporate digitalisation. Another intriguing research direction is to extend this methodology to an international context, particularly by comparing authoritarian and democratic regimes, as well as developed and developing countries. These countries exhibit significant differences in the role of state-owned economies, and digitalisation is influenced by factors such as policies, economic structures, and corporate cultures. Cross-national research could help us understand the similarities and differences in how reforms drive digitalisation across various economic systems.