Humanities and Social Sciences Communications
Transforming the Northwest frontier: development discourse in Republican China through computational analysis of the historical press
  • Article
  • Open access
  • Published: 13 February 2026

  • Tao Ren (affiliations 1, 2)

Humanities and Social Sciences Communications (2026)

  • 437 Accesses

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Cultural and media studies
  • History
  • Politics and international relations

Abstract

This study examines the discourse surrounding Northwest China’s development during the Republican era (1911–1949). Drawing on 5,461 newspaper and periodical articles from the Quan Guo Bao Kan Suo Yin (CNBKSY) and the Shenbao database, we developed a multi-stage text extraction workflow that uses Google Gemini to convert complex historical scans into machine-readable text. After standardising historical character forms, we applied structural topic modelling (STM) to recover 26 coherent themes covering infrastructure, resource extraction, state governance, ethnic relations, and cultural mobilisation. Topic-correlation analysis reveals three higher-order clusters (infrastructure and resources, governance and industry, and cultural-educational development), integrated by overarching concerns for strategic geography and state-led economic planning. Temporal analysis shows sharp inflections. Discourse first gained momentum through warlord-led regional development initiatives in the 1920s. The 1931 Manchurian Incident then amplified a security-centred rhetoric, and after the 1937 Japanese invasion the focus shifted fully from exploratory surveys to urgent state-led nation-building. That focus gradually ebbed after 1945 as wartime imperatives faded. These patterns illustrate how national crises transformed the Northwest from a peripheral frontier into a strategic heartland in Republican-era Chinese media and policy discourse. By leveraging a large-scale corpus and cutting-edge computational methods, this study moves beyond traditional historiography to provide the most comprehensive analysis yet of how Republican China envisioned, prioritised, and rhetorically shaped its interior frontier during a pivotal period of modern Chinese state-building.
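
The two post-modelling steps named in the abstract, temporal analysis and topic-correlation analysis, can be illustrated with a minimal sketch. It assumes that a fitted topic model has already produced one vector of topic proportions per article, paired with a publication year. Everything here is hypothetical: the function names, the period labels, the choice to place 1945 in the wartime period, and the toy data are illustrative assumptions rather than the author's implementation or corpus.

```python
from collections import defaultdict
from math import sqrt

# Period boundaries follow the turning points named in the abstract
# (1931, 1937, 1945). The labels and the decision to count 1945 as
# part of the wartime period are assumptions made for this sketch.
PERIODS = [
    (1911, 1930, "pre-1931"),
    (1931, 1936, "1931-1936"),
    (1937, 1945, "1937-1945"),
    (1946, 1949, "post-1945"),
]


def period_of(year):
    """Return the period label for a publication year."""
    for start, end, label in PERIODS:
        if start <= year <= end:
            return label
    raise ValueError(f"year {year} is outside 1911-1949")


def mean_prevalence_by_period(docs):
    """Average topic proportions per period.

    docs: list of (year, [proportion for each topic]) pairs.
    Returns {period label: [mean proportion for each topic]}.
    """
    sums = {}
    counts = defaultdict(int)
    for year, theta in docs:
        label = period_of(year)
        if label not in sums:
            sums[label] = [0.0] * len(theta)
        sums[label] = [s + t for s, t in zip(sums[label], theta)]
        counts[label] += 1
    return {
        label: [s / counts[label] for s in total]
        for label, total in sums.items()
    }


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (sd_x * sd_y)


def topic_correlation(docs, i, j):
    """Correlation between the document-level proportions of topics i and j."""
    thetas = [theta for _, theta in docs]
    return pearson([t[i] for t in thetas], [t[j] for t in thetas])


# Toy data: five invented articles and three invented topics.
TOY_DOCS = [
    (1925, [0.6, 0.3, 0.1]),
    (1933, [0.4, 0.4, 0.2]),
    (1938, [0.2, 0.6, 0.2]),
    (1940, [0.2, 0.6, 0.2]),
    (1947, [0.5, 0.3, 0.2]),
]

if __name__ == "__main__":
    for label, means in mean_prevalence_by_period(TOY_DOCS).items():
        print(label, [round(m, 3) for m in means])
    print("corr(topic 0, topic 1):", round(topic_correlation(TOY_DOCS, 0, 1), 3))
```

Topics whose proportions rise and fall together across articles correlate positively and would tend to fall into the same higher-order cluster. A structural topic model estimates prevalence effects within the model itself, so these plain averages and correlations are only a descriptive approximation of that idea.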

Data availability

Data and analysis are available at: https://github.com/ghuserone/develop-northwest. The repository includes shareable code and the processed corpus used for analysis; the underlying CNBKSY and Shenbao source scans are subject to database licensing and cannot be redistributed.

Author information

Authors and Affiliations

  1. School of Public Affairs, Zhejiang University, Hangzhou, China

    Tao Ren

  2. Faculty of Arts and Sciences, Harvard University, Cambridge, MA, USA

    Tao Ren

Contributions

TR conceived and designed the study, collected and processed the data, performed the analysis, and wrote and revised the manuscript.

Corresponding author

Correspondence to Tao Ren.

Ethics declarations

Competing interests

The author declares no competing interests.

Ethical approval

This article does not contain any studies with human participants performed by any of the authors.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Informed consent

This article does not contain any studies with human participants performed by any of the authors.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

About this article

Cite this article

Ren, T. Transforming the Northwest frontier: development discourse in Republican China through computational analysis of the historical press. Humanit Soc Sci Commun (2026). https://doi.org/10.1057/s41599-026-06682-6

  • Received: 27 May 2025

  • Accepted: 02 February 2026

  • Published: 13 February 2026

  • DOI: https://doi.org/10.1057/s41599-026-06682-6

Humanities and Social Sciences Communications (Humanit Soc Sci Commun)

ISSN 2662-9992 (online)
