Towards a theory of sustainable city sizes

Kamruzzaman, Liton Md

doi:10.1038/s42949-025-00254-4

Download PDF

Article
Open access
Published: 13 August 2025

Towards a theory of sustainable city sizes

Liton Md Kamruzzaman ORCID: orcid.org/0000-0001-7113-942X¹

npj Urban Sustainability volume 5, Article number: 66 (2025) Cite this article

5032 Accesses
13 Altmetric
Metrics details

Subjects

Abstract

This study proposes a new theory of sustainable city size: the condition where a city’s actual population aligns with its theoretical capacity to function effectively. Drawing from four foundational theories (locational fundamentals, increasing returns, central place, and central flow), the study develops a conceptual framework to estimate theoretical city size using data from 655 Urban Centres and Localities (UCLs) in Australia. It examines how deviations of actual size from this theoretical size influence sustainability outcomes, using rent, walking-to-work rates, and multi-vehicle household share as proxies for economic, environmental, and socio-environmental sustainability. Results show that UCLs within ±4% of their theoretical size achieve optimal outcomes across all indicators. Nationally, achieving equilibrium could save $5.3 billion in annual rent, generate 44,000 additional daily walking trips, and reduce multi-vehicle dependency in 275,000 households. These findings support the proposed theory and offer a practical tool to align cities with their systemic capacity.

Exploring the interplay between population profile and optimal routes in U.S. cities

Article Open access 08 May 2025

The three-dimensional structure of population density in world cities

Article Open access 26 September 2025

Unravelling the spatial directionality of urban mobility

Article Open access 27 May 2024

Introduction

Hundreds of new cities are built globally each decade^1,2,3,4, yet few have population size targets grounded in systematic theory. Most rely instead on political aspirations or speculative investment, raising critical planning questions: (a) How can city sizes be determined before they are built or exceed their limits? and (b) Would systematically predetermined sizes enhance urban sustainability? Determining city size is foundational to urban planning⁵, shaping land use, infrastructure, and service provision⁶. Without a priori estimates, urban planning becomes reactive and fragmented. Yet, the literature offers little guidance on how to determine the size of new cities. Existing approaches focus on current size distributions (e.g., rank-size rule) or project growth based on past trends⁷, offering little strategic guidance or sensitivity to planning interventions. As a result, many cities face unsustainable growth or population decline².

The case for a priori city size estimates is further complicated by conflicting evidence on the relationship between city size and sustainability. While larger cities are often seen as growth-inducing⁸, others find that productivity is independent of size⁹, or highly context-dependent^10,11. Large cities also face challenges such as crime¹², emissions¹³, heat islands¹⁴, unaffordable housing¹⁵, social segregation¹⁶, and declining civic engagement¹⁷—yet remain attractive for their amenities¹⁸ and innovation¹⁹. Some studies suggest medium-sized cities perform better on life satisfaction²⁰, and environmental sustainability²¹. Although commuting distances typically increase with city size²², the effect is often small^23,24, or even absent^25,26. This study proposes that such inconsistencies stem from a reliance on actual population size, without considering a city’s theoretical or systemic capacity. This reflects the absence of a robust framework for estimating theoretical city size—that is, the population a city can sustainably support given its unique opportunities and constraints. Importantly, theoretical city size is not fixed but can be altered through planning interventions. The study further hypothesises that cities perform better on sustainability outcomes when their actual sizes approximate their theoretical sizes, a condition referred to here as sustainable city size.

To support this hypothesis, it is essential to revisit the theoretical foundations that explain how city sizes form and evolve. A large body of literature offers insights into the drivers of urban growth, structure, and interaction, though often in fragmented strands^27,28. Early approaches aimed to identify a universal optimum²⁹, but this notion was dismissed as overly simplistic. Zipf’s law, a widely cited empirical regularity, proposes that city size follows a rank-size distribution^30,31; yet it lacks predictive capacity^28,32, especially for forward-looking planning. Cities differ in their specificity (e.g., topographic or climatic conditions) and unicity (e.g., capital city status)³³, resulting in varying theoretical sizes³⁴ that can change over time through interventions such as infrastructure investment, zoning reforms, or institutional change.

Three dominant theories explain how individual cities grow: random growth, locational fundamentals, and increasing returns to scale^35,36. Random growth theory posits that cities grow randomly, eventually conforming to Zipf’s law³⁷. Locational fundamentals theory suggests that geographic, physical, and climatic attributes (e.g., elevation, port access, and rainfall) are randomly distributed and shape a city’s potential^{38,39,40,41,42,43}. Although traditionally seen as fixed, recent studies suggest that these fundamentals can shift in response to technological or social change⁴⁴. Increasing returns to scale theory highlights how economic agglomeration drives growth through specialisation, diversity, and productivity^8,45. These effects are dynamic and policy-sensitive: labour productivity, connectivity, and spatial clustering can be shaped by zoning reform, regional investment, or new transport infrastructure^44,46,47. However, these theories focus on individual cities and overlook their interactions within broader urban systems. Central place theory explains vertical hierarchies based on functional differentiation and service range (Fig. 1a)⁴⁸. Central flow theory, by contrast, highlights horizontal flows (e.g., labour, capital, freight, information), emphasising inter-city connectivity (Fig. 1b)^49,50,51,52. Concepts such as ‘borrowed size’ and ‘borrowed function’ illustrate how cities can benefit from their neighbours—e.g., smaller cities borrowing function, and larger cities borrowing population^27,53,54,55.

**Fig. 1: Schematic representation of the hierarchical distribution of cities.**

This study integrates four theoretical traditions (locational fundamental, increasing returns, central place, and central flow) into a unified framework for estimating theoretical city size (Fig. 2). These theories have typically been applied in isolation due to perceived contradictions or assumptions that each applies to a different stage of urbanization^36,38. For example, Meijers⁵⁶, p.246 states that central flow theory “is essentially opposite to the central place model”, emphasising horizontal rather than vertical hierarchies. In contrast, this study considers the theories as complementary and jointly explanatory of city size and growth. The framework identifies theory-specific factors and shows how policy interventions can influence them. It also recognises feedback loops: for example, population growth can enhance accessibility and specialisation, which in turn attract further growth^57,58, highlighting the co-evolutionary nature of city size. This framework informs a method to assess whether cities operate above, below, or near their theoretical size. Cities near this threshold are defined as having sustainable city size. The paper tests whether such cities perform better across three sustainability outcomes—economic (median rent), environmental (% of commuters walk to work), and socio-environmental (share of households with 2+ vehicles)—using data from 655 Urban Centres and Localities (UCLs) in Australia (Fig. 3). Two population thresholds are examined: (a) UCLs with ≥4000 residents (n = 205) and those with ≥10,000 residents (n = 100), with and without inclusion of the seven mainland capital cities. To ensure robustness, the analysis addresses potential reverse causality between city size and its determinants using instrumental variables.

**Fig. 2: Conceptual framework for theorising sustainable city size.**

**Fig. 3: Location and population distribution of Urban Centres and Localities (UCLs) across Australia.**

Results

By testing the research hypotheses, the study addresses the following three research questions: (1) How can the theoretical size of cities be robustly determined? (2) How does a city’s actual size affect its ability to achieve its theoretical size? and 3) How does the alignment (or misalignment) between actual and theoretical size influence sustainability outcomes?

Table 1 presents descriptive statistics for the variables used to estimate theoretical size of Australian cities. On average, each of the 655 UCLs had approximately 25 K people in 2016. As expected, the average population size increases to 78 K and 152 K when the analytical samples are reduced to UCLs with ≥4 K and ≥10 K populations, respectively. However, this reduction in sample size increases variability, suggesting that the excluded UCLs have more homogenous (albeit smaller in size) populations.

Table 1 Descriptive summary of the variables

Full size table

The increase in population density with reduced sample sizes indicates that the excluded UCLs are more rural, occupying larger lot sizes. Similarly, job accessibility levels increased as the sample was restricted, suggesting that accessible employment opportunities are more concentrated in larger settlements. Average driving time to the nearest train station was lower for the ≥4 K sample (40 min) compared to the full sample (56 min) and the ≥10 K sample (46 min), implying that many mid-sized UCLs (4 K–10 K) had a train station nearby. On average, each UCL had 57 K population in nearby settlements (borrowed size), a figure that decreased slightly (to 46–50 K) as the sample was restricted. This suggests that larger UCLs had lower borrowed sizes due to their greater internal population mass. However, the slightly higher borrowed size for the ≥10 K UCLs compared to the ≥4 K UCLs suggests that mid-sized UCLs (≥4 K and <10 K) are more isolated. Borrowed function indicators showed similar patterns. A gradual decrease in per capital availability of amenities (per 10 K population) with reduced sample sizes indicates that smaller settlements enjoyed greater access to amenities relative to their size. Little variation was observed in travel time to the nearest capital city or across climatic variables (rainfall, sunshine, temperature), indicating that excluded and included UCLs are similarly distributed in geographic terms.

Larger UCLs had a higher proportion of specialist jobs compared to smaller UCLs. However, occupational diversity levels were similar across settlement groups, suggesting that smaller UCLs trade off specialist roles with a broader base of non-specialist employment. Little variation was found in closeness centrality across the groups, while betweenness centrality scores were notably higher for larger UCLs, indicating that they more frequently fall on the shortest paths connecting other cities. However, higher standard deviations for the =>10 K group imply greater internal variation in network dominance. Only 7 UCLs are designated as state or territory capitals (Sydney, Melbourne, Brisbane, Adelaide, Perth, Canberra and Darwin) (Fig. 3), and all are included in the ≥10 K sample group.

Smaller UCLs performed better on two of the three sustainability outcomes examined in this study (economic and environmental), but exhibited the opposite pattern for the socio-environmental sustainability outcome. As shown in Table 2, smaller settlements had lower median rents and a higher share of people walking to work, indicating stronger economic and environmental sustainability. However, they also exhibited a slightly higher proportion of households owning two or more vehicles, suggesting reduced socio-environmental sustainability due to increased reliance on private transport.

Table 2 Multiple linear regression model results showing the factors associated with optimum city sizes

Full size table

Overall, the descriptive trends observed across different indicators align with the literature as reported in the Introduction and are commensurate with the general understanding of the topic.

Determinants of theoretical city sizes

Table 2 presents results from four multiple linear regressions models estimating how theoretically derived factors explain city sizes in Australia (Model 1: ≥10 K model with capital cities; Model 2: ≥10 K model without capital cities; Model 3: ≥4 K model with capital cities; and Model 4: ≥4 K model without capital cities).

Among the 15 explanatory factors considered in this study (Table 1), Model 1 shows that only five of them had a statistically significant association with theoretical size of cities (capital city status, job accessibility, high-level urban function, urban diversity, and betweenness centrality). These five factors explained 72% variations in the population size among the 100 UCLs considered for this sample. The signs of the coefficients were also found to be intuitive. Model 1 shows that the expected percent increase in the geometric mean of population sizes from non-capital cities to capital cities is about 1155% (((e^2.53 -1)*100) ≈ (12.55-1)*100 ≈ 1155). Or, simply, it can be said that population sizes were nearly 12 times higher for the UCLs designated as capital cities than the other UCLs considered in this group. This finding indicates the strong influence of the locational fundamentals theory in determining city sizes in Australia. It is also to be noted here that this variable was strongly correlated with the travel time to the capital city variable, suggesting the strong influence of the central place theory as well. The influence of central place dynamics is also reflected by the high-level urban function variable in Model 1. It shows that a one percent increase in the high-level urban function increases the theoretical size of cities by 1.5%.

The increasing returns to scale theory is represented by the job accessibility indicator in Table 2, the effect of which is also verified in this study. Model 1 shows that for a one percent increase in job accessibility level, theoretical sizes increased by 0.43%. The effect of the increasing returns to scale theory is further reinforced by the evidence that a one percent increase in urban diversity level, theoretical sizes increased by 27%.

In contrast to the theoretical underpinning of the central flow theory that cities receiving an increasing flow will have a larger theoretical size, the betweenness centrality factor in Model 1 shows that the UCLs that frequently fell on the shortest paths between other UCL had a lower theoretical population size. To be specific, the theoretical size of a UCL decreased by 0.04% with a one percent increase in their betweenness centrality. This finding is not unexpected in the Australian context, where major cities are historically located on the coastline (Fig. 3), reducing their likelihood of falling frequently along the shortest paths between other settlements.

Model 2 broadly corroborates the results of Model 1, suggesting that the underlying drivers of theoretical city sizes are similar for both capital and non-capital cities. Since Model 2 represents non-capital cities exclusively, the capital city variable logically does not appear in this model.

The modelling results from the ≥4 K sample group (Model 3) follow the general trend observed in the ≥10 K group (Model 1), with the exception of four significant factors identified in Model 3 instead of five factors in Model 1. While the betweenness centrality factor was statistically significant in Model 1, this factor became insignificant in Model 3. However, the remaining four significant factors in Model 3 were also appeared in Model 1, and the signs of the coefficients remained consistent, suggesting the stability of these factors across different sample groups. The magnitude of the coefficients was slightly modified, which is expected due to the inclusion of relatively smaller-sized UCLs in Model 3. For example, Model 3 indicates that capital cities theoretically contained 25 times more people than other UCLs in this group. Overall, Model 3 explains 60% of the variation in city size. Additionally, Model 4 in Table 2 supports the conclusions drawn from Model 3, indicating that the determinants of theoretical city sizes are consistent across both capital and non-capital cities.

Table 3 presents findings generated from a two-stage least squares (2SLS) model applying instrument variable approaches to isolate the effects of reverse causality between city size and its determinants. Models 5 and 8 replicate Models 1 and 3, respectively; and are included in Table 3 to facilitate comparisons. Four 2SLS models were estimated. In Models 6 and 7, job accessibility and urban diversity are respectively instrumented based on the specification of Model 5 for the ≥10 K group, while Models 9 and 10 represent the same for the ≥4 K group. Identifying suitable instruments is widely recognised as a methodological challenge⁵⁹. Nevertheless, the strength test results of the applied instruments in this study (spatial lag of job accessibility and spatial lag of urban diversity) indicate that they were strong, as reflected by their partial R² values. The overall explanatory powers of the 2SLS models (Models 6, 7, 9, and 10) remained similar to their respective structural models (Models 5 and 8). Despite the strength of the instruments, both the Wu-Hausman and Durbin test results show that they are not statistically significant in Models 6 and 7 for the ≥10 K group, and in Model 10 for the ≥4 K group. This means that the urban diversity variable was not subjected to endogeneity bias for any of the sample groups whereas the presence of endogeneity is confirmed for the job accessibility variable for the ≥4 K group. Consequently, the linear regression coefficient for job accessibility in Model 8 is likely biased, and Model 9 provides the preferred, unbiased estimate for ≥4 K group. Irrespective of the presence/absence of endogeneity, the directions of the coefficients are identical, and the magnitudes of the coefficients are in an acceptable level of agreement between the models. As expected, the result from the 2SLS model (Model 9) shows a relatively weaker coefficient estimate for the job accessibility variable.

Table 3 Instrumental variable estimates

Full size table

These results establish a robust basis for estimating theoretical city sizes, enabling the subsequent comparison between cities’ actual and theoretical population sizes.

Actual vs. theoretical city sizes in Australia

This section presents results based on Model 1, as key findings remain consistent across alternative model specifications (Models 1–10). Figure 4 plots the theoretical population sizes (log-transformed) as estimated in Model 1 against the actual population sizes (log-transformed) for the 100 UCLs in the ≥10 K group, with selected UCLs labelled for contextual illustration.

**Fig. 4: Theoretical vs. actual population sizes of cities with ≥ 10,000 residents.**

It is evident that several UCLs, such as Perth, and Canberra, operated near equilibrium conditions in 2016, lying close to the diagonal line. Their actual population sizes closely matched their theoretical sizes. Importantly, actual population size itself was not a determinant of whether a UCL operated in equilibrium. For example, Perth had a large actual population (14.23 in log scale), while Port Pirie had a much smaller population (9.34 in log scale), yet both aligned closely with their estimated theoretical sizes.

Conversely, many cities exhibited stress—where actual population exceeded theoretical capacity—as indicated by points located above the diagonal (e.g., Melbourne, Gold Coast, Murray Bridge). Despite large differences in actual population, both Melbourne and Murray Bridge showed signs of stress, illustrating that stress was not simply a function of absolute population size.

Cities falling below the diagonal line represent cases where the actual population was lower than the theoretical size. These cities could be deemed unproductive because they have the available infrastructure and services to accommodate more people, yet experience population shortfalls. As with stress, underpopulation was not driven by size alone. For example, both Darwin (11.46 in log scale) and Lara (9.25 in log scale) had actual populations below their theoretical expectations, despite Darwin being substantially larger.

Having established how actual and theoretical city sizes align or diverge, the next section examines how these deviations relate to cities’ sustainability outcomes.

Actual vs. theoretical sizes and sustainability outcomes

A key hypothesis of this study is that cities perform more sustainably when their actual population sizes align closely with their theoretical sizes. This condition is referred to as having a ‘sustainable city size’. Figure 5 summarises findings generated from piecewise linear regression models showing how the deviations between theoretical and actual population sizes are linked to the three sustainability outcomes examined in this study (weekly median rent, % of households with two or more vehicles, and % of people walking to work) across the two groups of UCLs (≥4 K and ≥10 K). Further statistical details generated from these models are shown in Table 4. These models were estimated separately with and without the two covariates considered (population density and median household income), yielding 12 models in total (Models 11–22). Models without and with the covariates are referred to as the basic and expanded models, respectively. For theoretical size estimation, Model 1 was used for the ≥10 K group and Model 9 for the ≥4 K group.

**Fig. 5: Piecewise regression plots showing relationship between deviations from theoretical city size and sustainability outcomes.**

Table 4 Piecewise regression analysis results

Full size table

In Fig. 5, the x-axis shows the percentage deviation of each UCL from its theoretical population size (on a log scale), while the y-axis presents the real values of the three sustainability outcome variables. A zero value on the x-axis represents perfect population equilibrium; positive values indicate actual population exceeding theoretical size, and negative values indicate population shortfall. Vertical lines in Fig. 5 indicate breakpoints – i.e., the optimal point for each outcome, estimated using Stata’s nl (nonlinear regression) command.

Table 4 confirms that all basic models (without covariates) achieve statistical significance at the 95% confidence level, suggesting that deviations in city size meaningfully predict sustainability outcomes regardless of sample groups. Including covariates substantially increased explanatory power (expanded models), suggesting that sustainability outcomes are also influenced by other factors. Nevertheless, the primary intent here is to test the robustness of the deviation effect. Table 4 demonstrates that the sign and magnitude of the deviation coefficients remain stable across basic and expanded models, confirming that city size deviation is a robust predictor of sustainability. Figure 5 is based on the basic models, offering a consistent and interpretable visual representation of the core relationships without the influence of covariates.

In Fig. 5a, UCLs with actual populations below their theoretical size experienced higher rental burdens. Prior to the breakpoint, a 1% reduction in the deviation (i.e., actual size approaching theoretical) was associated with $9 (β₁) decrease in the rent, reaching the lowest predicted rent of $244 (as indicated by the model constant) when the deviation was approximately -3% (α). The figure is $30 lower than the overall average rent ($274) for this group, translating to annual savings of $1560 per household. After the breakpoint, rent increased by $3 (β₂) for every 1% increase in actual population compared to theoretical size. A similar pattern is evident for the ≥10 K group (Fig. 5b), although the breakpoint shifts to 1.12% with different rates of decline and increase (β₁ = −4.29; β₂ = 6.90). The minimum rent at the breakpoint is $256, which is $27 less than the group’s average of $283, implying annual savings of $1404 per household.

Figure 5c, d show that equilibrium city sizes are associated with lower household vehicle ownership. In the ≥4 K sample, two-plus vehicle households decline to 50% at the breakpoint (α = −1.20), then increased thereafter (Model 15). For the ≥10 K sample (Fig. 5d and Model 17), two-plus vehicle ownership declined significantly up to α = −4.21, after which the pattern flattened. Across both groups, equilibrium cities had about 2.5% fewer households with multiple vehicles (50% vs. 52.5%).

Sustainable travel outcomes are shown in Fig. 5e, f. In the ≥10 K group (Fig. 5f), walking rates rose by 0.06% for every 1% shift towards equilibrium, peaking at 4% when the deviation reached +3%. Beyond this point, walking declined by 0.14% per 1% increase in actual population size. On average, 4% of commuters walked to work in UCLs with equilibrium sizes, higher than the group mean of 3.52%. Similar trends are observed for the ≥4 K group.

These findings consistently support the theoretical proposition that deviations from sustainable city size, in either direction, are associated with measurable reductions in sustainability performance, reinforcing the value of aligning actual city sizes with their theoretical capacities.

Discussion

The optimum city size has long been debated in the literature, yet findings linking city size to sustainability outcomes remain inconsistent. This study posits that such inconsistency arises from relying on actual population measures, rather than assessing cities against their theoretical capacity to function within an urban system. To test this, a unified conceptual framework was developed by integrating four foundational theories—location fundamentals, increasing returns to scale, central place, and central flow. The findings demonstrate the utility of this integrated approach for estimating theoretical city size and confirm that these theories collectively explain variation across urban contexts, challenging the view that they are only applicable in specific spatial or temporal conditions^36,38,56.

This theoretical contribution is supported by empirical evidence. The study found that the determinants of theoretical city size—particularly capital city status, job accessibility, urban diversity, and high-level urban functions—are consistent across different sample groups (≥10 K and ≥4 K UCLs) and remain stable when capital cities are excluded. While betweenness centrality exhibited a negative association with city size among larger UCLs, this finding aligns with Australia’s unique coastal urban morphology. Instrumental variable analyses further validated the model, identifying endogeneity in the job accessibility measure for the ≥4 K group but confirming the stability of the core results. Together, these findings provide strong empirical support for factors determining theoretical city size and set the stage for testing the main hypothesis: that cities perform better when their actual populations closely align with their theoretical size—a condition defined in this study as sustainable city size. This hypothesis is confirmed by piecewise regression models, which show that cities operating near their theoretical sizes—within a ± 4% deviation—consistently achieve better outcomes in rental affordability, multi-vehicle ownership, and active commuting. Based on the findings, three key conclusions are drawn: (1) Theoretical city size can be determined a prior; (2) Actual city size does not determine whether it operates near theoretical size; and (3) A city’s sustainability hinges on its equilibrium size, not its actual size.

Urban planning typically relies on projected population growth^5,7. For existing cities, this is relatively straightforward to derive due to known demographic trends. However, determining the population for new cities is more complex, particularly given their diversity in type and function—ranging from capital cities to smart cities, eco-cities, and airport hubs^1,60,61. Examples include Canberra, Brasilia, Putrajaya, Islamabad, Abu Dhabi, Dholera, Songdo, Shenzhen, Kwinana, Yanbu, Waterfall City, and Lanseria Airport City^{4,61,62,63,64,65,66}. Many planned cities, such as Canberra, Yanbu, Putrajaya have used arbitrary or politically motivated population targets, lacking theoretical justification^63,67,68. This study addresses that gap by demonstrating that theoretical population size can be derived from known or assignable characteristics—such as capital city status, planned job accessibility, urban function diversity, and network centrality. These factors are either designed explicitly by planners (e.g., land use zoning, transit infrastructure) or known in advance (e.g., whether the city is a capital). The framework is equally useful for recalibrating the size of existing cities where traditional trend-based projections fall short. Such projects often fail to account for inter-city relationships or the impact of new infrastructure on regional connectivity⁷. By incorporating central place and central flow dynamics, the model captures how investments or spatial shifts influence population potential—something absent in conventional methods.

The findings presented in this study confirm that cities of any actual size can operate at or near their theoretical size. For example, both Perth (a large capital city) and Port Pirie (a small regional centre) aligned closely with their theoretical size in 2016. This supports the study’s hypothesis: that each city has a unique theoretical size, shaped by a specific combination of factors—capital status, urban function diversity, job accessibility, professional job share, and network centrality. These determinants are dynamic and can be modified through targeted planning interventions, suggesting that a city’s theoretical size is not static. For example, cities operating above their theoretical size (e.g., Melbourne or the Gold Coast) could increase job accessibility or functional diversity through new transport links or decentralised employment strategies, helping align actual and theoretical sizes. Conversely, underperforming cities like Darwin or Lara already have the capacity to absorb growth without further investment—offering latent potential for spatially distributed growth. These findings suggest a new way of planning for growth—not by reinforcing expansion in already large cities, but by identifying and leveraging untapped capacity in smaller, well-positioned settlements. Unlike traditional population distribution models, this approach embeds strategy, infrastructure alignment, and systemic capacity into city sizing decisions⁶⁹.

Findings presented in Fig. 5 show that cities whose actual sizes closely align with their theoretical sizes—estimated here as being within ±4%—consistently perform better across economic, socio-environmental, and environmental sustainability metrics. Economically, households in equilibrium cities save approximately $1560 per year in rent, equating to $5.3 billion in national savings across Australia’s rental population (30.6% of 11 million households in Australia lived in rental properties in 2021). Socio-environmentally, equilibrium cities have 2.5% fewer households owning two or more vehicles, reducing car-dependence and enhancing social inclusion—equivalent to 275,000 households shifting to fewer vehicles in Australia. Environmentally, these cities support 0.5% more walking trips to work, generating 44,160 additional walking trips daily among the Australian commuters (there were 9.2 million commuters in Australia in 2021). These optimal outcomes occur not at extremes of scale, but at the point where actual and theoretical sizes are aligned—validating the sustainable city size hypothesis. The findings provide an empirically grounded target range (approximately −4.2% to +3.2%) that can serve as a benchmark for planning sustainable cities.

A range of actionable insights are drawn from the conceptual framework (Fig. 2) and empirical findings. First, for strategic planning in new cities: the ability to estimate theoretical size using a priori indicators enables more informed decisions on land use, infrastructure investment, and zoning provisions from the outset of city development. Second, for targeting growth in existing cities: underutilised cities (e.g., Lara, Darwin) can be identified as strategic growth nodes, while over-capacity cities (e.g., Melbourne, Gold Coast) may require interventions—such as high-speed rail, employment decentralisation, or land use reforms— to expand their theoretical size and reduce pressure on infrastructure and housing. Third, as an empirical planning benchmark: the ±4% deviation threshold around theoretical size provides an empirically grounded benchmark for assessing whether a city is operating within a sustainable population range. Fourth, for policy-responsive growth modelling: the framework highlights that theoretical size is shaped by modifiable factors such as connectivity, diversity, and accessibility. This enables policy levers to proactively shift a city’s capacity to accommodate growth sustainably. Finally, for scenario planning and investment alignment: by embedding the framework into spatial planning or land use-transport models, agencies can assess how various interventions (e.g., transit upgrades, zoning changes) alter theoretical capacity and associated sustainability outcomes.

While the study presents a novel and empirically supported framework, several limitations and future directions remain. First, city boundary definitions: UCLs provides national consistency but may not fully capture functional urban areas such as labour markets. Future studies could test alternative spatial definitions to assess boundary sensitivity. Second, sustainability indicators: the study relies on three proxies—rent, multi-vehicle ownership, and walking to work—which reflect data availability rather than a comprehensive sustainability framework. Future research should consider broader indicators such as emissions, equity, or health. This is why the study frames it as a theory rather than a principle: it is strongly supported but requires further evidence for generalisation. Third, generalisability: the model is developed in an Australian context, and its relevance to other countries or metropolitan regions remains to be tested. Comparative studies could strengthen its applicability. Fourth, governance and implementation: institutional, political, or financial constraints may limit efforts to achieve theoretical city size. Qualitative studies could explore these barriers. Fifth, integration into dynamic models: applying the framework in Land Use–Transport Interaction (LUTI) or simulation models would support long-term, scenario-based planning. Finally, measurement of size deviation: the use of log-residuals converted to percentage terms assumes small linear deviations. While bias is likely minimal here, future studies could test alternative formulations or directly model residuals to assess robustness across broader size distributions.

Methods

This study adopts a three-step empirical strategy building on the conceptual framework presented in Fig. 2: (a) Estimating the theoretical size of each city; (b) Measuring the level of deviation between actual and theoretical population sizes; and (c) Examining the relationship between these deviations and cities’ sustainability outcomes.

Study context and city boundary definition

Defining the boundary of a city is a long-standing methodological challenge in urban research⁷⁰. Common approaches include administrative boundaries (e.g., municipalities), morphological definitions based on contiguous built-up areas (e.g., density), functional urban areas delineated by labour market integration or commuting flows, and thresholding based on night-time lights^71,72,73. Each approach captures different aspects of urbanity—governance, physical form, or socioeconomic function—and involves trade-offs^74,75. The study was conducted in Australia and applies the country’s statistical definition of urban areas—Urban Centres and Localities (UCLs)—developed by the Australian Bureau of Statistics (ABS)⁷⁶. Only UCLs classified as urban centres are considered, excluding localities. An urban centre is defined as a cluster of contiguous SA1s (Statistical Area Level 1) where each SA1 has a population density of ≥100 persons/km² and a dwelling density of ≥50 dwellings/km², with an aggregated population of at least 1000 people. This approach offers national consistency and comparability across Australia’s urban settlement system. While UCLs may not fully capture functional linkages (e.g., commuting across city edges), they provide a pragmatic spatial unit for testing city size theories. Moreover, the key explanatory variables used in this study—such as amenities, density, and job accessibility—are closely aligned with the spatial extent of UCLs, reinforcing their suitability for the conceptual and empirical framework.

This study used data from the 2016 Census, as the 2021 Census was conducted during COVID-19 lockdowns (in some states and territories) and may not reflect typical behavioural outcomes. In 2016, 1853 UCLs were identified. UCLs located within Mainland Australia and meeting the ABS urban definition (population ≥1000) were initially selected, resulting in a study sample of 655 UCLs (Fig. 3). Subsequently, the hypotheses were tested across two population groups to verify the robustness of findings: (a) UCLs with populations of ≥4000 (n = 205), enabling analysis across a diverse range of urban centres; and (b) UCLs with populations of ≥10,000 (n = 100), focusing on relatively larger cities. Each group was further subdivided by excluding the seven UCLs designated as state or territory capitals in Mainland Australia to explore whether the determinants of theoretical city size operate differently between capital and non-capital cities. These capital cities also attract the largest share of international migrants in Australia⁷⁷, although recent trends suggest this dominance is diminishing⁵⁸.

Mathematical formulation of theoretical city size

The choice of a place to live may be assumed to be driven by utility maximization, which occurs when the benefits of living equal or exceed the costs of living. These benefits and costs can be represented as functions of locational and network advantages and disadvantages, as conceptualised in Fig. 2. However, city size itself can act as both a benefit (e.g., greater amenities) and as a cost (e.g., longer commuting distance), as discussed in the Introduction.

Using job accessibility and distance to a train station as example factors representing benefits (B) and costs (C) respectively, the functional forms can be expressed as:

$$B=f({size},{job\; accessibility})$$

(1)

$$C=f({size},{distance\; to\; train\; station})$$

(2)

Assuming a standard Cobb-Douglas specification, these functions can be further specified as:

$$B={{size}}^{\alpha }\,\times \,{{job\; accessibility}}^{\beta }$$

(3)

$$C={{size}}^{\gamma }{\times {distance\; to\; train\; station}}^{\delta }$$

(4)

In an equilibrium state, the marginal cost equals the marginal benefit, that is:

$$\frac{\delta C}{\delta ({size})}=\frac{\delta B}{\delta ({size})}$$

(5)

Expanding the derivatives:

$$\Longrightarrow \frac{\delta ({{size}}^{\gamma }{{distance\; to\; train\; station}}^{\delta })}{\delta ({size})}=\frac{\delta ({{size}}^{\alpha }\,{{job\; accessibility}}^{\beta })}{\delta ({size})}$$

(6)

$$\Longrightarrow {\gamma {size}}^{\gamma -1}{{distance\; to\; train\; station}}^{\delta }=\alpha {{size}}^{\alpha -1}\,{{job\; accessibility}}^{\beta }\,$$

(7)

Rearranging:

$$\Longrightarrow {{size}}^{\gamma -\alpha }=\,\frac{\alpha }{\gamma }\,\frac{{{job\; accessibility}}^{\beta }}{{{distance\; to\; train\; station}}^{\delta }}$$

(8)

Equation 8 can be log-linearized to obtain an estimable function:

$$(\gamma-\alpha){\text{ln}}(size)=\,{\text{ln}}\left(\frac{\alpha}{\gamma}\right)+\beta \,{\text{ln}}(Job\,accessibility)-\delta \,{\text{ln}}(distance\,to\,train\,station)$$

(9)

Generalising using Y and X as vectors of benefits and costs respectively:

$${\text{ln}}(size)=a+\sum _{k}{b}_{k}{\text{ln}}({Y}_{k})-\sum _{j}{c}_{l}{\text{ln}}({X}_{l})$$

(10)

where, a = intercept,$\,{b}_{k}$ = coefficients of benefit factors, and$\,{c}_{l}$ = coefficients of cost factors.

Equation10 shows that a city’s theoretical size can be systematically derived based on its associated advantages and disadvantages as conceptualised in Fig. 2. This estimated value is referred to as the city’s theoretical size throughout the study. To operationalise this model empirically, indicators representing the key factors influencing theoretical city size were constructed from various spatial and census data sources, as described below.

Operationalising indicators of theoretical city size

Building on the theoretical model (Eq.10), this section describes the operationalisation of the framework using empirical data. It details how key factors influencing theoretical city size were measured, the indicators used to represent these factors, and the sources of data, and the methods applied to estimate the theoretical population sizes of UCLs across Australia (Table 5).

Table 5 Operationalisation of theoretical city size determinants: theories, factors, indicators, and data sources

Full size table

The study focuses exclusively on indicators whose values can be either derived or assigned for both new and existing cities. For example, in the case of a new city, an urban planner can exogenously assign a target density, measure distance to planned public transport services, quantify the provision of public amenities (e.g., parks, playgrounds, childcare, schools, hospitals), estimate the number of jobs accessible within a certain travel time, regulate land use diversity through zoning controls, estimate the population or specialist functions in surrounding areas (borrowed size or borrowed function), and calculate centrality measures based on geographic location. This study used geographic centrality rather than flow centrality because the former only requires the geographic position of a development, whereas the latter depends on empirical data about flows of people, goods, or transaction—information unavailable for new developments. Accordingly, some factors known to be associated with city size, such as crime rates, urban sprawl, or rent levels, were excluded from the model^28,34. These exclusions were made to ensure that the framework remains plannable and applicable to both new and existing cities, enhancing its practical relevance for policymakers and urban planners. While many factors were directly sourced, some are derived as detailed below.

Time to train station: The study used OSM (open street map) railway network data featuring the 2021 version of railway lines and stations, downloaded from AURIN (Australian Urban Research Infrastructure Network). The driving time (min) between all the SA2s in Mainland Australia and their closest train station was derived using the Closest Facility function in ArcGIS Pro (v.3.0) based on its built-in transport network. The driving time of all SA2s belonging to a UCL was averaged to derive UCL-level access to train stations.

Labour market efficiency: Labour market efficiency is measured by the job accessibility indicator. Research shows that areas with good access to employment exhibit increasing returns to scale due to agglomeration effects⁴⁶. Given that not all jobs are equally attractive (i.e., jobs located close by are more attractive than those located further away), a distance decay parameter was initially estimated to discount the travel time effect to derive the job accessibility level of each UCL. To estimate this parameter, 660 SA2s (statistical area level 2) were randomly selected from 2308 SA2s because of the limit posed by ArcGIS Pro (v.3.0) to derive OD (origin-destination) cost matrix. The SA2s were chosen because it is the smallest statistical unit for which the commuting flow data were available. This selection also enabled a finer level analysis than UCL, given that most UCL contains one or more SA2s (e.g., Melbourne as a UCL contains 292 SA2s). The commuting flow data, people who drove to work, between the SA2s were downloaded using the ABS TableBuilder functionality. The drive time (min) between the centroids of SA2s was derived using the OD (origin-destination) Cost Matrix function in ArcGIS Pro (v.3.0) based on its built-in transport network. The flow data were classified by travel time bins, ranging from 3 to 149 min, with one-minute interval. Commuting flows exceeding 149 min were found to be negligible (0.01%). The flow within each band was transformed into three indicators: (a) natural log; (b) standardised probability density function (PDF); and (c) inverse cumulative density function (ICDF)⁷⁸. A range of curve estimation regression models were estimated in SPSS using these indicators including logarithmic, inverse, power, and exponential. Among them, the negative exponential function was found to better fit the data with a decay coefficient of 0.058 (Fig. 6). As a result, the negative exponential decay parameter was employed in this study, which is also the most widely used function in travel behaviour research⁷⁹.

**Fig. 6: Travel time decay curve for commuting flows between SA2s.**

The derived distance decay parameter was then used to calculate job accessibility of the 655 UCLs based on Eq.11. The drive times between the centroids of the 655UCLs and all SA2s in Mainland Australia were derived using the OD Cost Matrix function in ArcGIS Pro based on its built-in transport network. This derivation was conducted in stages, separating SA2s for different states and territories, so that the maximum number of input features does not exceed the limit imposed by the software. The number of jobs available within the SA2s was derived based on the total number of commuting flows destined there from all SA2s.

$${A}_{i}=\mathop{\sum }\limits_{j=1}^{n}{O}_{j}{e}^{-\beta {t}_{{ij}}}$$

(11)

where, ${A}_{i}$ = Jobs accessibility of UCL_i, ${O}_{j}$ = Number of jobs available in SA2_j, $\beta$ = estimated decay parameter, ${t}_{{ij}}$ = driving time (min) between UCL_i and SA2_j.

Job diversity: The urban diversity was calculated following the formula of Simpson’s diversity index (Eq.12). The study used occupation data from the 2016 ABS census for the 655 UCLs and calculated the percentage of people employed in the eight occupational status categories (Managers, Professionals, Technicians and Trades Workers, Community and Personal Service Workers, Clerical and Administrative Workers, Sales Workers, Machinery Operators and Drivers, and Labourers). The diversity index ranges between 0 (homogenous) and 1 (fully diverse) for the UCLs.

$${Diversity}=1-\left(\frac{\sum n(n-1)}{N(N-1)}\right)$$

(12)

where, n= % of employment in a particular category; N= % of employment in all categories

Centrality and intermediacy: Two centrality measures were employed based on spatial network analysis in this study: betweenness centrality to represent intermediacy and closeness centrality to measure how close a city is located to all other cities. Unlike topological/social network analysis, where the distance between two contacts has little meaning, spatial network analysis takes into account the geographic distance between two contacts to derive centrality indicators. This is particularly important in the case of cities because unlike social contacts in which not everyone is connected with everyone else, all cities are to some extent connected with each other, and as a result, all cities will result in an identical score for some centrality measures if the distances between them are not taken into account. Different types of flows (e.g., passenger, telephone calls) between cities are often used as weights in centrality measures⁸⁰. However, this study has not applied such weights given that flow data cannot be known for a new city. The two centrality indicators were derived using the Urban Network Analysis Toolbox 1.01 in ArcMap (v.10.8) (http://cityform.mit.edu/projects/urban-network-analysis). The tool required a network dataset and building points/polygons. The OSM road network for Australia was downloaded from AURIN and used to derive a network dataset with travel distance as an impedance. Given the national level analytical focus of this study, local and minor roads were excluded, keeping motorways, primary, trunk, secondary, and tertiary roads and giving a hierarchy score of 1, 2, 3, 4, and 5, respectively. For similar reasons, the resulting road networks were simplified through collapsing dual roads and thinning operations. The 655 UCL centroids were used as inputs for ‘Buildings’ in the tool to derive the indicators.

Borrowed size and borrowed function: The same distance decay parameter was used to calculate the borrowed size and borrowed function indicators (Table 5). A spatial lag formulation of the UCL boundary was applied, which ensured that the population or high-functional jobs that were located within the boundary were excluded from analyses for the derivation of these two indicators. For this reason, a large UCL receives a low borrowed size score compared to a smaller UCL located next a larger UCL. For example, Melbourne as a UCL had a borrowed population size of 29 K (distance weighted); whereas Lara, a small UCL located next to Melbourne, had a borrowed population size of 250 K. Also, the application of the distance decay function ensured that the population/high-functional jobs that were located in nearby SA2s received a higher weight than those located in distant SA2s.

Operationalising the theoretical city size model

All indicators were nature log-transformed to be consistent with the functional form derived in Eq.10, except for the Capital City indicator, which was dummy-coded. The population sizes of the UCLs were regressed on the explanatory factors listed in Table 5 in a linear regression model – i.e., to estimate how much of the observed variation in city sizes can be explained by the identified theoretical factors. A four-step process was employed to estimate the model:

Table 6 Correlations among the factors* of city sizes (≥10 K Sample)

Full size table

a.
Correlation analysis: A correlation analysis was conducted among the explanatory factors, and variables with strong correlations (>0.75) were excluded. A strong correlation was observed between capital city and travel time to capital city factors, and between borrowed size and borrowed function factors (Table 6). Consequently, the capital city and borrowed size factors were retained due to their better explanatory power of theoretical city size;
b.
Factor identification: The statistical significance of the remaining factors was tested in a simple (univariable) linear regression model and any factors with insignificant (at the 0.1 level) statistical association with theoretical city size were removed from further analysis;
c.
Multicollinearity analysis: The remaining factors were then entered into a multiple linear regression model to check multicollinearity among the factors based on the variance inflation factor (VIF) and variables with the highest VIF values were gradually removed until all the variables in the model achieve a VIF value of ≤5;
d.
Model refinement: A parsimonious model of theoretical city size was estimated by gradually removing insignificant factors (p > 0.05).

Addressing endogeneity in explanatory variables

The above steps were separately estimated for the two sample groups of UCLs ( ≥ 10 K, and ≥4 K). The significant factors were evaluated for potential reverse causality based on theoretical reasoning, revealing that two of them – job accessibility and urban diversity - were susceptible to endogeneity bias. Job accessibility serves as an indicator that captures the planning outcomes of investments in both transport and land uses, particularly the job opportunities⁵⁷. Consequently, such investments tend to be made in areas with high population concentration to maximise the benefits⁸¹. Similarly, urban diversity, represented by the variety of job options available, plays a crucial role in attracting individuals to reside and work in cities, thereby fostering meaningful employment opportunities⁵⁸. Contrary to this argument is that large cities inherently possess a diverse knowledge base, which in turn attracts firms seeking to capitalise on this wealth of expertise through diversifying their operations. Urban planning also responds to this diversity by provisioning alternative goods and services that cater to the needs of the population⁸².

This study applied the instrument variable approach to isolate the reverse causality bias, operationalised within a two-stage least square (2SLS) regression framework – a method widely utilised in the field^36,83. The spatial lags of job accessibility and urban diversity factors were used as instruments for their respective variables (Table 1). The selection of these instruments was guided by Tobler’s first law of geography⁸⁴^{, p.236}: “everything is related to everything, but near things are more related than distant things”. This implies that cities surrounded by others with higher levels of job accessibility and urban diversity are likely to exhibit higher levels themselves. However, it is unlikely that population size in one city will directly determine job accessibility and urban diversity levels in neighbouring cities. Using spatial lag as an instrument thus captures spillover effects by leveraging the influence that neighbouring areas’ characteristics exert on a UCL’s own conditions through spatial diffusion or policy interdependence. All models were estimated using Stata software (v.16).

Measuring deviations from theoretical size

The parsimonious model, as developed in Section 3.1, was run and the theoretical size of UCLs (unstandardised predicted values) and residuals (unstandardised) were saved in a database. The residuals denote the differences between actual population sizes (natural log transformed) and theoretical (natural log transformed) population sizes of the UCLs. The residuals were transformed as the percentage of actual population to represent percentage differences between the actual and theoretical population sizes.

Assessing sustainability outcomes of size deviation

The third research question essentially tests the validity of the inherent hypothesis that cities in the equilibrium of theoretical and actual population sizes would produce more sustainable outcomes for cities. If this hypothesis is to be true, then an optimum sustainability outcome for cities would be achieved when the differences between actual and theoretical population sizes are zero. Empirically, this is operationalised using piecewise/segmented linear regression models. This type of model has been widely applied to test the effects of different policy interventions⁸⁵, including in travel behaviour research⁸⁶. Mathematically, the model can be expressed as:

$$y=\left\{\begin{array}{l}{\beta }_{0}+{\beta }_{1}x+\gamma Z+\,\varepsilon ,x < \alpha \\ {\beta }_{0}+{\beta }_{1}x+{\beta }_{2}\left(x-\alpha \right)+\gamma Z+\varepsilon ,x\ge \alpha \end{array}\right.$$

(13)

where, $y$ = sustainability outcomes of UCLs, $x$ = deviations (% difference between actual and theoretical population sizes), $\alpha$ = breakpoint to be estimated, ${\beta }_{0}$ = constant, ${\beta }_{1}$ and ${\beta }_{2}$ = model coefficients before and after the breakpoint, $\gamma$ is a vector of coefficients corresponding to covariates Z, and $\varepsilon$ = error term. The effect of Z is assumed to be the same on both sides of the breakpoint.

The piecewise regression models were estimated in Stata (v.16). The breakpoint was determined using Stata’s nl (nonlinear regression) command (https://stats.oarc.ucla.edu/stata/faq/how-can-i-find-where-to-split-a-piecewise-regression/). To facilitate interpretation, the deviation variable was centred at the estimated breakpoint, allowing the constant term to represent the predicted sustainability outcome at the sustainable city size.

Three types of sustainability outcomes of cities were tested in this study: economic, socio-environmental, and environmental. The economic outcome is measured by the weekly median rent paid by households. Affordable rent is considered to be directly related to urban productivity as it enables concentration of workers closer to their jobs and an important precondition for digital start-ups⁸⁷. The socio-environmental outcome is measured by the percent of households having two or more vehicles. High rates of multi-vehicle ownership often reflect a lack of accessible, affordable, and inclusive transport options, which can increase household financial vulnerability and social exclusion^88,89. Moreover, two or more vehicle ownership is associated with increased private vehicle travel and emissions, making it a negative indicator of both social and environmental sustainability⁹⁰. Environmental sustainability is further assessed by the percentage of people (aged 15 and over) who walked to work. Walking is associated with a range of environmental benefits including air quality, reduced noise and reduced carbon emissions through a lower use of car-based fossil-fuel. Studies have shown that if a one percent car trip is substituted by active travel, it would reduce fuel consumption by 2–4%⁹¹. All three datasets were downloaded from the ABS Census 2016.

The piecewise regression models were estimated twice: first, without any additional covariates; and second, adding two covariates– population density and median household income. The intent was to check the robustness of the sustainable city size effects on sustainability outcomes. These covariates were selected given their established relevance to the sustainability outcomes under consideration: population density has been consistently shown to influence housing costs through agglomeration economies⁹², to reduce vehicle ownership rates by decreasing car dependency^90,93, and to promote walking behaviour through more compact, walkable environments⁹⁴. Similarly, median household income affects rental affordability⁹⁵, multi-vehicle ownership⁹⁶, and walking participation, as higher-income groups often have more transport choices but are less dependent on walking unless supported by a high-quality urban environment⁹⁷. By including these covariates, the analysis provides a more conservative test of the hypothesis, ensuring that the observed effects of size deviations are not confounded by baseline density or income characteristics.

Data availability

The sources of data used in this study are listed in Table 5 and are publicly available for download. Derived datasets generated and analysed during the current study are available from the corresponding author upon reasonable request.

Abbreviations

ABS:: Australian Bureau of Statistics
AURIN:: Australian Urban Research Infrastructure Network
BoM:: Bureau of Meteorology
GIS:: Geographic Information System
ICDF:: Inverse Cumulative Density Function
OD:: Origin–Destination
OSM:: OpenStreetMap
PDF:: Probability Density Function
SA1:: Statistical Area Level 1
SA2:: Statistical Area Level 2
UCL:: Urban Centres and Localities
VIF:: Variance Inflation Factor
2SLS:: Two-Stage Least Squares

References

Moser, S. & Côté-Roy, L. New cities: Power, profit, and prestige. Geogr. Compass 15, e12549 (2021).
Article Google Scholar
UN HABITAT. World Cities Report 2022: Envisaging the Future of Cities. (https://unhabitat.org/sites/default/files/2022/06/wcr_2022.pdf, 2022).
Watson, V. African urban fantasies: dreams or nightmares?. Environ. Urban. 26, 215–231 (2014).
Article Google Scholar
Datta, A. New urban utopias of postcolonial India: ‘Entrepreneurial urbanization’ in Dholera smart city. Gujarat. Dialogues Hum. Geogr. 5, 3–22 (2015).
Article Google Scholar
Myers, D. Demographic futures as a guide to planning: California’s Latinos and the compact city. J. Am. Plan. Assoc. 67, 383–397 (2001).
Article Google Scholar
Berke, P. R., Godschalk, D. R., Kaiser, E. J. & Rodriguez, D. A. Urban land use planning. (University of Illinois Press Urbana, 2006).
Wang, X. & Hofe, R. Research methods in urban and regional planning. (Springer Science & Business Media, 2008).
Bettencourt, L. M. A. The origins of scaling in cities. Science 340, 1438–1441 (2013).
Article CAS Google Scholar
Turok, I. & McGranahan, G. Urbanization and economic growth: the arguments and evidence for Africa and Asia. Environ. urbanization. 25, 465–482 (2013).
Article Google Scholar
Mendez, P., Atienza, M. & Modrego, F. Urbanization and productivity at a global level: new empirical evidence for the services sector. Reg. Sci. Policy Pract. 15, 1981–1998 (2023).
Article Google Scholar
Frick, S. A. & Rodríguez-Pose, A. Big or Small Cities? On city size and economic growth. Growth Change 49, 4–32 (2018).
Article Google Scholar
Glaeser, E. L. & Sacerdote, B. Why is there more crime in cities?. J. Polit. Econ. 107, S225–S258 (1999).
Article Google Scholar
Mohajeri, N., Gudmundsson, A. & French, J. R. CO₂ emissions in relation to street-network configuration and city size. Transport. Res. Part D: Transp. Environ. 35, 116–129 (2015).
Article Google Scholar
Zhou, B., Rybski, D. & Kropp, J. P. The role of city size and urban form in the surface urban heat island. Sci. Rep 7, 4791 (2017).
Article Google Scholar
Czinkan, N. & Horváth, Á Determinants of housing prices from an urban economic point of view: evidence from Hungary. J. Eur. Real. Estate Res. 12, 2–31 (2019).
Article Google Scholar
Krupka, D. J. Are big cities more segregated? Neighbourhood scale and the measurement of segregation. Urban Stud. 44, 187–197 (2007).
Article Google Scholar
Tavares, A. F. & Carr, J. B. So close, yet so far away? The effects of city size, density and growth on local civic participation. J. Urban Aff. 35, 283–302 (2013).
Article Google Scholar
Buch, T., Hamann, S., Niebuhr, A. & Rossen, A. What makes cities attractive? The determinants of urban labour migration in Germany. Urban Stud. 51, 1960–1978 (2013).
Article Google Scholar
Moon, M. J. & Norris, D. F. Does managerial orientation matter? The adoption of reinventing government and e-government at the municipal level. Inf. Syst. J. 15, 43–60 (2005).
Article Google Scholar
Chen, J., Davis, D. S., Wu, K. & Dai, H. Life satisfaction in urbanizing China: The effect of city size and pathways to urban residency. Cities 49, 88–97 (2015).
Article Google Scholar
Li, L. et al. Impacts of city size change and industrial structure change on CO₂ emissions in Chinese cities. J. Clean. Prod. 195, 831–838 (2018).
Article Google Scholar
Levinson, D. Network structure and city size. PLoS ONE 7, e29721 (2012).
Article CAS Google Scholar
Engelfriet, L. & Koomen, E. The impact of urban form on commuting in large Chinese cities. Transportation 45, 1269–1295 (2018).
Article Google Scholar
An, Q., Gordon, P. & Moore, J. E. A note on commuting times and city size: Testing variances as well as means. J. Transp. Land Use 7, 105–110 (2014).
Article Google Scholar
Horner, M. W. Extensions to the concept of excess commuting. Environ. Plan. A 34, 543–566 (2002).
Article Google Scholar
Gordon, P., Kumar, A. & Richardson, H. W. Congestion, changing metropolitan structure, and city size in the United States. Int. Regional Sci. Rev. 12, 45–56 (1989).
Article Google Scholar
Alonso, W. The economics of urban size. Pap. Reg. Sci. Assoc. 26, 66–83 (1971).
Article Google Scholar
Shen, D., Thill, J.-C. & Sun, J. Are Chinese cities oversized?. Int. Reg. Sci. Rev. 43, 632–654 (2020).
Article Google Scholar
Richardson, H. W. Optimality in city size, systems of cities and urban policy: A sceptic’s view. Urban Stud. 9, 29–48 (1972).
Article Google Scholar
Jiang, B., Yin, J. & Liu, Q. Zipf’s law for all the natural cities around the world. Int. J. Geogr. Inf. Sci. 29, 498–522 (2015).
Article Google Scholar
Gabaix, X. Z. ipf’s Law for Cities: An explanation. Q. J. Econ. 114, 739–767 (1999).
Article Google Scholar
Soo, K. T. Z. ipf’s Law for cities: A cross-country investigation. Reg. Sci. Urban Econ. 35, 239–263 (2005).
Article Google Scholar
Schmid, C. Specificity and urbanization: A theoretical outlook. In The Inevitable Specificity Of Cities: Napoli, Nile Valley, Belgrade, Nairobi, Hong Kong, Canary Islands, Beirut, Casablanca (Eds. Diener R. et al.) 287–307 (Zurich: Lars Müller Publishers, 2015).
Camagni, R., Capello, R. & Caragliu, A. One or infinite optimal city sizes? In search of an equilibrium size for cities. Ann. Reg. Sci. 51, 309–341 (2013).
Article Google Scholar
González-Val, R. & Olmo, J. A statistical test of city growth: Location, increasing returns and random growth. Munich Personal RePEc Archive (MPRA) Paper No. 27139. https://mpra.ub.uni-muenchen.de/27139/ (2010) (accessed 01/08/2025).
Davis, D. R. & Weinstein, D. E. Bones, bombs, and break points: the geography of economic activity. Am. economic Rev. 92, 1269–1289 (2002).
Article Google Scholar
Chen, Z., Fu, S. & Zhang, D. Searching for the parallel growth of cities in China. Urban Stud. 50, 2118–2135 (2013).
Article Google Scholar
Ayuda, M. I., Collantes, F. & Pinilla, V. From locational fundamentals to increasing returns: the spatial concentration of population in Spain, 1787–2000. J. Geographical Syst. 12, 25–50 (2010).
Article Google Scholar
Gassebner, M., Schaudt, P. & Wong, M. H. L. Competition and political violence. J. Dev. Econ 162, 103052 (2023).
Article Google Scholar
Jedwab, R., Johnson, N. D. & Koyama, M. Pandemics and cities: Evidence from the Black Death and the long-run. J. Urban Econ 139, 103628 (2024).
Article Google Scholar
Santos, J. L. & Fernández Fernández, M. T. The spread of urban–rural areas and rural depopulation in central Spain. Reg. Sci. Policy Pract. 15, 863–877 (2023).
Article Google Scholar
Esteban-Oliver, G. On the right track? Railways and population dynamics in Spain, 1860–1930. Eur. Rev. Econ. Hist. 27, 606–633 (2023).
Article Google Scholar
Gómez Valenzuela, V. & Holl, A. Growth and decline in rural Spain: an exploratory analysis. Eur. Plan. Stud. 32, 430–453 (2024).
Article Google Scholar
Nygaard, C. A. & Parkinson, S. Analysing the impact of COVID-19 on urban transitions and urban-regional dynamics in Australia. Aust. J. Agric. Resour. Econ. 65, 878–899 (2021).
Article Google Scholar
Krugman, P. Urban concentration: the role of increasing returns and transport costs. Int. Reg. Sci. Rev. 19, 5–30 (1996).
Article Google Scholar
Ciccone, A. & Hall, R. E. (National Bureau of Economic Research Cambridge, MA, USA, 1993).
Perl, A., Deng, T., Correa, L., Wang, D. & Yan, Y. Understanding the urbanization impacts of high-speed rail in China. Archives of Transp 58, 21–34 (2021).
Article Google Scholar
Christaller, W. Die Zentralen Orte in Suddeutschland. (Gustav Fischer, 1933).
Castells, M. The information age: Economy, society and culture (3 volumes). Blackwell, Oxf. 1997, 1998 (1996).
Google Scholar
Taylor, P. J., Hoyler, M. & Verbruggen, R. External urban relational process: introducing central flow theory to complement central place theory. Urban Stud. (Edinb., Scotl.) 47, 2803–2818 (2010).
Article Google Scholar
Li, X. & Neal, Z. P. Are larger cities more central in urban networks: A meta-analysis. Global Networks 24, e12467 (2024).
Article Google Scholar
Prieto-Curiel, R., Schumann, A., Heo, I. & Heinrigs, P. Detecting cities with high intermediacy in the African urban network. Comput. Environ. Urban Syst. 98, 101869 (2022).
Article Google Scholar
Phelps, N. A., Fallon, R. J. & Williams, C. L. Small firms, borrowed size and the urban-rural shift. Reg. Stud. 35, 613–624 (2001).
Article Google Scholar
Meijers, E. J., Burger, M. J. & Hoogerbrugge, M. M. Borrowing size in networks of cities: City size, network connectivity and metropolitan functions in Europe. Pap. Regional Sci. 95, 181–198 (2016).
Article Google Scholar
Yang, T., Zhu, Y. & Du, J. Is there a borrowed size in China’s urban agglomerations?. Prog. Geogr. 41, 1156–1167 (2022).
Article Google Scholar
Meijers, E. From central place to network model: theory and evidence of a paradigm change. Tijdschr. voor Econ. Soc. Geogr.98, 245–259 (2007).
Article Google Scholar
Lee, J., Arts, J. & Vanclay, F. Investigating institutional barriers and opportunities to an integrated approach for transport and spatial development: Mega urban transport development in a rapidly developing city, Seoul. J. Urban Aff. 46, 40–62 (2024).
Article Google Scholar
Kelly, M., Nguyen, M. & Triandafyllidou, A. Why migrants stay in small and mid-sized cities: Analytical and comparative insights. J. Int. Migr. Integr. 24, 1013–1027 (2023).
Google Scholar
Tyvimaa, T. & Kamruzzaman, M. The effect of young, single person households on apartment prices: an instrument variable approach. J. Hous. Built Environ. 34, 91–109 (2019).
Article Google Scholar
Côté-Roy, L. & Moser, S. ‘Does Africa not deserve shiny new cities?’The power of seductive rhetoric around new cities in Africa. Urban Stud. 56, 2391–2407 (2019).
Article Google Scholar
Van Noorloos, F. & Kloosterboer, M. Africa’s new cities: The contested future of urbanisation. Urban Stud. 55, 1223–1241 (2018).
Article Google Scholar
Herbert, C. W. & Murray, M. J. Building from scratch: New cities, privatized urbanism and the spatial restructuring of Johannesburg after apartheid. Int. J. Urban Reg. Res. 39, 471–494 (2015).
Article Google Scholar
Berghmans, L., Schoovaerts, P. & Teghem, J. Jr Implementation of health facilities in a new city. J. Oper. Res. Soc. 35, 1047–1054 (1984).
Article CAS Google Scholar
Van Leynseele, Y. & Bontje, M. Visionary cities or spaces of uncertainty? Satellite cities and new towns in emerging economies. Int. Plan. Stud. 24, 207–217 (2019).
Article Google Scholar
Hashim, A. R. A. A. B. Branding the brand new city: Abu Dhabi, travelers welcome. Place Brand. Public Dipl. 8, 72–82 (2012).
Article Google Scholar
Faria, J. R., Ogura, L. M. & Sachsida, A. Crime in a planned city: The case of Brasília. Cities 32, 80–87 (2013).
Article Google Scholar
Jones, M. A. Optimum City Size. Aust. N.Z. J. Sociol. 9, 32–36 (1973).
Article Google Scholar
Li, S., Liu, K. & Li, J. Reflections and Enlightenments on the Population Size Determination in Urban Master Planning in China: A Case Study of Xi’an City. Cheng Shi Gui Hua 31, 22–32 (2022).
CAS Google Scholar
Parham, E., Law, S. & Versluis, L. in Proceedings-11th International Space Syntax Symposium, SSS 2017. 103.101-103.117 (SSS).
Arribas-Bel, D., Garcia-López, M. À & Viladecans-Marsal, E. Building(s and) cities: Delineating urban areas with a machine learning algorithm. J. Urban Econ. 125, 103217 (2021).
Article Google Scholar
Dijkstra, L., Poelman, H. & Veneri, P. The EU-OECD definition of a functional urban area. OECD Regional Development Working Papers 2019/11. https://doi.org/10.1787/d58cb34d-en (2019).
OECD/European Commission. Cities in the World: A new perspective on urbanisation. OECD Urban Studies, OECD Publishing, Paris, https://doi.org/10.1787/d0efcbda-en (2020).
Moreno-Monroy, A. I., Schiavina, M. & Veneri, P. Metropolitan areas in the world. Delineation Popul. Trends J. Urban Econ. 125, 103242 (2021).
Article Google Scholar
Arcaute, E. et al. Constructing cities, deconstructing scaling laws. J. R. Soc. Interface 12, 20140745 (2015).
Article Google Scholar
Bosker, M., Park, J. & Roberts, M. Definition matters. Metropolitan areas and agglomeration economies in a large-developing country. J. Urban Econ. 125, 103275 (2021).
Article Google Scholar
Australian Bureau of Statistics (ABS). Urban Centres and Localities, ABS Website, accessed 1 August 2025.https://www.abs.gov.au/statistics/standards/australian-statistical-geography-standard-asgs-edition-3/jul2021-jun2026/significanturban-areas-urban-centres-and-localities-section-state/urban-centres-and-localities. (2022).
Australian Government. Shaping a nation: Population growth and immigration over time. https://population.gov.au/publications/publications-shaping-nation (Canberra, 2018) (accessed 01/08/2025).
Santana Palacios, M. & El‑Geneidy, A. Cumulative versus gravity‑based accessibility measures: Which one touse? https://doi.org/10.32866/001c.32444 (2022).
Halás, M., Klapka, P. & Kladivo, P. Distance-decay functions for daily travel-to-work flows. J. Transp. Geogr. 35, 107–119 (2014).
Article Google Scholar
Zhang, F., Ning, Y. & Lou, X. The evolutionary mechanism of China’s urban network from 1997 to 2015: An analysis of air passenger flows. Cities 109, 103005 (2021).
Article Google Scholar
Lee, J. K. Transport infrastructure investment, accessibility change and firm productivity: Evidence from the Seoul region. J. Transp. Geogr 96, 103182 (2021).
Article Google Scholar
He, X. Energy effect of urban diversity: An empirical study from a land-use perspective. Energy Econ 108, 105892 (2022).
Article Google Scholar
Bleakley, H. & Lin, J. Portage and path dependence. Q. J. Econ. 127, 587–644 (2012).
Article Google Scholar
Tobler, W. R. A computer movie simulating urban growth in the Detroit region. Econ. Geogr. 46, 234–240 (1970).
Article Google Scholar
Vergori, A. S. & Arima, S. Low-cost carriers and tourism in the Italian regions: A segmented regression model. Ann. Tour. Res 97, 103474 (2022).
Article Google Scholar
Haider, M. Diminishing Returns To Density And Public Transit. https://doi.org/10.32866/10679 (2019).
Gurran, N. et al. Urban productivity and affordable rental housing supply in Australian cities and regions. (AHURI Final Report No. 353). https://www.ahuri.edu.au/research/final-reports/353. https://doi.org/10.18408/ahuri7320001 (Australian Housing and Urban Research Institute Limited 2021).
Lucas, K. Transport and social exclusion: Where are we now?. Transp. policy 20, 105–113 (2012).
Article Google Scholar
Currie, G. & Delbosc, A. Exploring public transport usage trends in an ageing population. Transportation 37, 151–164 (2010).
Article Google Scholar
Newman, P. & Kenworthy, J. Sustainability and cities: overcoming automobile dependence. (1998).
Litman, T. Evaluating Active Transport Benefits and Costs: Guide to Valuing Walking and Cycling Improvements and Encouragement Programs. (Victoria Transport Policy Institute, https://www.vtpi.org/nmt-tdm.pdf, 2023).
Glaeser, E. & Gyourko, J. The economic implications of housing supply. J. Econ. Perspect. 32, 3–30 (2018).
Article Google Scholar
Cervero, R. & Guerra, E. Urban densities and transit: A multi-dimensional perspective. (2011).
Ewing, R. & Cervero, R. Travel and the built environment: A meta-analysis. J. Am. Plan. Assoc. 76, 265–294 (2010).
Article Google Scholar
Gan, Q. & Hill, R. J. Measuring housing affordability: Looking beyond the median. J. Hous. Econ. 18, 115–125 (2009).
Article Google Scholar
Giuliano, G. & Dargay, J. Car ownership, travel and land use: a comparison of the US and Great Britain. Transportation Res. Part A: Policy Pract. 40, 106–124 (2006).
Google Scholar
Buehler, R. & Pucher, J. Walking and Cycling in Western Europe and the United States: trends, policies, and lessons. T34–4 (TR News, 2012).

Download references

Acknowledgements

The author thanks the two anonymous reviewers and the editor of the journal, Professor Xiaoling Zhang, for their insightful comments and suggestions. This paper forms the foundation for the successful Australian Research Council Discovery Project (DP250101843), which contributed to the development of the research agenda presented herein.

Author information

Authors and Affiliations

Monash Institute of Transport Studies, Department of Civil and Environmental Engineering, Monash University 23 College Walk, Clayton, VIC, 3800, Australia
Liton Md Kamruzzaman

Authors

Liton Md Kamruzzaman
View author publications
Search author on:PubMed Google Scholar

Contributions

L.K.: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Visualization, Writing – Original Draft, and Writing – Review & Editing.

Corresponding author

Correspondence to Liton Md Kamruzzaman.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Kamruzzaman, L.M. Towards a theory of sustainable city sizes. npj Urban Sustain 5, 66 (2025). https://doi.org/10.1038/s42949-025-00254-4

Download citation

Received: 16 January 2025
Accepted: 22 July 2025
Published: 13 August 2025
Version of record: 13 August 2025
DOI: https://doi.org/10.1038/s42949-025-00254-4