Main

Urban green space (UGS) is generally understood as natural or human-designed vegetated spaces such as urban parks, yards, gardens, forests, woodlands, green belts, some agricultural areas and many other forms1. What is labeled as ‘green space’ can encompass a wide range of vegetation quantity and quality and be configured in a variety of spatial patterns. These variations can impact human well being and ecosystem services provided by UGS2,3. UGS benefits range from various ecosystem services (for example, ambient temperature regulation, air pollution reduction, stormwater drainage) to cultural services and human well being effects such as stress reduction, attention restoration and the promotion of social interactions and physical activity4,5. Despite the benefits, if neglected or mismanaged, UGS can become breeding grounds for pests and vectors of disease and engender fears of crime, posing a disservice to public health and safety6,7.

By definition, UGS is interwoven with the urban fabric, including its built, natural and socioeconomic elements8. Features of the natural environment such as topography, soils and climatic conditions shape the conditions for vegetation growth and vegetation types9,10,11. Built environment with its buildings and infrastructure layouts and land-use regulations determine the availability of land for UGS10. Rapid urbanization, population growth and urban sprawl can threaten existing UGS and limit opportunities for new UGS creation12,13. Urban sprawl has been associated with lower amounts of UGS14. However, other urbanization-related indicators such as population density have shown negative15,16,17, positive11 or no association with the amount of UGS18. In addition, cities’ socioeconomic characteristics often determine the demand, preferences and resources available for creating, preserving and maintaining UGS19, with wealthier cities usually having more abundant UGS17.

Understanding relationships between cities’ characteristics and UGS can reveal important information about the determinants of UGS availability and distribution. Evaluating the relative importance of environmental factors, such as climate, which are difficult to alter, compared with the effects of policy-modifiable characteristics, such as cities’ built environment, can help adjust expectations about the feasibility and potential impacts of changes in UGS and greening initiatives. Finally, knowledge about the role of environmental and socioeconomic characteristics associated with UGS can help identify underserved areas and social inequalities. Such information can facilitate progress towards the United Nations Sustainable Development Goal 11.7—‘provide universal access to safe, inclusive and accessible, green and public spaces, in particular for women and children, older persons and persons with disabilities’20.

Despite abundant research from the Global North and China, UGS is understudied in Latin America (LA) and other low- and middle-income countries, although this scarcity is changing21,22,23,24,25. A recent review identified high research output from Brazil, Argentina and Chile, although UGS research in these countries has predominantly focused on a few large cities such as São Paulo (Brazil), Curitiba (Brazil), Buenos Aires (Argentina) and Santiago (Chile)26. Few studies have focused on other LA countries26,27, underscoring a knowledge gap within the region.

As one of the most urbanized and biodiverse regions in the world, LA is a unique setting in terms of UGS research needs and opportunities. Over 80% of LA residents are urbanites, making it the second-highest urbanized region in the world after North America28. Latin America has urbanized rapidly since the 1960s, and during this period of fast urbanization, there was a lack of effective policies to manage the swift growth of cities (for example, policies to ensure that high-density areas are strategically located and well planned, offering livable, walkable environments with essential urban services and economic opportunities)26. Without these measures, urbanization has led to dramatic shifts in land use, including the clearing of urban green spaces to accommodate city growth26,27. Still, some LA cities (for example, Curitiba in Brazil) have implemented strategies to increase green areas26. However, in most cities in the region, UGS provision is less prioritized compared with the more basic urban infrastructure projects such as housing and sanitation26,27. Rapid urbanization has also fueled the rise of informal settlements on the outskirts of cities, characterized by widespread poverty, inadequate housing and limited access to basic services such as water, sewage and sanitation29. Residents in these areas often face heightened risks of eviction, disease and violence, and frequently suffer from both a lack of green space and its poor quality29. Finally, LA cities are also extremely biodiverse and located in tropical, temperate and arid climate zones. Therefore, compared with more researched cities in colder climates and higher-income contexts, they have different kinds of vegetation. Because the capacity of UGS to cool air and provide other ecosystem services may depend on the prevailing climate30,31, the ability of UGS to provide ecosystem and health benefits in LA may not necessarily be aligned with what has been observed in the Global North or Asia, where UGS is far more researched than in LA32.

In line with global-scale studies pointing at a socioeconomic gradient in USG17, existing research indicates there is a shortage and unequal distribution of urban green space in LA cities. Namely, neighborhoods, cities and individuals with high socioeconomic status generally have better access to and make more use of green space32,33,34. However, a recent large-scale study of 371 LA cities revealed a negative relationship between greenness and city socioeconomic status22. The full extent of these patterns in LA cities is not known because there has been no research to document them using a large representative sample of LA cities.

The overall goal of this study is to investigate the quantity, spatial distribution and potential factors influencing UGS in Latin America, with a focus on city-level analysis and comparisons across cities. The specific objectives of this study are: (1) to describe UGS quantity and spatial distribution in 371 large LA cities (>100 k residents) in 11 countries, (2) to examine the variability of UGS quantity and spatial distribution across cities’ natural, built and socioeconomic characteristics, and (3) to assess the relative importance of the city’s climatic zone compared with its built environment and social characteristics on UGS. Through the first objective, this research contributes new insights into the UGS availability and distribution in this unique and less-studied world region. The second and third objectives provide an understanding of potential underlying factors that can explain between-city patterns in UGS in LA.

Results

Quantity and spatial distribution of UGS in 371 LA cities

We observed high variability in UGS characteristics across cities (Supplementary Tables 2 and 3). Greenness, measured by satellite-derived normalized difference vegetation index (NDVI), varied from 0.10 to 0.80, with a mean of 0.53. The range of green space coverage in cities was also large, ranging from 4% to 72%, with a mean of 41%. There was on average 123,220 m2 of green space area per 1,000 residents. The average park area per 1,000 residents was 2,142 m2, which is orders of magnitude lower than green space area per capita. Figure 1 describes UGS in three large cities in the region with different levels of UGS quantity.

Fig. 1: Urban green space in large LA cities.
figure 1

a, The median of annual maximum NDVI in 2017 for 371 cities in 11 LA countries obtained from the MODIS NDVI satellite imagery. bd, The distribution of green space in Teresopolis (Brazil) (b), Buenos Aires (Argentina) (c) and Lima (Peru) (d)—their locations in Latin America marked by red circles in a—according to the green space land-cover map from Ju et al.63. Each 30-m pixel of green space in bd is depicted in green; non-green space areas are in white. An extended explanation of the figure is available in Supplementary Material 1.

Variability in green space characteristics by city environments

Indicators of UGS varied across cities’ natural, built and socioeconomic environment (Table 1). Total and per capita amount of green space varied substantially across climate zones, with arid cities reporting lower NDVI, lower percent green space area and lower total green space area per capita than tropical or temperate cities. On average, the percentage of city area covered by green space in arid LA cities was 12–14% lower than in temperate and tropical cities. Arid cities in our sample were represented by hot (n = 55 cities) and cold (n = 24 cities) arid steppes and deserts, located throughout Mexico, Argentina and along the Pacific coast of Peru and Chile. Arid cities were also characterized by higher park availability compared with other climate zones. The green space landscape in arid cities was more fragmented (higher patch density), more isolated and less aggregated than in non-arid cities (Table 1). Several UGS indicators also varied by topography and coastal location, with higher NDVI, greater percent green space area and less fragmented green space observed in hillier cities, although flatter cities had more parks per capita.

Table 1 Description of city green space by city natural characteristics§

As for the built environment, city area was negatively associated with NDVI whereas other UGS characteristics showed little variability by city area (Table 2). Green space area per capita varied by population and population density. Predictably, green space area per capita decreased in more populous and more densely populated cities. In addition, more densely populated cities had less of their area covered by green space and had less park availability in terms of park count and park area per capita than less dense cities. Finally, cities with high intersection density—a measure of street connectivity, which can reflect the accessibility of green space35—had less UGS overall (lower NDVI, lower percent green space area and lower green space area per capita). However, these cities had a higher density of green space patches and greater isolation of these patches but also had more urban parks per capita.

Table 2 Description of city green space by city built environment§

We observed substantial variability in UGS across different characteristics of cities’ socioeconomic environment (Table 3). For the most part, cities with higher levels of socioeconomic well being had more abundant green space than their less socioeconomically advantaged counterparts. However, there appeared to be a nonlinear relationship between NDVI and gross domestic product (GDP) per capita and NDVI and unemployment, wherein cities with both high and low levels of these indicators had higher NDVI compared with the cities with moderate levels of those variables. UGS in socioeconomically advantaged cities also tended to be less fragmented and more clustered; the isolation of green space did not reveal a consistent pattern across cities’ socioeconomic characteristics.

Table 3 Description of city green space by city socioeconomic environment§

In summary, Tables 13 show that the greatest variability in the amount and configuration of green space and the availability of urban parks was observed by climate zone. Arid cities were characterized by lower amounts of green space, more fragmented and less aggregated green space, but higher availability of urban parks. Regarding the built environment, the highest levels of UGS variability were observed across population density and street intersection density. Cities with high levels of these indicators had less green space (measured by the percentage of green space area and green space area per capita). In cities with high street intersection density, the green space also tended to be more fragmented. Finally, cities with desirable socioeconomic environments tended to be greener, as measured by NDVI or percentage of green space area and had less fragmented and more aggregated green space. There was also some evidence of a U-shaped relationship between NDVI and several socioeconomic indicators. These conclusions are supported by the adjusted multivariable associations reported in Supplementary Table 6 and by the sensitivity checks in Supplementary Tables 7 and 8 and Supplementary Fig. 3.

The importance of city characteristics for UGS availability

The random forest regression explained 65% of variance in NDVI (root mean squared error (RMSE) = 0.08), 41% in percent green space area (RMSE = 9%) and 63% in green space area per capita (RMSE = 42,352 m2 per 1,000 persons). These results (Fig. 2) also showed that climate zone accounted for 54%, 43% and 28% of all variable importance in explaining NDVI, % green space area and green space area per capita, respectively. As explained in Methods, variable importance in a random forest is determined by measuring the decrease in model accuracy after randomly permuting the values of each variable. Variables that cause a larger decrease in accuracy when their values are permuted are deemed more important for the model’s predictions36. Other important variables for the quantity-related metrics were street intersection density, education and GDP per capita. The relative importance of climate zone was much smaller for the urban park metrics—17% and 8%, respectively, for the number of parks per capita and park area per capita. Education, population density and street intersection density had the highest importance for the park metrics.

Fig. 2: Relative importance of the natural, built and socioeconomic environment variables for the availability of urban green space and urban parks in LA cities.
figure 2

Bar graphs show the relative importance of the natural, built and socioeconomic environment variables for the availability of urban green space and urban parks in LA cities. Measures of variable importance were obtained from the random forest regressions described in Methods.

Discussion

UGS and cities’ natural environment

Our study revealed considerable heterogeneity in UGS characteristics across 371 cities in 11 LA countries. Climate zone was responsible for the largest differences in green space characteristics compared with other city factors, with arid cities characterized by lower green space amount, more fragmented and less aggregated green space, but higher urban park availability compared with temperate and tropical cities (Tables 13). Climate puts fundamental constraints on plant physiology and choices of the ‘locally appropriate’ vegetation in landscaping. Even though landscaping in cities may often use global horticultural market options (for example, turf grasses, boxwoods), local vegetation adapted to local climate is still likely to be present especially in urban forests that are remnants of historical forests8. Arid vegetation may be naturally sparser due to resource limitations and compact growth habits and less spectrally bright green due to plant investments in protective tissues to reflect excessive solar radiation and deter predators, all of which may impact NDVI37.

Green space in arid cities was, on average, more fragmented and less aggregated than in other climate zones. Therefore, the UGS ecosystem services in LA arid cities may differ from those in non-arid cities, even at similar levels of UGS quantity2. Evidence from the same cities included in our analysis has shown that (1) higher NDVI in arid cities—but not tropical or temperate—may have a small protective effect against heat-related mortality and (2) evenly distributed (as opposed to highly clustered) green spaces in arid cities have little impact on heat-related mortality3. These findings warrant more attention to the regional green space context of LA and suggest considering its climate when evaluating UGS-health associations. Future research may benefit from a more nuanced approach considering ecosystem disservices (allergens from tree pollen38, thermal comfort in the tropics39, perceptions of crime near large parks7 and so on), some of which can be unique to LA due to its climates and high urbanization40.

Despite having the least amount of vegetated green space, arid cities had the highest number of parks per capita (Table 1). It might be that in the absence of naturally available vegetation, arid cities purposefully invest in green urban parks to create recreational spaces. We speculate this may happen because native vegetation of these arid cities, such as shrubs, succulents and cacti, is not as conducive to typical recreational usages, for example, sports activities that require lawns. In other words, even if natural vegetation is widely available, its properties may not be aligned with recreational uses.

Several UGS indicators also varied by topography: hillier cities had more vegetation-based UGS that was also less fragmented, whereas flatter cities had more parks per capita. This can be attributed to the steep terrain in hillier cities, which is often inaccessible and thus undeveloped10. Abundant vegetation in hillier cities, as observed in our study, may provide protection from floods26. This is particularly important in LA cities, where informal developments often encroach on the natural vegetation in hilly outskirts, thereby increasing the risk of natural disasters among vulnerable populations26,40. Future research may benefit from examining intra-city variations in UGS by topography.

UGS and cities’ built environment and urban form

Larger cities by area had less vegetation-based UGS (Table 2). As cities grow, the demand for space increases, so it is more difficult to protect open space and preserve UGS41. Indeed, other work examining how green space quantity scales with city size in Southeast Asia11,16 has found that UGS scales sublinearly, meaning that UGS increases more slowly than city size. Whereas some studies show negative correlations between city size and UGS6,11,16, counter-evidence also exists15. Recent global findings suggest that socioeconomic factors modify this relationship: an increase in city size is associated with more UGS in high-income cities but with less UGS in low-income cities42. A similar analysis of our data (not reported) did not reveal such interactions. The negative relationship between city area and UGS quantity in LA cities is most likely a consequence of rapid urbanization, where green spaces on the city peripheries had to give way to urban sprawl26. As a result, there might be particularly little UGS in LA’s informal settlements, which are located on the cities’ outskirts29. Whereas we did not have data on UGS availability in the informal settlements, future research should explore this in detail to address the environmental equity needs of the vulnerable population residing there.

LA cities have notably high levels of street intersection density compared with other world regions43. Global North studies often link intersection density with desirable health outcomes, such as increased physical activity44 and healthy weight45. However, in LA cities higher intersection density was linked with several environmental and human health risks, such as higher concentrations of particulate matter and increased risk for hypertension46, body mass index and obesity47. Whereas our study did not include health data, future research can benefit from exploring the combined health effects of intersection density and UGS in LA cities.

Cities with higher population and street intersection densities in our sample had fewer and more isolated vegetation-based UGS (Table 2). Street intersection density reflects a city’s connectivity and enhances the accessibility of green spaces35. These results thus suggest that green space availability (as assessed here by UGS metrics) does not always coincide with its accessibility (as indicated by urban intersection density). Whereas these findings may seem somewhat circular (higher population density inevitably decreases the amount of resources per person), when considered alongside the findings on intersection density, they highlight the challenge of achieving rapid urbanization without sacrificing green space availability.

UGS and cities’ socioeconomic environment

Finally, there is substantial variability in UGS by cities’ socioeconomic characteristics. Positive associations between socioeconomic well being and greenness have been observed in LA cities at the neighborhood level48. This gradient was also observed in a recent global city-level study, but LA cities comprised only 24 of the 159 cities included17. However, other studies—both city- and sub-city level—reported opposite associations22,49,50,51. Our findings for NDVI, percent green space area and the per capita metrics (parks and green space area) also suggest a socioeconomic gradient in UGS. However, we also find that cities with both high and low levels of GDP per capita and unemployment are greener (have higher NDVI) compared with cities with moderate levels of those variables, signifying a potential nonlinear relationship. Ju et al.22 showed that higher GDP per capita in the same 371 LA cities was associated with lower NDVI both at city and sub-city levels, but higher GDP per capita was associated with a faster greening rate during 2000–2015. Our results based on a linear regression agree with those of Ju et al.22. These findings and other recent evidence52,53 challenge a common notion that socioeconomically disadvantaged cities have less green space—the so-called deprivation amplification hypothesis22,54.

High amounts of UGS in socioeconomically disadvantaged cities, as found in our study, are not necessarily a positive phenomenon given the ecosystem disservices of poorly maintained green space6. High greenness in high-GDP cities could reflect purposeful investment in urban parks, whereas in lower-GDP cities, high levels or greenness can be attributed to underdeveloped urban areas55. This explanation is partially supported by our findings linking higher urban parks per capita with higher GDP per capita (Table 3). We also find that green space in socioeconomically advantaged cities tends to be less fragmented and more clustered. To the extent that high levels of green space patch density may indicate an equitable green space distribution56, this finding points to potential within-city inequities in UGS distribution.

The importance of city characteristics for UGS availability

Climate zone—and especially being in an arid climate—had the highest relative importance for explaining variability in the two quantity UGS metrics (NDVI and % green space area), compared with the built and socioeconomic environment variables (Fig. 2). The importance of climate for UGS quantity aligns with findings from a recent global study, which showed that precipitation and temperature were the most determinative features for green space quantity, compared with socioeconomic and built environment features17. The paramount importance of climate for UGS quantity (as measured by NDVI, % green space area and green space area per capita), coupled with projected changes in temperature and rainfall variability in LA57, suggests that nature-based solutions to provide cooling and shade may be challenged by these climatic changes. This may be particularly true in hot arid cities in Latin America where the cooling effect of UGS is most needed. Whereas the characteristics of the built and socioeconomic environment matter less than climate for UGS quantity in LA, their relative importance nevertheless suggests that changes to UGS are possible if appropriate greening policies are adopted17.

In contrast to quantity-based UGS metrics, urban park availability was less constrained by climate and potentially more driven by residents’ demand as exemplified by the high relative importance of socioeconomic characteristics (Fig. 2). Therefore, compared with the quantity-based UGS metrics, urban park metrics more directly reflect the outcomes of urban planning and land-use changes. As we mention earlier, we speculate that arid cities may purposefully create parks because their native vegetation, such as shrubs, succulents and cacti, is less suitable for recreational use.

Limitations and strengths

Several limitations of this study should be mentioned. First, our UGS characteristics did not reflect specific vegetation types, which could provide a deeper understanding of potential social and ecological (dis)services. Second, the temporal alignment between the green space and socioeconomic data was not perfect: UGS data were from 2017 and urban parks from 2020, whereas the years of socioeconomic datasets varied by country, with 2005 the earliest year (for Guatemala). A temporal trend in NDVI from 2005–2017 (Supplementary Fig. 4)—which covers a time period longer than the misalignment period for most countries—shows that changes in NDVI were slow (interquartile range of NDVI change between 2005 and 2017 across all the cities is 0.009–0.062, with the mean increase in NDVI of 0.034). Because NDVI is highly correlated with % green space (Supplementary Fig. 1), we assume that changes in the UGS metrics derived from a land-cover map have also been slow. Therefore, the conclusions for NDVI and land-cover map-based UGS metrics should be minimally affected by the temporal misalignment. The misalignment between the time period of park data and socioeconomic variables is a limitation, and future research could investigate the temporal trends in urban park availability in the region. Third, there was a slight spatial mismatch between the scale of the units used for the analysis and the units used for measuring the socioeconomic variables. The latter were available for cities’ official administrative boundaries, whereas the first were assessed for cities’ urban extents. Fourth, the park data may be subject to geographical bias, similar to data derived from other geotagging-based platforms. That bias is probably more substantial for units of analysis smaller than city58 and therefore should not invalidate our results. Finally, the goal of this study was to create a region-wide understanding of green space availability at city level and describe UGS according to cities’ characteristics. Therefore, our findings do not easily generalize to smaller, within-city scales, which would be more appropriate for identifying UGS inequalities and accessibility. Despite these limitations, this study is the first large-scale attempt to describe UGS in LA and how it varies by cities’ natural, built and socioeconomic environments. The study combines data from satellite imagery, land-cover map and web-mining data techniques to provide extensive information on UGS and its potential drivers in 371 LA cities, which is representative of the region and its diverse natural and socioeconomic conditions.

Conclusions and practical implications

Several practical implications can be drawn from this study. Our findings suggest that overall, environmental conditions, such as climate zone, predetermine how much green space may be naturally available in a city. However, other parameters, such as UGS distribution within a city, and particularly urban park availability, are likely to be determined by urban planning policies.

From a planetary health perspective, large discrepancies in green space in the region may imply that cities differ in their capacity to adopt greening as a heat-mitigation and health-promotion strategy. This may be particularly challenging in arid cities where tree planting is especially difficult due to unfavorable climatic conditions. On the other hand, some cities are already very green, and thus further expanding green space may not be feasible. One of these high-green cities (Teresopolis, Brazil; Fig. 1) was found to also have a high heat-related mortality risk59. Such cities will need to develop other heat-mitigation strategies apart from green space expansion, for example, by ensuring equitable access to green space and thereby reducing climate vulnerability of underserved populations. This can be especially beneficial given the already unequal distribution of green space in LA cities32.

From a social and public health perspective, the nonlinear associations between some socioeconomic and UGS variables along with the high prevalence of informal settlements in LA cities call to adopt a more nuanced and context-sensitive approach for greening strategies. This is particularly important given some evidence of skewed perception of UGS benefits and UGS disservices. For example, park proximity was linked to higher and lower perceived social disorder in informal and formal areas in LA cities, respectively23. Therefore, in addition to the equal allocation of UGS, it is important to maintain its quality, especially in socioeconomically disadvantaged areas. An interesting direction for future research would be describing patterns of UGS within LA cities, while comparing formal and informal neighborhoods.

Finally, from a research methodology perspective, our study emphasizes the different kinds of information that can be provided by combining a wide variety of UGS metrics. Whereas vegetation-based metrics (for example, greenness, % green space area) offer insight into planetary health benefits (cooling, air quality, noise reduction), urban park metrics may represent opportunities for physical and social activities, which benefit physical and mental health. These metrics are not interchangeable, and future studies should account for these nuances in their choice and definition of green space metrics and ideally use several metrics reflecting multiple aspects of UGS.

Methods

Study sample and city characteristics

This study is part of SALURBAL (Urban Health in Latin America), a multi-disciplinary, multi-country collaborative project aimed at studying the determinants of urban health disparities in 371 large (at least 100,000 residents) cities in 11 Latin American countries representing more than 247 million people60. We used the entire sample of 371 cities to describe green space in this study. Geographically, the study focused on the urban extent, also known as built-up area22 or urban cluster60, within cities’ administrative boundaries (https://drexel-uhc.github.io/salurbal-city-selection-scrolly/). Cities’ urban extent was delimited as follows. First, the SALURBAL team obtained cities’ administrative boundaries. Within those boundaries, the urban extent was mapped using systematically identified built-up areas from satellite imagery60. Specifically, the urban extent (henceforth referred to as ‘cities’) was determined by applying a spatial clustering algorithm to the 12-m resolution Global Urban Footprint61 dataset, which distinguishes between built-up and non-built-up pixels. The urban extent was determined by classifying the Global Urban Footprint pixels into urban, suburban and peri-urban depending on the share of built-up pixels within an area of 1 km2 (ref. 60). Finally, the urban pixels were merged to represent urban extent—area of contiguous urban built-up within a city’s administrative boundary.

The SALURBAL collaboration also compiled extensive data on cities’ natural, built and socioeconomic environment using a combination of satellite images, census and survey data60. This study used city characteristics described in Extended Data Table 1. Supplementary Tables 13 provide summary statistics and other information on cities’ natural, built and socioeconomic characteristics. The unit of data collection for the green space metrics, natural and built environment was cities’ urban extent. As explained in Quistberg et al.60, population for the urban extent was estimated based on the ratio of built area in the urban extent to the total built area in each sub-city (for example, municipality); populations from each built-up sub-city unit were then added up to the urban extent. The three socioeconomic variables used in the study were available at the level of administrative city boundaries.

Greenness

We used three sources of green space data to derive green space measures. The first measure is area-level greenness represented by NDVI, a spectral index describing the amount and vigor of vegetation. NDVI data were obtained from the 250-m, 16-day NDVI product (MOD13Q1.006) of the Moderate Resolution Imaging Spectroradiometer (MODIS) instrument of the Terra satellite62. For every pixel in a city, we first computed the annual maximum NDVI value to capture peak greenness. Then, we took a spatial median of that set (across the city). Water bodies were excluded to avoid downward bias in calculated NDVI values. NDVI captures overall UGS conditions and is often used as a proxy for green space, but it does not distinguish among UGS types such as forests, parks and pastures, for example, nor does it directly measure the spatial configurations of UGS.

Green space quantity and configuration

Another group of metrics describe green space as a land-cover type and provide information on its spatial arrangement in space. These metrics were obtained from a land-cover map of green space in three consecutive steps. First, a green space land-cover map was created based on the 10-m resolution Sentinel-2 satellite images from 2017. More details on the green space map can be found in Ju et al.63 Briefly, green vegetated areas were identified by means of a supervised image classification using a single, median composite of Sentinel-2 Top of Atmosphere (TOA) reflectance images in 2017 as input, with clouds and water removed. The classification was done separately for every city to account for between-city differences in biome and urban form63. Using randomly collected validation samples in 11 randomly selected cities, the resulting green space map achieved an overall accuracy between 0.80 and 0.87 in arid cities (n = 3), between 0.76 and 0.89 in temperate cities (n = 3) and between 0.87 and 0.96 in tropical cities (n = 5)63. The result was a binary map with every 10-m pixel classified as green space or not green space. The urban green space identified by the map included such green land-cover types as green grass, shrubs, forests and farmland63. Second, using the map, green space patches or contiguous areas of greenery, were defined using the Moore neighborhood rule: a green space pixel (10 by 10 m grid cell) and the eight green space pixels around it were aggregated into the same green patch. To reduce computational intensity, the green patch map was then resampled to 30 m to compute the landscape metrics depicting green space spatial configuration. Finally, using the map of green patches, a series of landscape characteristics of green space were calculated, including percent green space in a city (based on total green space area), density of green patches, clumpiness of green patches and the mean nearest neighbor distance between green patches, among others (Extended Data Table 2). These measures are widely used in landscape ecology to describe the quantity, fragmentation, aggregation/clustering and isolation of green space, respectively64. We also computed a measure of per capita green space availability by dividing the total city green space area by city population.

Urban parks

In this study, urban parks were defined as delimited urban open spaces predominantly covered with vegetation. They are accessible and free to the public and are used for recreation, leisure and community gatherings. Urban parks were mapped using a combination of web data mining and remote sensing by Pina, Matos and Skaba (co-authors on this paper), as described in more detail in Supplementary Material 1.

The following criteria had to be met for a location to be identified as an urban park:

  1. 1.

    Be on the list of key terms suggested by local experts (Supplementary Material 1).

  1. 2.

    Among those key terms, a location would have to be categorized as ‘park’ on Google Cloud collaborators database (Google has a category attached to every geotagged item).

This resulted in Google park-categorized locations representing parks, reserves, sports fields, riverside areas (for example, streams and riverbanks), green trails, community gardens and nature preservation areas. The results were cleaned up to remove, for example, locations that had the word ‘park’ in them but were not in fact parks (for example, shopping centers with the word park in their name). Zoos, golf courses and green walls were excluded as they did not fit the definition of urban parks (free/accessible to the public). Cemeteries were excluded because they are primarily places for burial and are not used for recreation.

To ensure that these urban parks were indeed green and in cities, the following two steps were applied:

  1. 3.

    The NDVI data were used to find green areas within a 900-m buffer around the locations in step 2.

  1. 4.

    The street network from OpenStreetMap was used to delimit the boundary around the green areas—the boundary of green parks. If there was no street network around a prospective park location, it was not considered an urban park, as the absence of streets indicates it is outside an urban area.

The final classification represented a map of green urban parks in 371 cities in Latin America and was used as input to compute various metrics describing urban parks. The data were validated using official municipality-level data on parks in 77 cities, showing high agreement between the classification results and the official data. We used the number of parks per capita and park area per capita as the metrics for urban parks.

Statistical analysis

We first present a description of UGS in three example cities with low, moderate and high green space availability (Fig. 1). We then describe UGS quantity, its spatial configuration and urban park measures by categories of cities’ natural, built and socioeconomic environment characteristics. The categories for the cities’ characteristics were defined based on tertiles except for those describing natural environments, which were based on a climate classification, terrain steepness and coastal vs inland location. For the descriptive analysis, we used the ANOVA test of differences between group means. A post-hoc multi-comparison Tukey’s honestly significant difference (HSD) test was performed on the ANOVA to identify which subgroup means showed statistically significant differences65. The P value from Tukey’s HSD accounts for multiple comparisons within the categories of each city indicator and determines whether there is a statistically significant difference in the means of each green space variable across the categories of a city indicator.

To supplement the comparison of the means and assess the relative importance of city variables on UGS, we used random forest (RF) regressions. A random forest regression is a non-parametric method representing an ensemble of regression trees. An advantage of the RF method for analyzing UGS in relation to city-level characteristics is that the RF does not impose a predetermined form (for example, linear) on the relationships of interest. Trees in a RF are constructed by recursively splitting the data at various points based on the values of independent variables, with a random selection of independent variables considered at each split36. The splits aim to reduce the variability of the response variable within each leaf, and the model predicts the average value of a dependent variable within each leaf. The final prediction from a random forest regression is the average prediction over all trees. To run a RF regression, a set of hyperparameters, or parameter settings, needs to be specified. We performed a grid search to identify the optimal number of variables randomly sampled as candidates at each split, as this hyperparameter has been shown to exert the highest influence on predictive accuracy. We present results of the RF regression for the quantity metrics and urban parks and not for the spatial configuration metrics as the RF explained at most 26% of the variance for the configuration metrics.

From RF output we first utilized the measures of variable importance, which describe the magnitude of influence of each independent variable on the outcome, while holding the other variables constant. Second, we complemented the comparison of the means with visualizations from the partial dependence plots. These plots depict associations between each city characteristic and the measure of UGS, accounting for all the other covariates. Each point on a plot represents the average prediction made by the model for the value of a particular independent variable when all the other independent variables are controlled for.

Finally, as a sensitivity check, we reported adjusted multivariable associations from linear regressions between each UGS indicator and a set of city-level natural (climate zone, terrain, coastal location), built (population, population density, intersection density) and socioeconomic (GDP per capita, educational, unemployment) characteristics. We included these covariates after inspecting correlations among them (Supplementary Table 4). We did not include city area as it exhibited moderate correlations with the rest of the city-level predictors (the cut-off correlation coefficient was 0.6). For example, city area was strongly correlated with population (r = 0.96). The covariates were modeled in categories based on natural groupings (such as types of climate zone or coastal/inland location) or tertiles, similar to the descriptive analysis above. We also re-estimated the multivariable models with the same set of covariates as described in the previous paragraph and in addition included a random intercept for each country (cities nested within countries). Finally, we estimated linear regressions for each green space variable in relation to the continuous built and socioeconomic environment variables (as opposed to categorical as presented in the main results) and estimated interactions between climate zone, socioeconomic and built environment variables.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.