Introduction

China has set the goal of reaching peak carbon emissions by 2030 and achieving carbon neutrality by 2060. However, carbon emissions from the transportation sector account for 10% of the national total and are increasing rapidly, making them the third-largest source of greenhouse gas emissions1,2. Compared to the industrial and building sector, the progress in reaching the carbon peak in transportation lags behind, and the challenges of reducing emissions are more pronounced3. High-frequency, regular commuting behavior exacerbates congestion and carbon emissions, making it a critical area for breakthroughs in low-carbon transformation4. The built environment, through its spatial structure, land use layout, and transportation infrastructure, influences residents’ travel modes, distances, and frequency, becoming a key factor driving transportation carbon emissions5. In commuting research, big data can be employed to reveal commuting characteristics such as travel modes and distances6, reflecting commuting flows across cities, streets, communities, and grids7. Thus, analyzing the relationship between commuting carbon emissions and the built environment using multi-source big data is a crucial scientific approach for uncovering the driving factors of commuting emissions and facilitating precise emission reductions.

Early research on commuting carbon emissions was grounded in New Urbanism theory, which highlights population density, mixed land use, and pedestrian-friendly design as strategies to reduce dependence on motor vehicles8,9. The “5D” dimensions have been widely employed to quantify the influence of the built environment on travel patterns. Existing studies have primarily focused on macro-scale indicators such as population density and road networks10,11, with limited attention given to the potential effects of the street-level microenvironment on travel behavior and route choices. Research has largely relied on statistical yearbooks or survey data12, although mobile signaling and GPS trajectory data are increasingly being utilized13. However, integrating multi-source data still faces challenges related to standardization and data interoperability14. Existing models predominantly rely on linear regression or structural equation modeling, assuming spatial homogeneity in the built environment’s impact on carbon emissions, while overlooking the moderating role of local geographic contexts15,16.

Big data technology offers high-resolution analytical tools for urban studies17. Navigation data can accurately extract commuting origin–destination (OD) pairs and route choice preferences18, while street view images, analyzed through deep learning, can automatically extract street-level physical characteristics19. However, scholars have found that data such as street view images, points of interest (POI), and housing prices exhibit heterogeneity, and existing research often relies on single data sources in isolation, lacking cross-modal correlation analysis20. Ensemble learning models, such as XGBoost, are widely used to improve prediction accuracy, but their “black-box” nature can lead to feature importance rankings influenced by multicollinearity or overfitting21. Some researchers have sought to enhance interpretability through SHAP values, but their application in commuting carbon emissions research remains limited22.

Existing case studies primarily focus on megacities such as Beijing and Chengdu, and the applicability of their conclusions to medium-sized cities remains questionable23. Medium-sized cities are in a transitional phase from “single-center expansion” to “multi-center clusters24,” with suburbanization and residence-work separation coexisting25. These cities often implement a “spatial structure optimization-transportation system upgrading” collaborative strategy to avoid high-carbon lock-in effects26. However, there is currently a lack of detailed carbon emission driver maps for these cities, leading to a “one-size-fits-all” approach in policy design27.

The current research faces several limitations regarding data sources, particularly the lack of a comprehensive and multi-dimensional built environment indicator system. Additionally, there are technical challenges exist in data integration and spatial modeling, and empirical studies on low-carbon pathways in medium-sized cities remain limited. Therefore, the focus of this study is as follows: (1) Using multi-source data, such as commuting trajectory big data and semantic segmentation street view images, this research seeks to address the limitations of one-sidedness associated with traditional single-source data. (2) In response to the limitations of existing built environment indicators, the study incorporates socio-economic factors, street view perceptions, and urban land use conditions, providing a more detailed depiction of multi-dimensional built environment indicators. (3) Through stepwise variable selection, the study identifies key indicators influencing commuting carbon emissions at residential and workplace areas, and analyzes the spatial heterogeneity of these factors using geographically weighted regression models. (4) Using geographic detectors and the XGBoost framework, the study explores the interaction effects and importance rankings of these indicators, aiming to offer tailored policies and recommendations for low-carbon development in medium-sized cities.

Data and methods

Research framework

The study begins by extracting commuting characteristics such as commuting modes, commuting time, and commuting distance from Baidu commuting trajectory data. It then calculates the total and per capita commuting carbon emissions at street level. By extracting multiple data indicators, a built environment research framework is established. The study proceeds by summarizing the spatial correlation patterns of commuting data and the spatial distribution of commuting carbon emissions, applying spatial autocorrelation analysis methods to explore the spatial clustering features of commuting carbon emission indicators. Stepwise linear regression is used to select built environment variables, and a geographically weighted regression model is developed to analyze the spatial differential impacts of the built environment. Finally, geographic detectors are applied to investigate the interaction effects of the selected variables on commuting carbon emissions, and XGBoost is used to visualize the importance rankings of influencing factors, providing quantitative evidence for policy formulation (Fig. 1).

Fig. 1
Fig. 1
Full size image

Research framework.

The fishnet approach discretizes continuous space into regular grids, eliminating spatial statistical biases caused by irregular administrative boundaries and accurately capturing spatial differentiation28. Floor area ratio, defined as the ratio of total building area to land area, reflects the intensity of land development. Geometric calculations ensure precision, avoiding indirect estimation errors29. Dijkstra’s algorithm iteratively expands from the origin to the nearest unvisited node, utilizing a priority queue to ensure the optimal path, making it suitable for simulating commuting route choices30. XPath, through path expressions, precisely locates target nodes in HTML/XML documents, providing a structured, automated solution for extracting housing price information31. Gong Peng et al.32 broke through the limitations of single remote sensing classification by leveraging data complementarity, providing reliable basic data for urban land use in construction areas. The fishnet analysis, computational geometry, Dijkstra’s algorithm, and other methods are general paradigms in spatial analysis, requiring only the replacement of basic data to reproduce results. XPath parsing logic adapts to different website structures, and ResNet50 can be replaced with other CNN architectures to accommodate computational resource constraints.

Research area and data

Research area

Jinan, the capital of Shandong Province and a key city in the Yellow River Basin, leverages its position as a transportation hub within the “Beijing-Shanghai Corridor” to shape its commuting patterns, which are characterized by “high mobility and multi-center work-residence separation33.” As of August 2024, the city’s total number of private vehicles has reached 4.04 million, including approximately 3.47 million cars and 223,000 new energy vehicles. The commuting structure reflects a complex coexistence of “high-carbon” and “low-carbon” components34,35. The city’s transportation network is based on a “three-ring, twelve-radial” high-speed road framework, with peak congestion on the main roads exceeding 90%, and a distinct tidal commuting pattern. The density of the rail transit network is relatively low compared to cities of similar size, and bus coverage in peripheral areas remains insufficient36,37,38.

This study focuses on the central urban area of Jinan City (Jinan Urban Master Plan 2016–2035). The total area is approximately 720 square kilometers, with a population of 4.35 million (Fig. 2). The map is based on the standard map (Lu SG(2023)026) downloaded from the Standard Map Service website of the Ministry of Natural Resources of China, without modifications to the base map.

Fig. 2
Fig. 2
Full size image

Study area.

Data

Commuter trajectory data

In this study, users’ commuting patterns were mined based on long-term time-series user behavior data, and path planning algorithms were used to restore possible commuting paths in the context of known places of employment, residence and commuting patterns. Combining the static origin and destination (OD) and trajectory recovery data, the travel segment is identified by spatial density clustering and other methods, the travel chain is extracted, and the commuter distribution and commuter behavior characteristics are obtained by analyzing the segmented travel characteristics. The specific methodological framework is shown in Fig. 3.The study recovered 3.52 million commuting trajectories for 4.92 million people in November 2024. Attributes such as OD trajectory point coordinates, travel time and travel mode contained in the commuting trajectory data can effectively support the calculation of subsequent commuting carbon emissions39. Appendix Fig. 1 and Appendix Fig. 2 illustrate commuting modes and commuting time data within the study area.

Fig. 3
Fig. 3
Full size image

Commuter trajectory extraction diagram.

Street view perception data

To obtain the required data, the Baidu Maps API (https://map.baidu.com/) was used to capture street view images along the road network of Jinan City. The selected points were distributed across different regions of the city, as illustrated in Fig. 4. At each point, four street view images were collected from different angles; however, a small number of locations lacked 360-degree image coverage. In total, 154,020 street view images were collected for the extraction of streetscape elements.

Fig. 4
Fig. 4
Full size image

Spatial distribution of shooting points for requesting street view images.

Figure 5 presents the convolutional architecture of ResNet50, which processes input images through multiple stages of downsampling to generate feature maps at different scales. These feature maps are subsequently integrated to construct a comprehensive representation for segmentation40,41. The downsampling operation in ResNet50 is implemented using convolutional layers with a stride of 2 and 3 × 3 kernel size, ensuring the alignment of scales across different feature maps. Bilinear interpolation is employed for both the downscaling and upscaling of feature map dimensions, followed by a 1 × 1 convolutional layer to fuse the final feature map. The ResNet50 architecture effectively combines feature maps from various scales at each stage of scale transition, thereby preserving and enhancing multiscale image features, ultimately optimizing the retention of detailed image information across multiple resolutions42.

Fig. 5
Fig. 5
Full size image

Simplified ResNet50 architecture with a depiction of building blocks.

Drawing on references43,44, nine street–view element indicators were chosen to depict the built environment (Fig. 6). These indicators can be classified into two types: objective visual perceptions and subjective emotional perceptions.The three visual indicators of openness, greenness and traffic signs are calculated directly from the components of different features. Table 1 presents the calculation formulas and concise descriptions of these visual perception indicators. The other six indicators represent emotional perceptions, such as beauty, boredom, depression, safety, liveliness, and wealth. These are obtained by establishing a rating model based on machine-learning algorithms. The features employed to train this model are also extracted from various ground-object components45. For the development of the rating model, an emotional-perception dataset from46 was employed (Fig. 7).

Fig. 6
Fig. 6
Full size image

Streetscape elements distribution.

Table 1 Formulas and introductions for visual perceptions.
Fig. 7
Fig. 7
Full size image

Examples of semantic segmentation.

Built environment data

This study referred to the “5D” dimension of the built environment47, and added socio-economic attributes48, urban land use level49 and streetscape elements into the data set.

The built environment data set was constructed from seven aspects: density, points of interest richness, design, destination proximity, socioeconomic attributes, urban land use and street view perception. For details, see Table 2. Appendix Fig. 3 illustrates the built environment data within the study area.

Table 2 Built environment data.

The multi-source data used in this study include population data, facility point of interest (POI) data, urban road network data, building profile data, urban housing price data, and urban land use data of Jinan City, as detailed in Table 3.

Table 3 Data source.

Gong Peng et al.32 conducted China’s land use mapping by comprehensively utilizing remote sensing data, night light data, Internet location data, and point of interest data. In this study, the original data are simplified into five categories of first-level land classification, including residential, commercial, industrial, public service facilities, and transportation land (Fig. 8). The Maximum Patch Proportion, Spread Degree, Shannon Diversity, and Degree of Agglomeration were employed to calculate the land use characteristics in the study area. As presented in Table 3, the maximum patch proportion represents the percentage of the largest single patch in the total area of the study area. The spread degree indicates the spatial diffusion or aggregation of land use patch types, while Shannon diversity measures the diversity and evenness of patch types. The degree of clustering quantifies the spatial clustering of similar patch types.

Fig. 8
Fig. 8
Full size image

Urban land use distribution map.

Research methods

Calculation method of commuting carbon emission

To accurately estimate commuting carbon emissions in the main urban area of Jinan, this study employs a “bottom-up” approach. By multiplying the carbon emission factors of each commuting mode by the corresponding commuting distance, individual daily commuting carbon emissions are calculated. Based on commuting trajectory OD data, the total commuting carbon emissions at the street level are calculated using residential locations (O) and workplaces (D) as spatial units. This approach provides a clearer understanding of the differential impacts of the built environment at residential and workplace locations on commuting carbon emissions, facilitating the precise identification of emission sources50. Commuter travel emissions are calculated based on commuting mode and distance, as expressed in the specific formula provided in Eq. (1). The calculation for daily commuter carbon emissions is as follows:

$$T_{{\text{n}}} = \sum\nolimits_{{{\text{i}} = 1}}^{{\text{n}}} {D_{{\text{i}}} } \times {\text{K}}_{{\text{i}}}$$
(1)

where: Tn is the total carbon emission of a commuter in a single day (g); i is a certain mode of transportation; Di is the total distance (km) traveled by the mode of transportation i; Ki is the (g/ person ·km). The transportation modes in this study are divided into three categories: driving, public transportation and green commuting. The traffic travel distance is calculated by track OD points and path data in path planning.

In this study, the carbon emission factor values of different transportation modes were determined according to the Set of Greenhouse Gas Emission Coefficients of Chinese Products throughout their life Cycle (2022) released by the Ministry of Ecology and Environment in 2022, and the relevant data of Shandong Province were verified. The carbon emission factors of different commuting modes used in this study are shown in Table 4.

Table 4 Carbon emission factor coefficients of different traffic.

In parallel, considering the rapid adoption of new energy vehicles, the study obtained ownership data for both traditional fuel vehicles and new energy vehicles by referring to the Statistical Bulletin of National Economic and Social Development of Jinan City. The study further estimated the level of carbon emissions generated by private cars during commuting by incorporating the proportion of vehicle types.

As of November 2024, Jinan has opened and operated three subway lines, covering a total distance of approximately 96.335 km with 46 stations. Given the relatively small number of subway lines and the short operation period of the subway system, fluctuations in subway operational efficiency has not had a significant impact on the overall transportation system. As the study aims to analyze the characteristics of residents’ commuting carbon emissions, factors such as ride-sharing car commuting, temporal variations in traffic carbon emissions, vehicle type differences, and indirect carbon emissions are not considered in the calculation process.

Spearman correlation test

To verify the accuracy of commuter carbon emissions calculated using the top-down method, 395,400 signaling data points provided by mobile phone operators (MPSD) in November 2024 were used as an additional data source to capture travel patterns in the main urban area of Jinan City. Spearman correlation analysis was employed to assess the relationship between the top-down estimates values and those derived from signaling data. Spearman correlation analysis is widely used to measure the monotonic association between variables, with a higher correlation coefficient indicating a stronger correlation51. It is also suitable for analyzing variables that are not normally distributed. As shown in Table 5, the correlation coefficients of commuting data obtained from different data sources within the same category exceeds 0.8, demonstrating the effectiveness of the commuting carbon emissions calculation method.

Table 5 Commuter carbon emissions testing.

Spatial autocorrelation analysis

In this study, ArcGIS was used to correlate OD points with street attributes to calculate the street-scale commuting carbon emission index. GeoDa 1.18 was used to analyze the Moran index and hotspots of commuter carbon emissions and travel modes in the main urban area of Jinan City, obtaining the spatial distribution characteristics of commuter carbon emissions and travel modes52. The calculation formula is as follows53:

$$Moran's{\text{ }}I = \frac{{\sum\nolimits_{{i = 1}}^{n} {\sum\nolimits_{{j = 1}}^{n} {W_{{ij}} \left( {X_{i} - \bar{X}} \right)\left( {X_{j} - \bar{X}} \right)} } }}{{S^{2} \sum\nolimits_{{i = 1}}^{n} {\sum\nolimits_{{j = 1}}^{n} {W_{{ij}} } } }}$$
(2)
$$G{\text{i}}^{*} = \frac{{\sum\nolimits_{{{\text{j}} = 1}}^{N} {Wij \bullet X_{j} - \overline{X}\sum\nolimits_{j = 1}^{N} {Wi_{j} } } }}{{\sqrt {\frac{{\sum\limits_{j = 1}^{N} {X_{j}^{2} } }}{N} - \left( {\overline{X}} \right)^{2} } \bullet \sqrt {\frac{{N\sum\limits_{j = 1}^{N} {W_{ij}^{2} - \left( {\sum\limits_{j = 1}^{N} {W_{ij} } } \right)} }}{N - 1}} }},\forall j \ne i$$
(3)

In the Eqs. (2) and (3), N is the number of streets. Xi, Xj are the carbon emissions of the street. X is the average carbon emission of X, and Wij represents the spatial weight matrix between street i and street j.

The value range of Moran’s I is [− 1, 1]. Moran’s I > 0 indicates spatial positive correlation, where the research objects exhibit spatial clustering. When Moran’s I < 0, it indicates spatial negative correlation, and the object of study is spatially discrete. When Moran’s I = 0, the study objects are randomly distributed.

Stepwise linear regression

In order to further analyze the relationship between urban built environment factors and commuting carbon emissions, this study adopted the stepwise regression method to screen indicators, and established a linear regression model using the screened indicators. The calculation formula is as follows54:

$$Y = \beta_{0} + \beta_{1} {\text{x}}_{1} + \beta_{2} {\text{x}}_{{2}} + \ldots \beta_{{\text{i}}} {\text{x}}_{{\text{i}}} + \varepsilon$$
(4)

β0 is a constant coefficient; β(1,2,… i) is the independent variable coefficient; ε is the residual; Suppose that H0: β(1,2,… i) = 0, and the collected samples are used to test the significance of F statistic (or t statistic) for H0.

$$F = \frac{{\sum\limits_{{{\text{i}} = 1}}^{{\text{n}}} {{{\left( {\hat{y}_{i} - \overline{y}} \right)^{2} } \mathord{\left/ {\vphantom {{\left( {\hat{y}_{i} - \overline{y}} \right)^{2} } m}} \right. \kern-0pt} m}} }}{{\sum\limits_{i = 1}^{n} {{{\left( {{\hat{\text{y}}}_{i} - \overline{y}} \right)^{2} } \mathord{\left/ {\vphantom {{\left( {{\hat{\text{y}}}_{i} - \overline{y}} \right)^{2} } {\left( {n - m - 1} \right)}}} \right. \kern-0pt} {\left( {n - m - 1} \right)}}} }}$$
(5)

For a given confidence α (usually 95%), the statistic F should be:

$${\text{p}}\left( {F \le F_{\alpha } \left( {{\text{m,n}} - {\text{m}} - 1} \right)} \right) \le 1 - \alpha$$
(6)

If formula (6) is not valid, then the hypothesis is considered to be invalid and the regression equation is considered significant.

Geographically weighted regression

This study used geographically weighted regression (GWR) to reveal the spatial non-stationarity and spatial heterogeneity of influencing factors for commuter carbon emissions55. The model was as follows:

$${\text{y}}_{{\text{i}}} = \beta_{0} \left( {{\text{u}}_{{\text{i}}} ,v_{i} } \right) + \sum\nolimits_{i = 1}^{k} {\beta_{i} } \left( {u_{i} ,v_{i} } \right)x_{ik} + \varepsilon_{i}$$
(7)

where: yi is the dependent variable, (ui,vi) is the geographic coordinates of the i-th sample space unit, the parameter βi varies with location, xik is the independent variable, and εi is the random error term. In this study, the Gaussian function method was selected and Akaike information criterion (AICc) was used as the selection criterion for the model. Model diagnostic tests is used to evaluate the prediction accuracy and stability of the model, and its calculation formula is as follows:

$$\omega_{{{\text{ij}}}} = {\text{exp}}\left[ { - {{{\text{(d}}_{{{\text{ij}}}} } \mathord{\left/ {\vphantom {{{\text{(d}}_{{{\text{ij}}}} } {\text{b}}}} \right. \kern-0pt} {\text{b}}}{)}^{{2}} } \right]$$
(8)

where, ωij is the weight of the variable of sample space unit j relative to street i, di is the distance between two points, and bandwidth b is a non-negative attenuation parameter describing the functional relationship between weight and distance.

Geographic detector model

The geographic detector model (GDM) provides a convenient way to examine the mechanisms of commuting carbon emissions. The basic idea of GDM is that when one or more variables have a significant impact on commuting carbon emissions, the independent variables should have a spatial pattern similar to that of commuting carbon emissions. The advantage of GDM is that it can assess the interactive effects of two factors on commuting carbon emissions56. The calculation formula is as follows:

$${\text{q}}_{x,y} = 1 - \frac{{\sum\limits_{i = 1}^{m} {N_{X,i} \sigma_{{Y_{X,i} }}^{2} } }}{{N\sigma_{Y}^{2} }}\left( {i = 1,2, \ldots m} \right)$$
(9)

where: i represents the classification of factor X, Nx,i and N denote the number of units in layer i and the total number of units in the entire area, respectively.σ2YX,i and σY2 correspond to the discrete variance of Y values within layer i and across the whole region, respectively. qx,y represents the explanatory power of factor X on variable Y, where the value of q ranges from [0,1]. A value of q closer to 1 indicates a stronger explanatory power of factor X on variable Y.

XGBOOST algorithm

To enhance the interpretability of the selected variables, the XGBoost framework was used to analyze the importance of the influencing factors. This gradient-boosted decision tree ensemble offers improved computational efficiency and predictive performance compared to traditional gradient boosting methods57.

$$O{\text{bj}} = \sum\limits_{{{\text{i}} = 1}}^{{\text{n}}} {L\left( {y_{i} ,\hat{y}_{i} } \right)} + \sum\limits_{k = 1}^{K} {\Omega \left( {f_{k} } \right)}$$
(10)

where, L is the loss function, which is used to measure the error between the predicted value \(\hat{y}_{i}\) and the true value yi, and \(\Omega\) is the regularization term, which is used to control the complexity of the model and avoid overfitting.

Results

Analysis of commuting carbon emission characteristics

Analysis of commuter connection characteristics

From the spatial distribution characteristics of commuter connections (Fig. 9), the commuter flow of each street presents a single-center structure with “commuter center” as the core, and also shows a tendency to develop into a multi-center structure. As shown in the figure, the spatial visualization of commuting flow between streets with more than 500 commuters reveals the presence of multiple high-value flow areas across streets in the main urban area, which are densely distributed and have frequent commuting between workplaces and residences. The network density is high, making these areas significant employment agglomeration centers.The eastern high-tech Development Zone, the Lingang Economic Development Zone, the new materials Industry Park and other areas also maintain a high commuting links with the main city and its surrounding streets, forming a secondary employment centers.

Fig. 9
Fig. 9
Full size image

Street to street commuter connections. The figure (N1-N50) indicates that the total number of street commutes ranks between 1 and 50.The map shows the top 10 streets for commutes.

The proportion of car commuting in the central urban area is low, such as in Beitan Street, Guanzhaying Street and Dongguan Street, while the proportion of car commuting in the peripheral areas is high, such as Daqiao Town and Cuizhai Town, and the proportion of car commuting shows a gradual increasing trend from the center to the periphery.The proportion of bus commuting is higher in the northeast than in the southwest, and the proportion of buses at the end streets of Tangye, Baiyun Lake and Huanghe Town is higher. Overall, the proportion of green commuting is higher in the southwest than in the northeast. The main reason is that the southwest is limited by the mountainous landscapes and land use patterns, resulting in lower traffic network density and bus accessibility compared to the northeast, which contributes to a higher proportion of green commuting (Fig. 10).

Fig. 10
Fig. 10
Full size image

Commuting mode distribution.

Spatial distribution characteristics of commuting carbon emissions

Based on the total carbon emissions of residential streets (Fig. 11a), residential streets in the main urban area of Jinan City exhibit unbalanced spatial distribution of carbon emissions. The densely populated areas near the three-ring road, such as Beiyuan Street, Yaoshan Street and Licheng Economic Development Zone, have higher carbon emissions.The residential and commuting carbon emissions of central urban areas and peripheral streets are relatively low.Streets dominated by employment activities have higher carbon emissions (Fig. 11b), with typical examples including Duandian Town, Ping An Street, and Jibei Economic Development Zone. In contrast, streets with low carbon emissions are primarily concentrated in central districts and peripheral areas with lower employment intensity.

Fig. 11
Fig. 11
Full size image

Carbon emission distribution.

The per capita carbon emission of residential areas show a pattern of lower levels in central urban areas and higher levels in peripheral areas (Fig. 11c).Streets with relatively high per—capita carbon emissions are mainly located in Wufengshan Street in Changqing District, Zhangqiu Economic Development Zone, and new towns in Jiyang County, all of which are part of new town clusters. Areas with relatively high per—capita carbon emissions are concentrated in the northern sector of the city (Fig. 11d), such as Daqiao Town, Huihe Town in Tianqiao District and Shanghe Economic Development Zone, which mainly focus on the processing industry.

Spatial autocorrelation analysis of commuting carbon emissions

Spatial autocorrelation analysis acts as a bridge between the description of phenomena and the explanation of underlying mechanisms, enhancing the understanding of the spatial distribution of commuting carbon emissions. The clustering map of residential commuting carbon emissions reveals that the inner ring areas of the central zone form a “low-low” (LL) cluster, while streets adjacent to the core area form a “high-low” (HL) cluster. Areas such as Wang She Ren-Tang Ye Street exhibit a “high-high” (HH) clustering pattern (Fig. 12a,b). The central urban area is characterized by relatively short commuting distances and higher public transportation usage. High-value clustering areas are residential areas located closer to the city center, with higher population density and a higher number of long-distance commuters.

Fig. 12
Fig. 12
Full size image

Spatial autocorrelation analysis of commuting carbon emissions.

In the clustering map of workplace commuting carbon emissions, a large “low-low” (LL) cluster is observed within the third ring road of the central urban area, while the “high-high” (HH) clusters are predominantly concentrated in the eastern and northern new town areas (Fig. 12c,d). In Jinan’s central urban area, a significant proportion of commuters live near their workplaces, a better work-residence balance, and as a result, the total carbon emissions from workplaces are relatively low. Streets such as Shunhua Road and Tangye Street, dominated by high-tech industries, attract a large number of workers and have a considerable impact on surrounding areas, such as Gangou Street and Caishi Town, leading to a high density of long-distance commuters. Consequently, these areas exhibit high workplace commuting carbon emissions.

Stepwise linear regression analysis of commuting carbon emissions

Gradual linear regression of residential carbon emissions

According to the coefficients from the stepwise regression analysis (Table 6), the regression coefficients for road network density, the number of subway stations, the proportion of land used for transportation facilities, and the number of commercial and residential units are all positive. This suggests that an increase in road network density, the proportion of land allocated to transportation facilities, the number of subway stations, and the number of commercial and residential units to an increase in residential carbon emissions. The coefficients for industrial land use and tourist attractions are negative. Areas with a higher proportion of industrial land typically exhibit a concentration of employment, leading to shorter work-residence distances. Tourist demand often stimulates the development of public transportation or non-motorized transport systems, thereby reducing carbon emissions.

Table 6 Gradual linear regression of total carbon emissions from residential areas.

The regression results indicate that certain perceived elements extracted from streetscapes significantly influence per capita commuting carbon emissions (Table 7). Greenness and Safety demonstrate a significant negative effect on per capita commuting carbon emissions at residential locations. Residential areas with higher greenness generally offer better walking and cycling environments, as well as improved accessibility to public transportation, thereby encourages residents to choose low-carbon travel modes. Streets with higher safety are often equipped with adequate lighting systems, surveillance facilities, and well-designed vehicle–pedestrian separation, all of which promote the use of non-motorized transportation modes.

Table 7 Gradual linear regression of carbon emissions per inhabitant.

Gradual linear regression of workplace carbon emissions

According to the partial regression coefficient Beta in the model results (Table 8), the regression coefficients of the number of companies, the length of the road network, the average price of housing, the area of residential land, the number of subway stations and the area of public service facilities are all positive. This indicates that larger areas of residential, commercial, and industrial land and a greater number of transportation stations are associated with higher total workplace carbon emissions.

Table 8 Gradual linear regression of total workplace carbon emissions.

Based on the standardized coefficients from the regression results, the factors influencing per capita workplace carbon emissions are ranked as follows: Floor Area Ratio > Accessibility > Road Network Density > Number of Tourist Attractions > Openness > Depression (Table 9). Among these, the Floor Area Ratio has the greatest impact, with a more pronounced effect than Accessibility and Road Network Density, while the influence of Depression is the weakest.

Table 9 Gradual linear regression of carbon emissions per capita at work site.

Geographical weighted regression analysis of commuting carbon emissions

Geographical weighted regression of residential carbon emissions

The regression coefficients for the area of land used for transportation facilities and tourist attractions exhibited a gradient increase from the southern to the northern region, with the southern part of the main urban area was mainly negative, while the northern part was mainly positive. The regression coefficients of Longdong-Ganggou and Lingang Economic Development Zone are the largest, indicating that transportation facilities significantly promote residential population agglomeration in these areas (Fig. 13b). Phuket—Guanzhuang, Wenzu town and other areas are close to Zhangqiu Economic Development Zone, have tourism resource endowments comparable to. Residents are more likely to choose to visit nearby, so the regression coefficient is relatively low or negative (Fig. 13a).

Fig. 13
Fig. 13
Full size image

Regression coefficient of GWR model of total carbon emissions from residential areas.

For residential areas, Cao Fan Town is situated in an urban–rural fringe area with poor security and inadequate lighting conditions. In industrial areas such as Sun Geng Street, high pollution risks and low perceived safety lead to increased private car use. In contrast, newly developed communities in areas like Zhi Yuan-Yaojia Street, with well-planned infrastructure, have lower safety risks, making residents more inclined to choose non-motorized or green transportation modes (Fig. 14a). Additionally, the regression coefficient for Greenness reveals three distinct distribution patterns: the core urban area (high Greenness + low carbon emissions), the suburban area (medium Greenness + medium carbon emissions), and the outer suburbs (low Greenness + high carbon emissions) (Fig. 14b). In the core urban area, carbon emission reduction is achieved through “short-distance commuting + non-motorized transport dominance”, which drives the detailed layout of green spaces. In suburban new towns, the function of green spaces is mixed and insufficient, requiring reliance on subways and buses for emission reduction. In the outer suburbs, fragmented green spaces lead to higher dependence on private cars.

Fig. 14
Fig. 14
Full size image

Regression coefficient of GWR model of carbon emissions per inhabitant.

Geographical weighted regression of workplace carbon emissions

The regression coefficient of the southern mountainous area and the eastern new town group is lower than that of other regions (Fig. 15). Tourism is the main industries to be developed in the southern mountain area, high-tech development zones are located in the eastern part of the city, and the new and old energy conversion Zone in the north is China’s second state-level new area after Xiongan New Area, aiming to promote ecological protection and high-quality development of the Yellow River Basin49. By appropriately enriching the types of companies and developing low-carbon and environmental protection industries according to local conditions, the total carbon emissions of the workplace can be reduced.

Fig. 15
Fig. 15
Full size image

Regression coefficient of GWR model of total carbon emissions from work site.

For workplaces, Yaojia Street, Shunyu Road, and Wang She Ren Street are characterized by high-density communities with moderate to low openness. These streets, however, exhibit strong functional diversity, with short-distance commuting being the predominant mode. In contrast, Daqiao Town and Xiying Town, through the development of new ecological areas, maintain a high level of openness. However, these regions are located in the outer suburbs of the city, and long-distance commuting increases carbon emissions (Fig. 16a). Meanwhile, CuiZhai Town, Lingang Street, and other monofunctional singular “dormitory towns”, while creating a sense of residential oppression for residents, also promote the use of private cars (Fig. 16b).

Fig. 16
Fig. 16
Full size image

GWR model regression coefficient of per capita carbon emissions at work site.

Importance analysis using XGBoost

To clarify the effects of the built environment, the XGBoost machine learning method was used to rank the importance of significant independent variables across four models for a comprehensive analysis (Fig. 17). The number of commercial and residential units plays a dominant role in the scale of carbon emissions, with its spatial clustering significantly increasing the separation between work and residence58. High-density development reduces pedestrian spaces, leading to greater dependence on private cars59. Green vision rate regulates travel patterns through psychological perception60, and its mechanism of action forms a synergistic effect with security perception61 and road network density. The dominance of the number of companies and enterprises stems from the continuous and rigid commuting demand62, while the service radius of commercial and medical POIs has localized characteristics63.Floor area ratio affects individual choices through dual pathways. Although a high floor area ratio enhances the balance between work and residence64, the sense of oppression leads to an increase in motor vehicle preference65.

Fig. 17
Fig. 17
Full size image

Ranking of Importance Based on XGBoost.

Two-factor interaction analysis

In this study, the independent variables selected by multiple linear regression were analyzed by using geographic detector, and bivariate interaction analysis was conducted on total commuting carbon emissions and per capita commuting carbon emissions respectively. Total commuting carbon emissions are primarily influenced by the interaction between the number of commercial and residential units and the proportion of land allocated to transportation infrastructure (Fig. 18a). Commercial residential land provides housing for individuals engaged in commercial activities, while transportation infrastructure land offers the essential support for these activities66.Among the interactive variables affecting per capita commuting carbon emissions, road network density plays the most prominent role (Fig. 18b). High road network density facilitates more convenient transfers between attractions, reducing travel time and enhancing tourism efficiency67. Additionally, increased road network density improves access to a broader range of tourist destinations, allowing visitors to experience more diverse attractions68.

Fig. 18
Fig. 18
Full size image

Interaction analysis of commuting carbon emissions. The picture is self-drawn by the author.

Discussion

The hierarchical structure of the number of commuters and commuting distance

The commuting population and commuting distances in the central urban area of Jinan exhibit distinct spatial patterns. The commuting distances in both the central urban area and the suburbs are relatively short, and these regions are designated as the “commuting cold zone” in this study. Areas with larger commuting populations and longer commuting distances are primarily concentrated between the second and third rings of the central urban area (Fig. 19a), which are referred to as the “commuting hot zone”. As land resources in the central urban area become increasingly scarce, residential areas have gradually expanded toward the city’s suburbs69. This outward expansion has increased the proportion of residential land within the “commuting hot zone”, thereby raising commuting demand. The residential hotspots areas are primarily inhabited by middle- and low-income groups, who frequently commute across regions to employment centers in the city center, while another group commutes to factories or farmlands in the suburbs, requiring longer commuting distances (Fig. 19b). The commuting cold zones in the suburbs are characterized by short commuting times, reflecting the presence of specialized industries such as agriculture, forestry, animal husbandry, and fishery activities, which are spatially dispersed across the suburbs70 (Fig. 19c). Industries requiring substantial labor and capital investment, such as manufacturing and construction, are typically concentrated in specific industrial parks.

Fig. 19
Fig. 19
Full size image

Commuter cold zone and commuter hot zone.

Differences in the spatial impact of street perception characteristics on total and per capita commuting carbon emissions

The results of spatial analysis indicate that the influence of street perception characteristics on per capita commuting carbon emissions is significantly stronger than their impact on total commuting carbon emissions. Per capita carbon emissions reflect individuals’ commuting behavior patterns and are directly influenced by their travel choices71. Street perception characteristics can significantly influence individuals’ travel modes by shaping residents’ subjective experiences and decisions, thereby reducing per capita carbon emissions. Total carbon emissions are comprehensively influenced by the size of the regional population, the scale of economic activities, and commuting demands72. Even if per capita emissions decline, when the population grows or commuting demand increases, total emissions may rise rather than fall. The influence of street perception characteristics on total emissions is indirectly reflected through “individual behavior change × population size”, and its effect is easily diluted by macro trends.

Emission reduction efficiency of tourist attractions in the core area and the low-carbon renewal model

We observed that the negative impact of tourist attractions on carbon emissions in Jinan’s core area is greater than that of transportation facilities. Tourist attractions in the core area serve not only as visiting nodes but also restructure urban functions through a mixed development model of “business, tourism, culture, and residence73”. Scenic spots have created a large number of service industry jobs (such as tour guides, catering, and retail), among which the night economy has further promoted the adoption of flexible working systems, increased the proportion of off-peak commuting, and reduced idle carbon emissions caused by peak-hour congestion73. The practice in Jinan’s core area demonstrates that in high-density built-up areas, catalytic mixed-use development of scenic spots achieves far greater emission reduction efficiency than upgrading traditional transportation facilities. This provides a low-carbon renewal model for similar historical cities (e.g., Xi’an and Luoyang): rather than infinitely expanding transportation supply, it is better to activate the intrinsic carbon reduction potential of existing resources through “spatial acupuncture”.

The differentiated impact of openness and spatial depression on suburban street commuting carbon emissions

The opposing effects of Openness and Spatial Depression on commuting carbon emissions in suburban areas stem from the complex interaction between urban form, behavioral psychology, and transportation patterns. Wide streets with continuous green belts can reduce psychological resistance to walking or cycling, increasing the share of slow commuting74. In contrast, depressed streets (characterized by confined spaces) often coincide with inefficient road networks, posing detour risks during commuting. Frequent roadside parking further encroaches on slow-moving spaces. Such confined environments often trigger elevated cortisol levels, making residents more inclined to drive to avoid environmental stress. For future suburban urban renewal, environmental psychology indicators (e.g., sky visibility and green view index) should be integrated into the statutory planning system. Integrating digital twin technology to simulate the emission reduction effects of visual interventions can achieve coordinated progress in “spatial healing” and “carbon emission reduction”.

Conclusion

This study focuses on the central urban area of Jinan, with the primary aim of analyzing the relationship between commuting carbon emissions and the built environment. It utilizes multiple data sources, including commuting trajectories and street view images, to collect data on commuting carbon emissions and the built environment. Using a methodological framework that includes linear selection, spatial autocorrelation and geographically weighted regression, XGBoost importance ranking, and geographic detector interaction analysis, this research investigates the impact of the built environment on commuting carbon emissions. The goal is to provide theoretical insights for urban planning and development in medium-sized cities. The research findings are as follows:

  1. (1)

    Based on the current situation of commuter connections, the per capita commuting distance in the main urban area of Jinan City is 6 km, and private vehicles are the main commuting mode. The proportion of commuting by bus, bike and car is 22%, 38% and 40% respectively. The spatial distribution of the driving proportion and per capita commuting distance exhibits the characteristics of “low in the center and high in the periphery”.

  2. (2)

    From the perspective of spatial characteristics, streets with higher total carbon emissions in residential areas are primarily concentrated in the residential clusters near the Third Ring Road. Meanwhile, streets with high total carbon emissions in the workplace are mostly employment oriented streets such as Yanshan Street, Jiyang Street, and Yanwen Community. In residential areas, streets with higher per capita carbon emissions are located in the Changling Mountain area, Shengfu Zhuang area, and Lianhua Mountain area. The low-value cluster area is primarily situated within the central urban zone, whereas the high-value cluster area displays a markedly different distribution pattern.

  3. (3)

    The regression results reveal significant spatial heterogeneity in the driving factors of commuting carbon emissions, with the impact of transportation infrastructure land on total carbon emissions exhibiting a north–south gradient. In residential areas, perceived safety and greenness levels show a spatial coupling effect. The greenness coefficient exhibits hierarchical differentiation among the central urban area, the near suburbs, and the far suburbs. Workplace carbon emissions are primarily influenced by the spatial functional composition, with high-density mixed-use communities achieving low-carbon emissions through short-distance commuting. The openness coefficient exhibits contrasts in areas characterized by a single function. Carbon emission intensity in the southern mountainous, tourist-oriented areas is lower than in the industrially dominated eastern new town, confirming the regulatory effect of industrial type on the carbon emission baseline.

  4. (4)

    Regarding the order of importance of factors and their interactive effects, the number of commercial and residential units and enterprise agglomeration are the dominant factors for total carbon emissions at residential and workplace locations, respectively. Per capita emissions are more influenced by individual behavior and environmental interactions. Furthermore, commercial and residential areas and transportation infrastructure land exhibit a complementary effect, while the strong interaction between road network density and tourist attractions highlights the capacity of high-density road networks to enhance tourism efficiency.

The study suggests that policymakers should implement differentiated spatial governance based on the north–south (east–west) gradient effects of the city, promoting “TOD + work-residence balance” development. Green space optimization should be prioritized, with the core urban area advancing the “500-m green view” project and the far suburbs developing ecological corridors to reduce green space fragmentation. A “carbon emission—spatial perception” monitoring platform should be established, integrating mobile signaling and street view data to track latent perceptual factors in real time.

There are some limitations to the data in this study. On the one hand, the study calculated the carbon emission index based on big data of commuting trajectory, but commuting modes were only divided into driving, public transportation and cycling, which were not detailed enough compared with the questionnaire survey. On the other hand, using streets as the research unit, this study may encounter the modifiable areal unit problem (MAUP), which can have uncertain effects on the results of spatial data analysis and thus affect the accuracy of the correlation analysis between built environment factors and commuting carbon emissions to a certain extent. Therefore, the next step involves establishing a grid using geographic information technology to more accurately measure the built environment of residential and workplace locations. A multi-scale analysis framework, such as “city-district-street-living circle-grid”, can be constructed, with appropriate geographical units selected to define scales and partitions, thereby enriching the conclusions and enhancing the accuracy of the research.