Abstract
Cities encompass a mixture of artificial, human, animal, and nature-based sounds, which through long and short-term exposures, can impact on physical and mental health. Yet, most epidemiological research has focused on only transportation noise, leaving a significant gap in understanding the health impacts of other urban sound types, especially in sub-Saharan Africa (SSA). We conducted a large-scale measurement campaign in Accra, Ghana, collecting audio recordings and sound levels from 129 locations between April 2019-June 2020. We classified sound types with a neural network model and then used Random Forest land use regression to predict prevalences of different sound types citywide. We then developed a composite metric integrating sound levels with the prevalence of sound types. Road traffic sounds dominated the urban core, while human and animal sounds were prominent in high-density and peri-urban areas, respectively. Our high-resolution approach provides a comprehensive characterization of the complexity of urban sounds in a major SSA city, paving the way for new epidemiological studies on the health impacts of exposure to diverse sound sources in the future.
Similar content being viewed by others
Introduction
Cities have complex mixtures of mechanical, human, and nature-based sounds varying in space and time1 that can impact physical and mental health in different ways2,3,4,5. In many parts of the world, research, planning, and policy efforts have focussed on characterizing, regulating, and mitigating transportation noise, such as that from road, rail, and aircraft traffic6,7,8with the overwhelming majority of epidemiological evidence dedicated to these sources6,9,10,11,12. However, the reality is that in most cities, transport-noise coexists with sounds from other activities, such as construction and industrial works, neighbour noise, restaurants, bars, and night-life, shopping and commerce, and in some places, even religious activities13,14,15,16,17. While these other sources of environmental noise may also influence physical and mental health, they have not been well characterized in most cities. This is certainly the case in rapidly urbanizing cities around the world, such as in sub-Saharan Africa (SSA)14,18,19,20,21,22,23,24.
Nature-based sounds are generally perceived in city spaces as providing a calming and relaxing environment13,25,26. This is further supported by a limited number of studies that have demonstrated in controlled environments (i.e., virtual reality) that physiological stress recovery can occur after short-term exposure to nature-based animal (e.g., birdsong) or geophysical (e.g., water features) sounds3,27,28,29,30,31. Other types of sounds in cities can also evoke positive psychological responses, such as religious or culturally significant sounds (e.g., church bells)13 which have increasingly led to the inclusion of positive soundscape planning into urban design and policies32,33,34,35.
Exceeding global trends, cities in SSA are undergoing significant expansion and economic transformations, resulting in changes in urban land use (e.g., built-up versus natural areas)36,37,38,39,40 and transportation infrastructure41,42,43which will have an impact on the sound environment32. Fast growing SSA cities are now characterised by glaring urban transport problems, including traffic congestion, long commute times, and traffic related noise pollution41,42,44,45. Compared to European and North American cities46,47,48there is very limited research on the complexity of the sound environment in SSA cities, including on the health effects of exposure to environmental noise sources45,49. This poses a barrier to local urban planning efforts and noise mitigation efforts32but also for conducting epidemiological studies investigating the adverse, and even potentially beneficial, health effects of exposure to different sound environments, at city or population scale, in SSA.
Leveraging advancements in sensor networks and Machine Learning (ML) models50,51 we conducted a study that combined city-wide ML classified audio recordings and sound levels with spatial prediction modelling approaches (i.e., Random Forest Land Use Regression) to characterise and spatially predict the diverse sound environment in a major sub-Saharan African city over a period of approximately one year. We also developed a composite index which reflects both the loudness of sound in an area and the frequency (i.e., prevalence) of different sound types. The data can aid local urban sound planning and design efforts in Accra and support future planned epidemiological studies investigating the direct and interactive effects of exposure to different urban sounds. Lastly, our approach to characterising the complexity of the urban sound environment could be translatable to other SSA cities in the future to fill this major data gap and support urban sound policy design and planning in cities across the continent.
Results
Detected sounds from audio recordings
Valid audio recording data was obtained from 1,090 site-days spanning 129 measurement sites (Fig. 1) across the Greater Accra Metropolitan Area (GAMA, a 1500 km2 metropolis with ~ 4 million residents) which were passed through a pre-trained neural network to classify different types of city sounds (see Methods section for details and model accuracy metrics). At the measurement locations, road-transport sounds were the most prevalent in both the day and night-time (median percentage of time present in the day (78%) and night (58%), while animal and insect sounds were also common, particularly at night (median: 55% of time present). The median percentage of time that music sounds were present was between 5 and 6% and < 1% for geophysical nature sounds (see Supplementary Information (SI)).
Locations of fixed and rotating measurements sites with environmental audio data in the Greater Accra Metropolitan Area, Ghana. Road network is from OpenStreetMap in 2019 52 and the Greater Accra Metropolitan Area (GAMA) and Accra Metropolis boundaries are from Ghana Statistical Service (census year 2010)53. The inset shows background maps of Africa and Ghana (ESRI (Environmental Systems Research Institute)). A description of the land use categories used to sample the locations of the sites is in the Methods.
Spatial model predictive accuracy and variable importance
The amount of vegetation (Normalised Difference Vegetation Index (NDVI)) in an area was an important predictor of sound type prevalence in almost all Random Forest Land Use Regression (LUR) models. In addition, predictor variables representing road-traffic (e.g., road-networks) were important in both the road-transport and animal and insect sound models. Some of the most important variables in the music, speech, and nature sound models were related to population and building density and land use (more details in the SI).
The median absolute errors (MAEs) of the road-transport day and night models ranged from 10 to 16%, and the correlation of predicted and observed values ranged from 0.68 to 0.69, when holding out 10% of random measurement site data 10-times from the model building process (CV10%sites). The accuracy metrics for the animal models were similar (see SI). The MAEs of the speech models ranged from 4 to 13% with CV10%sites, and the correlation of predicted and observed values ranged from 0.53 to 0.60. The music LUR models were more accurate at predicting low levels of music sound prevalence than they were at predicting higher levels of music sound prevalence since there were limited data points at the higher ends of the distribution to train the model. This is reflected in the fact that the MAE was relatively low, being driven down by the majority of observations at the lower end of the distribution that were predicted correctly. The correlation between the observed and predicted values for music sounds was moderate-low (CV10%sites: 0.38–0.42). The geophysical nature sound LUR models could not generalize and predict accurately to new locations when the values were greater than 0% prevalence, as the distribution of nature sounds at our sites were highly skewed towards zero (see model performance statistics in SI). Thus, we do not present geophysical nature sound predictions further in this paper.
Spatiotemporal variation in predicted sounds
Predicted road-transport sounds dominated the soundscape of the GAMA, present nearly half of the time during the daytime (median [IQR]: 56% [46, 69]) and a significant portion of the night-time as well (median [IQR]: 35% [28, 46]) (Fig. 2). Though the prevalence of road-traffic sounds were generally lower at night-time (Fig. 2) and these sounds were far from spatially uniform, with notable variation across the region (Fig. 3). The quietest zones were the northern peri-urban residential areas, where daytime road-transport sound prevalence barely reached 30% and dropped below 10% at night. As expected, the highest concentrations were found near major road networks and in densely populated urban centres like Accra Metropolis (AMA), where road-transport sound prevalence peaked at 80% during the day and 62% at night. Eastern urban hubs like Tema and Ashaiman also experienced high levels, with road-traffic sounds present 69% of the day and 46% of the night. Interestingly, a few locations (< 2%) in the GAMA had more road-traffic noise at night than during the daytime.
The spatial distribution of predicted animal and insect sounds showed a strong inverse correlation with road-transport noise (Pearson correlation: -0.90 during the day, -0.87 at night). During the daytime, animal and insect sounds were typically present about a third of the time in Accra Metropolis (median: 31%) and near major roads (median: 35%). At night, the prevalence of these sounds increased, especially in the peri-urban areas of northern GAMA. Across the four sound types, animal and insect sounds were the only sound category where at almost all locations the percentage of time present was higher at night than during the day (Fig. 2). Music and speech exhibited similar spatial patterns (correlation: 0.84 at night to 0.91 during the day), with the highest levels observed in densely populated neighbourhood’s, particularly within Accra Metropolis. (Fig. 3). At all locations across the GAMA, speech sounds were more prevalent during the day-time than the night-time, as would be expected. A very small number of locations (< 10%) had a higher percentage of time where music sounds were present during the night-time, compared with the day-time (Fig. 2).
Predicted percentage of the day- and night-time that road-transport, animal and insect, music, and speech sounds were present (%) at locations in the Greater Accra Metropolitan Area (GAMA). Grey areas on the map represent areas excluded from prediction as they are considered to be out of sample (e.g., water bodies, forest/ grassland with no human settlement and no roads).
Sound type index
For applications with city-scale measurements, we derived a composite metric (called the ‘SoundType Index’) which represents both the frequency of sound types (%) and measured sound levels (dBA) across space and time (see Methods for details). The road-transport soundtype index values had the greatest variation (see Table 1) but also the highest values, on average, compared to other sound types (Fig. 4), particularly in the Accra Metropolitan area reflecting both the fact that this type of sound is common, but also is present with higher measured sound levels (Fig. 5). In the night-time, the road-transport soundType index values were lower than day-time (Table 1). While animal and insect sounds were relatively prevalent in the GAMA, no areas had an animal and insect soundType index exceeding > 0.5, reflecting the fact that areas in the GAMA where there were a high prevalence of animal and insect sounds were also generally the places with lower sound levels (i.e., in the peri-urban periphery) (Fig. 5 and Figure SI2 in the Supplementary Information)). The median music soundType index values in the day and night were low (< 0.03, min: 0.0, max: 0.21), largely a result of the lower levels of predicted music sound prevalence in the GAMA compared with other sound types. Though the median music soundType index values were higher in Accra Metropolis, reflecting the increased prevalence of that sound type in the city-centre.
Distributions of daytime predicted sound type prevalence (% of time present) across areas with varying predicted sound levels (Lday, dBA). Sound levels for each area in the GAMA were predicted from a LUR model described elsewhere54. Circular points represent the sample median, black horizontal lines the quartile range, and the coloured regions the density of the sample distribution. Night-time values are shown in the SI (Figure SI2).
Discussion and conclusions
As we have shown, low-cost audio recording devices55 coupled with machine learning56,57,58,59 and spatial prediction models can help fill the data gap on sound environments and sources of noise in cities in SSA and elsewhere60. The detailed information that we collected and modelled on urban sounds, and their spatial and temporal patterns, can be of aid to local policy makers and city planners, to identify areas with acoustic qualities that could be mitigated (e.g., traffic noise pollution) or conversely areas where sounds might be preserved or enhanced (e.g., constructing parks which enhance birdsong). Exposure scientists and environmental epidemiologists can also leverage the approaches presented in this paper to generate city-scale, high-resolution data on the prevalence of different types of sounds in cities, allowing for the inclusion of these exposure surfaces in epidemiological studies investigating the adverse, as well as potentially beneficial, health effects of exposure to different types of sounds along with the more traditional metrics representing sound levels and events from transportation sources (i.e., Lden, Lnight).
Road-transport and animal sounds (e.g., often representing bird song) were regularly patterned across the city and the models were able to predict gradients of prevalence with a good degree of accuracy. Unfortunately, the Random Forest LUR models were not able to accurately predict geophysical nature sounds due to the lack of representation of the model training data at prevalences higher than 0%. In the future, additional audio measurements in targeted nature-based spaces in Accra could be collected, classified, and included in the models training dataset to improve the spatial prediction performance. Furthermore, many types of geophysical nature-sounds, such as the timing of rainfall events or the rustling of leaves during windier days, are not patterned across space but are patterned across time. Therefore, it is no surprise that our spatial predictors were not able to capture the variation in the geophysical nature sound category across the city in our models, which may be more temporal in nature.
We found that during the day-time, spatially predicted road-transport, music, and human speech sound prevalences were generally higher than during the nighttime. However, animal and insect sound prevalences were generally higher at most locations during the night-time than in the day-time (Fig. 2). With five sound type categories (including attempting to model nature-based geophysical sounds) and two time periods (day and night), this resulted in the development of 10 separate models, representing diverse predicted trends over space and time. However, we recommend that future work be carried out to further refine and develop models with an hourly resolution (as we previously developed for sound level metrics in Accra54) and additionally to capture and define the evening period in a distinct way. The evening period represents a key time window in which individuals are likely exposed to distinct sound profiles while they are at home, yet not yet asleep, and also captures the period in which the evening traffic commute begins to subside in some locations. It would also allow for the development of a 24-hour based metric, similar to Lden, which is commonly used to estimate health effects of exposure to noise across the full 24-hour day and penalises different time periods (night-time, and evening) based on increased sensitivity to noise during these periods. Further methodological work would need to be carried out to consider how to develop such a metric with these sound type data inputs, and in particular how to handle penalties for different types of sounds.
We found that in the GAMA, predicted road-transportation sounds were pervasive and generally accompanied with higher sound levels. Demand for motorized transport has rapidly increased in Accra over the last few decades as the population61 and the city limits have continued to expand61. This has resulted in worsening traffic congestion, particularly in Accra Metropolis42,61. Private vehicles or privately owned minibuses/vans known locally as tro tros are currently the primary means of getting around the city’s main arteries. Greater Accra also overwhelmingly has the highest proportion of registered vehicles in Ghana, more than half of the total national number42. At the same time, urban growth has resulted in the re-development of natural lands, effectively decreasing the percentage of forested areas in the GAMA from 34% of land area in 1992 to 6.5% in 2015 39, particularly in and around Accra Metropolis. However, many neighbourhood’s in Accra have dense and mixed-land use, resulting in a mixture of different types of sounds contributing to the overall sound environment, which was reflected in our findings. So, while measurements were taken along the road-side, road-traffic sounds were not the only contributors to the measured Lday and Lnight levels (dBA). Many roads in Accra and the accompanying sidewalks have mixed uses and functions, such as road-side commercial sales, outdoor dining/events, and even play for children62. As we have shown, the collection of audio recordings in noise measurement campaigns can provide additional contextual information on the sources of the measured sound levels, which do not automatically distinguish the source themselves (i.e., without detailed frequency analysis).
Urban green spaces (with trees/plants) or private gardens can facilitate what are typically described in soundscape studies as ‘tranquil/desirable’ sounds13 by providing a habitat for birds as well as being a source of sound itself (e.g., rustling leaves). Certain types of vegetation may also reduce noise from other sources in the surrounding environment by acting as a natural attenuator through absorption and diffusion63,64. While controlled experimental or neighbourhood-scale soundscape studies have found that geophysical nature-based and animal sounds such as birdsong have been linked to enhanced feelings of subjective wellbeing and physical markers of stress reduction3,4,29,65,66these relationships have not been confirmed in population-based large-scale epidemiological studies. The SoundType index, which jointly represents both the sound level and the frequency of different sound types in an area, derived from measurements and LUR models, is well placed for inclusion and testing in future epidemiological studies in Accra of both the negative, as well as potential beneficial, health impacts of exposure to different types of sounds. For example, the derived exposure datasets will be linked up with new health data in Accra, collected as part of the Accra Schools Health and Environment study (ASHES), a cross-sectional study of over 900 school children’s health and development, as well as the Accra Birth Cohort (ABC) study, longitudinally tracking birth outcomes and developmental health impacts among pregnant women.
Strengths
The data collection campaign in Accra produced millions of sound level and audio records covering every hour of the day, day of the week, and month of the year at diverse locations in the city. Our systematic selection of monitoring equipment that was cost-effective, energy-efficient and battery powered, lightweight, and socially unobtrusive optimized data collection and was less labour intensive in a setting with unreliable electricity. Modelling and spatially predicting sound type prevalences in high-resolution at city scale using audio recording classifications and LUR models, to our knowledge, is one of the first few attempts to do so in any urban setting. The use of Random Forest modelling allowed for computationally efficient and relatively accurate prediction of many of the sound types. While there is area for improvement (as discussed below), this work shows potential applications for other cities in SSA and elsewhere.
Limitations
There are several limitations to this work. While we used a wide variety of spatial predictor variables, we were not able to obtain spatially and/or temporally resolved information on traffic flow, volume, and fleet composition. Inclusion of this information could have improved model predictive accuracy, particularly for transport-related sounds, as we have already shown in previous work with measured sound levels (non-source resolved)67. We had to make the assumption that the spatial predictors were stationary in time and represented the period when the audio recordings were taken. This assumption may not be true for all spatial predictors as the census data which was used to estimate population density was generated in 2010 and the dataset used to estimate land cover dates to 2014. The temporal misalignment of some of the predictor variables may be especially relevant for the urbanizing periphery areas of the GAMA.
The different types of sounds identified from each audio recording by the sound classification neural network (as described in Clark et al. 2021 45) is partly a function of how clear the recording was, how loud/prominent the specific sound was in the recording, but also the neural network itself and the unique nature of the training dataset. It is also likely the case that sometimes specific sound categories were not identified within an audio recording because they were masked by other sounds (e.g., road-traffic masking out sounds of rustling leaves), which could be a function of the fact that the audio recorders are low-cost, and inevitably lower precision and accuracy, instruments. However, we do not believe that masking effects had a major impact on the spatial distribution of the predicted sound type prevalence’s for the following reasons: the accuracy of the neural network (based on human labels) was fairly good - between 76 and 93% for road-traffic, animals and insects, human speech, and music – and the relatively small percentage of non-accurate labels could be from either masking or misclassification45; also the spatial patterning of sound sources that were identified visually from analysis of time-lapse camera imagery collected at the same locations68 largely followed the same patterning found with audio analysis. Finally, we were interested in those sounds which were clearly identifiable in the audio recordings, and supposedly by the human ear also, which better reflects the human experience of the sound environment at each place and time. Also, due to data limitations, we had to aggregate classifications into broad categories, within which there may be different human perceptions and preferences for different sound types (e.g., preference some bird songs and calls but not from other birds). As the acoustic classification model returned the top 3 predicted sound classes, we were able to detect overlapping sounds in each place and time. However, the model is not able to provide an estimate of how predominate each sound type is, and therefore they are counted equally, which is a limitation of this work. To manage storage constraints and maintain privacy, we recorded 10-second audio clips every 10 min—yielding six samples per hour across all hours and days. We assume this temporal sampling strategy sufficiently captured the variation in sound sources over time. Still, implementing this type of urban acoustic monitoring may not be practical in all contexts, especially where different legal and privacy frameworks apply. Some alternative approaches rely on high-resolution (L125ms) spectral data, though these require more advanced—and often more costly—equipment, which can limit data collection capacity. Lastly, while our sound classification accuracy could likely be improved by either building a custom model or retraining an existing one using local audio data, these steps were beyond the scope of the current project. Furthermore, while our underlying acoustic classification models used Audio Set, other open-access datasets used to train urban noise models exist, such as UrbanSound8K or ESC 50. Future work should investigate the applicability of these datasets to African urban contexts.
Conclusion
In conclusion, we conducted a study that integrated city-wide audio recordings and sound level measurements with machine learning techniques to characterize and spatially predict the varied sound environment in a major sub-Saharan African city. Additionally, we developed a composite metric that reflects both the magnitude of sound levels in specific areas and the prevalence of different sound types, showing a new approach for how the complexity of urban sound environments can be captured and characterised through measurements. The insights gained from this research can assist in local urban sound planning and design initiatives in Accra, while also supporting future epidemiological studies investigating the direct and combined health effects of exposure to diverse urban sounds on city residents, filling a major evidence gap in SSA. Moreover, our distinctive approach to characterising the complexity of urban sound profiles holds potential for application in other SSA cities, addressing a significant data gap and supporting the development of urban sound policies across the continent.
Methods
Study area
Our study was conducted in the Greater Accra Metropolitan Area (GAMA, 1500 km2), the most densely populated area in Ghana and the political, economic and administrative capital. This metropolitan region (~ 4 million people in 2019) includes Accra Metropolis as its core (~ 2 million people), Tema to the east (~ 200,000 people)69and the expanding suburban municipalities to the northeast and northwest.
Data collection
Environmental monitoring campaign
Study design and data collection procedures are described in detail elsewhere70. Briefly, we conducted environmental monitoring at 146 locations in the GAMA for 7-day (136 rotating site locations) and around 1-year periods (10 fixed site locations) between April 10th 2019 and June 11th 2020 (Fig. 1). The measurement campaign was briefly interrupted in a period between March and May 2020 due to COVID-19 pandemic lockdown. The fixed sites were purposefully chosen through substantial knowledge of the local area to represent a diversity of land use, socioeconomic, and transport features. The rotating sites were selected through stratified random sampling with strata representing land use71 and sites were oversampled in Accra Metropolis which is the most populous area (~ half of the population in GAMA lives in Accra Metropolis). The four land use categories from a 2014 World Bank map included: medium/low-density residential; high-density residential; commercial, business and industrial areas; and ‘other’ areas (e.g., parks, forest, agricultural areas) (details in Clark et al. 2020 70). Rotating sites were evening sampled across these four categories and 50% of sites were additionally located within Accra Metropolis. Briefly, medium/low-density residential are typically formal residential areas and can have paved or unpaved roads which are double or single lane. The housing property can have yards, fences, walls, and/ or driveways and there is a clear demarcation of where one home ends and another begins. High-density residential are typically informal residential areas and could be classified as shantytowns or slums. Roads (paved or unpaved) are typically narrow, sometimes unidentifiable, and buildings are typically small with a lack of a clear demarcation of where one home ends and another begins. Other areas are the least built-up areas and can have an abundance of forest land, grass land, shrubs, barren land, water, and/or be classified as a park. Commercial, business and industrial (CBI) areas are places which are either dominated by commercial, business, industrial, or government activities. These areas can have large buildings, though that is not a requirement.
As we previously reported in Clark et al. 2021 45, we also overlaid these site locations onto city-wide geographic data representing vegetation levels and population density and summarized the levels of these descriptors by our land use categories. Specifically, population density (people/km2) within census enumeration areas (Median EA area: 0.03km2) was obtained from the 2010 Ghana National Population Housing census. This is the most up to date and reliable record of population distribution within Accra at the small-area level. We obtained estimates of vegetation by calculating the Normalized Difference Vegetation Index (NDVI) from 30 m spatial resolution Landsat 8 satellite imagery collected at the mid-point of the campaign in January 2020. This imagery also had the lowest amount of cloud cover in the potential set72. NDVI values close to zero indicate no green vegetation (mostly built-up areas), and values close to + 1 indicate a high abundance of available vegetation. The mean NDVI level was 0.09 in high-density residential areas, 0.13 in medium/low-density residential areas, 0.10 in CBI areas and 0.22 in ‘other’ peri-urban areas. Population density across the sites also followed a predictable pattern with the highest mean population density in high-density residential areas (35,833) and the lowest in ‘other’ peri-urban areas (1701).
Audio data collection and sound type classification
We deployed low-cost full spectrum, audio recorders (AudioMoth, Open Acoustic Devices (Oxford, UK)) with analog MEMS microphones to record the outdoor acoustic environment45,70. The AudioMoths were set to a sampling rate of 32 kHz to capture the majority of sound in the audible range57. The audio recorders recorded for 10 s every 10 min, recording onto a microSD card. Specifically, we collected audio throughout the day (6 samples per hour, every hour) and across days (Monday - Sunday) at each measurement site in the city. For the fixed sites, we additionally have data spanning 12 months of the year. At each measurement site, the monitors were housed in a functional and protective enclosure (see study protocol for additional information70) that was attached to a pole at ~ 4 (± 1) meters above ground and oriented towards the public streetscape. Audio recorders were programmed to run continuously for 7-day periods, after which they were either swapped out and installed at a new set of rotating sites or replaced with a fresh batch at existing fixed sites. Audio recorders were deployed regardless of weather conditions.
We also collected 1-minute integrated A-weighted sound levels (decibels, dBA) with Sound Level Meters that were co-located with audio recorders at each measurement site. We used the Noise Sentry Sound Level Meter (SLM) datalogger (NSRT_mk3) from Convergence Instruments, Canada to collect 1-minute integrated A-weighted sound levels. The device is small and rugged, built to withstand temperatures in the range of − 20 °C to 60 °C, protected against water and dust, and has a digital MEMS microphone. Prior to each monitoring session, the sound level meters (SLM) were calibrated with a CA114 sound calibrator at 94.0 dB ± 0.3 dB and 1000 Hz ± 0.5%. The Noise Sentry’s have a noise floor of 30 dBA. Pre-campaign sound level monitor-monitor precision tests showed good agreement (see Clark et al. 2020 70) and the duplicate measurements that we conducted throughout the campaign at 16% of rotating sites indicated that the monitors did not drift from each other over time45. More details on the acoustic instruments, quality assurance and control checks can be found in the previously published study protocol70 and descriptive paper summarising the measurements45. We also previously developed the spatial LUR model for the environmental sound level metrics (Lday, Lnight) and thus the development of these models is reported elsewhere54.
We discontinued data collection at one fixed site located near the ocean as initial recorders were damaged due to rust from the contact of the airborne sea-salt with the exposed circuit board. Furthermore, audio recordings from 16 rotating sites were lost due to equipment malfunction or theft. The final analysis utilized audio data from 9 fixed (yearlong) sites and 120 rotating (weeklong) sites (Fig. 1), consisting of a total of 1,232,640 site-seconds and 1,370,880 site-seconds of audio data from rotating and fixed sites, respectively.
In order to detect the presence or absence of different types of sounds across the measurement’s sites and over time, each 10-second audio recording clip was run through a pre-trained acoustic classification neural network59,73. Details on our use of the acoustic classification model (DEEP-Hybrid DataCloud project73 based on Changsong et al. 2018 59) and its validation can be found in our previous paper45. In brief, the open-source model came pre-trained on the Audio Set dataset published from Google. Audio Set consists of 5,800 h of 10-second manually labelled YouTube videos and a structured hierarchical ontology of hundreds of sound classes. Audio Set is weakly labelled meaning that only the presence or absence of audio events are known in a clip. The neural network used in our analysis is based on the Multi-level Attention Model framework outlined by Changsong et al. 2018 59. It processes each audio clip by first converting it into a spectrogram, which is then passed through a series of fully connected layers. Both the final and intermediate outputs of these layers feed into multiple attention modules. These modules are designed to focus on the most informative parts of the audio file while filtering out irrelevant elements like background noise. The outputs from these attention components are then combined to produce the final classification result.
After applying our own audio recordings to the pre-trained model, we obtained the model’s top three predicted sound classes (top 3 chosen as highest levels of accuracy) as the presence of different types of sounds in cities can be overlapping (i.e., not mutually exclusive). Because these are predicted from the model, we call these ‘detected’ sound classes. We then grouped the sound classes into higher order sound type categories for acoustic environment characterization that were adapted from the ISO 2018 standard for acoustic environment characterization74:
-
road-transport sounds (e.g., engine sounds from motorized vehicles, honking),
-
animal and insect sounds (e.g., birdsong),
-
outdoor music (e.g., music played on the radio, people singing),
-
human speech (e.g., people talking, laughing),
-
airplane sounds,
-
geophysical nature sounds (e.g., wind, rain, thunder).
We also had a category for ‘other’ which consisted of all of the remaining miscellaneous sounds. As we detailed previously45we tested the accuracy of the model’s predictions within each sound type category by comparing agreement against manual sound labels applied by a researcher that was blinded to the model results from 150 randomly selected audio files (1/3 from rotating sites, 2/3 from fixed sites). The randomly selected audio files from the fixed sites were representative across different times of the day. The accuracy of model predictions was 93% for music sounds (Positive Predictive Value (PPV): 71%; Negative Predictive Value (NPV): 97%), and 80% for speech (PPV: 83%; NPV: 79%), 79% for animal and insects sounds (PPV: 97%; NPV: 74%), and 76% for road-transport sounds (PPV 76%; NPV: 78%). Geophysical nature sound classifications had accuracy of 87% (PPV: 100%; NPV: 83%), though the frequency at which this sound type was present in the audio files was low (4 audio clips out of 150)45. Airplane sounds had an accuracy value of 82%, though the positive-predictive value was low, only at 4% (NPV: 99%). As such, we do not consider airplane sounds further in this study (more details published elsewhere45). We also did not model ‘other’ sounds in this study, as we assumed that it would be uninformative given that these are a mixture of miscellaneous sounds (e.g., ‘clatter’, ‘alarm, ‘smash’) with minimal spatial variation and predictable patterning in the prevalence across land use factors45.
To prepare the audio classifications for the city-wide models, we calculated the prevalence of the detected sound types at each monitoring location for each date of monitoring. We additionally separated the data by the day (06:00–21:59) and night-time (22:00–05:59) periods in line with Ghana Environmental Protection Agency (EPA) classifications. Prevalence was defined as the percentage of clips where a particular sound type was present (i.e., prevalence) at any given location and date, calculated as the proportion of that sound type predicted as present (yes = 1, no = 0) at that location and during the monitoring period over the total number of audio recordings collected at that location and during the monitoring period, multiplied by 100. These prevalence figures for each sound type category formed the dependent variables in our LUR model as described below.
Spatial predictor variables
To capture land use/ land cover, we used a raster dataset at 20 m resolution that mapped four land cover classes across the GAMA from Spot 5 imagery attributed to the year 2014. The land cover classes were (i) formal residential neighbourhoods with small regular planned buildings, (ii) informal residential neighbourhoods (e.g., shantytowns, slums) with small irregular buildings, (iii) industrial and business areas, with large buildings and, and (iv) other areas (e.g., parks, forests, water, grassland)71. To characterize vegetation in Accra, we calculated the Normalized Difference Vegetation Index (NDVI) from 30 m resolution satellite imagery72. We obtained a Landsat 8 satellite product held on the U.S. Geological Survey department website attributed to a cloud free day in January 2020. January was also considered as a mid-point in the measurement campaign. The Normalized Difference Vegetation Index (NDVI) was calculated from raster bands 4 and 5. We also retrieved data on waterways in Accra from OSM in 2019. To estimate building density, we made use of a spatial dataset of building footprints attributed to the year 2019/202075. We estimated human population density from the most recently available Ghana census (2010) summarized within census enumeration areas (EAs)53. Census EAs are small geographic units with average population of 750–800 people and area 0.03–0.04 km2 within the GAMA. To capture road-traffic sources of sound, we used a road-network shapefile from OpenStreetMap (OSM) downloaded in 2019 76. Road-types were grouped as (i) major roads (includes highways, motorways, trunk roads), (ii) secondary and tertiary roads (iii) minor roads, and (iv) all roads. To capture locations of human activity, we obtained the latitude and longitude locations of churches, mosques, hospitals, primary and secondary schools, restaurants, shopping centres and markets, bars, pubs, and nightclubs from the Google Places API in 2019. We also obtained locations of bus stations/terminals from Google Places as an indicator of both human activity and road-transport sources. Finally, we retrieved data on elevation above sea level from a digital elevation model (DEM) for Africa at 90 m resolution77. More detailed information on our predictor variables can be found in our environmental noise level land use regression (LUR) modelling paper54.
Consistent with our noise level LUR modelling approach54we created multiple circular buffers (50, 100, 200, and 500 m) around the locations of the measurement sites78,79,80,81,82. We then calculated the total length of each road category, the total number of bus stations/terminals, the total number of places of human activities, mean land cover, the average NDVI, the average population density, the total number of buildings, and the total length of waterways, within each buffer. We additionally calculated the distance of each measurement site to the nearest major road83.
Spatial modelling and prediction of sound types (random forest land use regression (LUR))
We used a land use regression (LUR) approach to model and predict the prevalence of different sound types across the GAMA. LUR models are commonly used to estimate spatial variability in air pollution within cities84 and have been increasingly applied to noise in high and middle-income countries82,85,86,87,88 and also recently in Accra54. An LUR model regresses spatially, and sometimes temporally, structured environmental data (e.g., sound source prevalence) on geospatial predictors that represent a range of factors in the urban environment that are associated with the emission, propagation and attenuation of sound. We specifically used Random Forest (RF) models for sound source prevalence prediction89. Random Forest are an ensemble learning method, made up of combinations of classification/regression decision trees, which on their own (single trees), are highly sensitive to being overfit. Though Random Forests overcome this drawback by combining multiple trees, which are made up of bootstrapped observations and a random subset of predictor variables89. Random Forests are also robust to the inclusion of many variables with little or no importance and place no distributional assumptions on the outcome variables89. Random Forest models have also been shown to achieve higher accuracy in predicting noise pollution related outcomes compared with linear regression in five Canadian cities85.
We constructed 10 different models. Five models for each sound type and separate models for the day and night-times. The dependent variables represented the average prevalence (i.e., % of time present) of the different types of sounds detected in the day (06:00–21:59) and night-times (22:00–05:59) at each monitoring location. Additionally, since we had an imbalance between the number of days of audio measurements at fixed sites (range: 227–322 days) compared with rotating sites (7-days), we randomly sampled 30 days of data from each fixed site to be included in the models. In this way, we took advantage of the additional temporal information captured at fixed sites across a 12-month period, without flooding the model with data collected from only a handful of locations (a similar approach was used to analyse objects detected from street view imagery collected in Accra from the same campaign68).
We selected the best buffer radii for each spatial predictor variable and corresponding model with the random permutation’s method. This method provides a robust measure of the importance of predictors in a Random Forest model90 and has been used by previous spatial noise modelling studies using Random Forest85. Specifically, this method generates a random permutation of a variable and calculates the difference in model predictive accuracy when the random permutation is made compared with when the variable’s true values are used. If a predictor variable is important, random permutations will result in a larger decrease in model accuracy compared to permutations of other variables. Finally, if we a priori considered a spatial predictor variable not to be sound producing in either the day or night-time, the variable was not included in that corresponding model. For example, we included the variable representing the number of schools in the daytime but not the night-time models, as we assumed that activities in and around schools produce sounds primarily during the daytime.
We fit Random Forest models with 500 trees each to the sound type dependent variables and a suite of spatial predictor variables. We evaluated the fit and external predictive accuracy of the final models with cross validation. We ran 10-fold cross validation holding out data from 10% of random measurement sites (CV10%sites). From the cross-validations, we calculated the accuracy of the model’s predictions by computing the median absolute error and mean absolute error, which are measures of deviation, and the mean error, which is a measure of bias. The importance of each predictor variable in each model was assessed with the same method used to choose the best buffer size, the random permutation method. We inspected the distribution of the model’s residuals over multiple time-intervals, including days of the week and months of the year. To check for spatial autocorrelation in the models’ errors, we (i) visually inspected maps of model ME across measurement sites and ran Moran’s I tests of spatial autocorrelation on the models’ residuals. We also combined the residuals across models to see if there was any clustering of model prediction error present in a combined way. Specifically, we first took the absolute value of the residuals from all models and then standardized them to have the same range, between 0 and 1. We then summed the standardized absolute residuals across the sound type models, for the day and night separately.
We generated predicted surfaces for the prevalence (% of time present) of different sound types for the day and night-time, for all areas in the GAMA, at 50 m resolution. Though we did restrict predictions to areas that our measurement sites were representative of so that we did not predict out of sample. Thus, we excluded areas that did not contain roads, were fully covered by water bodies, and/ or areas that were fully grassland/forest. In addition to mapping the predicted presence of sound types across the GAMA, we also summarized the median, interquartile range (IQR), minimum and maximum values within the GAMA and across spatial sub-groups such as within Accra Metropolis and by proximity to major, secondary/tertiary, and minor roads.
Composite sound type index
Finally, we aimed to derive a composite metric which can characterize the multifaceted nature of measured sound in cities, by combining data on measured sound levels with the frequency of sound types across cities. To construct this composite metric (which we call the ‘SoundType Index’) we used the predicted surfaces of the prevalence of sound types (described above) combined with predicted surfaces of day-time (Lday (dBA)) and night-time (Lnight (dBA)) annual average A-weighted equivalent continuous sound levels which were generated from LUR models described in a previous publication54. We first pre-processed the data so that both variables were on the same scale91. Specifically, we transformed the sound type prevalences from percentages (0-100%) to proportions (0–1). We also normalized the sound level data into the same range by subtracting each value by the data minimum and dividing by the data range. While this allows for comparison of levels within our data and across locations within Accra, it may hinder comparison of this composite metric when applied to other cities and regions around the world where the variation and range of sound levels can differ to that found in Accra.
We calculated separate composite metrics for each type of sound and for the day and night-times separately. We did this with geometric aggregation and equal weighting (Formula in SI (SI2)). In the future, alternative iterations of the metric with various weighting schemes could be tested and compared against relevant health outcome and/or well-being data when that data becomes available in Accra. A graphical illustration of how we created the aggregate metric combining sound levels and sound types can be seen in Fig. 6.
Graphical example of the creation of the SoundType Index. Example shown represents the day-time road-transport index. The figure shows the geometric aggregation of normalized day-time sound levels (Normalized (Lday)) with the day-time proportion of time that road-transport sounds were predicted to be present and the resulting road-transport sound composite metric values at each raster cell.
All analyses were performed in R (R version 3.6). The Random Forest models were implemented using the randomForest package in R and data visualizations with the tmap, ggplot2, and ggdist R packages.
Data availability
The predicted environmental sound level datasets (Lday, Lnight, Lden) which were reported on in a previous publication are hosted on Zenodo and can be downloaded from: https://zenodo.org/records/11223249 . The predicted sound type surface datasets derived from this work are available from the corresponding author on request.
References
ISO 12913-1. 2014 Acoustics — Soundscape Part 1: Definition and conceptual framework.
Kang, J. et al. Sound environment: High-versus low-density cities. in Designing high-density cities for socinvironmental sustainability 163–180 (2010).
Ratcliffe, E., Gatersleben, B. & Sowden, P. T. Bird sounds and their contributions to perceived attention restoration and stress recovery. J. Environ. Psychol. 36, 221–228 (2013).
Depledge, M. H., Stone, R. J. & Bird, W. J. Can natural and virtual environments be used to promote improved human health and wellbeing? Environ. Sci. Technol. 45, 4660–4665 (2011).
Basner, M. et al. Auditory and non-auditory effects of noise on health. Lancet 383, 1325–1332 (2014).
World Health Organization. WHO environmental noise guidelines for the European region. (2018).
Quartieri, J. et al. A review of traffic noise predictive noise models. Recent Adv. Appl. Theor. Mech 72–80 (2009).
King, E. A. & Murphy, E. Environmental noise - ‘Forgotten’ or ‘ignored’ pollutant? Appl. Acoust. 112, 211–215 (2016).
Basner, M. & McGuire, S. WHO environmental noise guidelines for the European region: A systematic review on environmental noise and effects on sleep. Int. J. Environ. Res. Public. Health. 15, 519 (2018).
Guski, R., Schreckenberg, D. & Schuemer, R. WHO environmental noise guidelines for the European region: A systematic review on environmental noise and annoyance. Int. J. Environ. Res. Public. Health. 14, 1539 (2017).
Clark, C. & Paunovic, K. WHO environmental noise guidelines for the European region: A systematic review on environmental noise and cognition. Int. J. Environ. Res. Public. Health. 15, 285 (2018).
Stansfeld, S. A. & Matheson, M. P. Noise pollution: Non-auditory effects on health. Br. Med. Bull. 68, 243–257 (2003).
Kang, J. Urban soundscape. in Urban Sound Environment 43–107 (Routledge, (2017).
Kazeem, Y. & Dahir, A. L. African cities are battling escalating noise pollution - but religion stands in the way. Quartz Africa (2018).
Tong, H., Aletta, F., Mitchell, A., Oberman, T. & Kang, J. Increases in noise complaints during the COVID-19 lockdown in spring 2020: A case study in greater london, UK. Sci. Total Environ. 785, 147213 (2021).
Lee, P. J. & Jeong, J. H. Attitudes towards outdoor and neighbour noise during the COVID-19 lockdown: A case study in London. Sustain. Cities Soc. 67, 102768 (2021).
Hong, A., Kim, B. & Widener, M. Noise and the city: leveraging crowdsourced big data to examine the spatio-temporal relationship between urban development and noise annoyance. Environ. Plan. B Urban Anal. City Sci. 47, 1201–1218 (2019).
Kaledzi, I. Ghana asks mosques to turn down the noise and use WhatsApp for call to prayer. (2018). https://www.dw.com/en/ghana-asks-mosques-to-turn-down-the-noise-and-use-whatsapp-for-call-to-prayer/a-43373007
Samagwa, D., Mkoma, S. & Tungaraza, C. Investigation of noise pollution in restaurants in Morogoro municipality, tanzania, East Africa. J Appl. Sci. Environ. Manag 13, (2010).
Zakpala, R. N., Armah, F. A., Sackey, B. M. & Pabi, O. Night-time decibel hell: mapping noise exposure zones and individual annoyance ratings in an urban environment in Ghana. Scientifica (Cairo). 1–11. https://doi.org/10.1155/2014/892105 (2014).
Ebare, M. N., Omuemu, V. O. & Isah, E. C. Assessment of noise levels generated by music shops in an urban City in Nigeria. Public. Health. 125, 660–664 (2011).
Hoque, M. & Kabir, H. Traffic induced noise level in different places at the Dhaka capital City of Bangladesh. Bangladesh J. Environ. Sci. 38, 41–46 (2020).
Abankwa, E. & Agyemang, A. Impact of noise in the industry and commercial areas in ghana: case study of the Kumasi metropolis. Int. J. Eng. Res. Appl. 07, 11–19 (2017).
Knott, S. & Gyamfi, K. ‘If you complain they see you as evil’: accra’s religious noise problem. The Guardian (2019).
van den Bosch, M. Ode sang. Urban natural environments as nature-based solutions for improved public health – A systematic review of reviews. Environ. Res. 158, 373–384 (2017).
Ross, M. & Mason, G. J. The effects of preferred natural stimuli on humans’ affective states, physiological stress and mental health, and the potential implications for well-being in captive animals. Neurosci. Biobehav Rev. 83, 46–62 (2017).
Iyendo, T. O. Sound as a supportive design intervention for improving health care experience in the clinical ecosystem: A qualitative study. Complement. Ther. Clin. Pract. 29, 58–96 (2017).
Medvedev, O., Shepherd, D. & Hautus, M. J. The restorative potential of soundscapes: A physiological investigation. Appl. Acoust. 96, 20–26 (2015).
Annerstedt, M. et al. Inducing physiological stress recovery with sounds of nature in a virtual reality forest - results from a pilot study. Physiol. Behav. 118, 240–250 (2013).
Bakolis, I. et al. Urban mind: using smartphone technologies to investigate the impact of nature on mental well-being in real time. Bioscience 68, 134–145 (2018).
Ulrich, R. S. et al. Stress recovery during exposure to natural and urban environments. J. Environ. Psychol. 11, 201–230 (1991).
Radicchi, A. et al. Sound and the healthy City. Cities Heal. 5, 1–13 (2020).
Welsh Government. Noise and soundscape action plan. (2018).
Mayor of London. The London Plan: The spatial development strategy for Greater London. (2021).
City of London. Noise strategy 2016 to 2026. (2016).
Mabin, A., Butcher, S., Bloch, R. & Peripheries Suburbanisms and change in sub-Saharan African cities. Soc. Dyn. 39, 167–190 (2013).
Wania, A., Kemper, T., Tiede, D. & Zeil, P. Mapping recent built-up area changes in the City of Harare with high resolution satellite imagery. Appl. Geogr. 46, 35–44 (2014).
Agyemang, F. S. K., Silva, E. & Poku-Boansi, M. Understanding the urban Spatial structure of Sub-Saharan African cities using the case of urban development patterns of a Ghanaian city-region. Habitat Int. 85, 21–33 (2019).
Addae, B. & Oppelt, N. Land-use/land-cover change analysis and urban growth modelling in the greater Accra metropolitan area (GAMA), Ghana. Urban Sci. 3, 26 (2019).
Angel, S., Parent, J., Civco, D. L., Blei, A. & Potere, D. The dimensions of global urban expansion: estimates and projections for all countries, 2000–2050. Prog Plann. 75, 53–107 (2011).
Sietchiping, R., Permezel, M. J. & Ngomsi, C. Transport and mobility in sub-Saharan African cities: an overview of practices, lessons and options for improvements. Cities 29, 183–189 (2012).
Imoro Musah, B., Peng, L. & Xu, Y. Urban congestion and pollution: A quest for cogent solutions for Accra City. IOP Conf. Ser. Earth Environ. Sci 435, (2020).
Ehebrecht, D., Heinrichs, D. & Lenz, B. Motorcycle-taxis in sub-Saharan africa: current knowledge, implications for the debate on informal transport and research needs. J. Transp. Geogr. 69, 242–256 (2018).
Amegah, A. K. & Agyei-Mensah, S. Urban air pollution in sub-Saharan africa: time for action. Environ. Pollut. 220, 738–743 (2017).
Clark, S. N. et al. Space-time characterization of community noise and sound sources in accra, Ghana. Sci. Rep. 1–12. https://doi.org/10.1038/s41598-021-90454-6 (2021).
Kang, J. Urban noise evaluation. in Urban Sound Environment (Routledge, (2017).
Hammer, M. S., Swinburn, T. K. & Neitzel, R. L. Environmental noise pollution in the united states: developing an effective public health response. Environ. Heal Perspect. 122, 115 (2014).
European Environment Agency. Environmental noise in Europe – 2020. European Environ. Agency (2020).
Chen, Y., Hansell, A. L., Clark, S. N. & Cai, Y. S. Environmental noise and health in low-middle-income-countries: A systematic review of epidemiological evidence. Environ. Pollut. 316, 120605 (2023).
Salamon, J. & Bello, J. P. Deep convolutional neural networks and data augmentation for environmental sound classification. IEEE Signal. Process. Lett. https://doi.org/10.1109/LSP.2017.2657381 (2017).
Green, M. & Murphy, D. Environmental sound monitoring using machine learning on mobile devices. Appl. Acoust. 159, 107041 (2020).
OpenStreetMap. Planet dump. (2015). https://planet.openstreetmap.org
Ghana Statistical Service. Ghana Population and Housing Census. (2010).
Clark, S. N. et al. Spatial modelling and inequalities of environmental noise in accra, Ghana. Environ. Res. 214, 113932 (2022).
Hill, A. P. et al. AudioMoth: evaluation of a smart open acoustic device for monitoring biodiversity and the environment. Methods Ecol. Evol. 9, 1199–1211 (2018).
Cartwright, M. et al. Sonyc urban sound tagging (Sonyc-Ust): a multilabel dataset from an urban acoustic sensor network. Detect Classif. Acoust. Scenes Events 2019 (2019).
Fairbrass, A. J. et al. CityNet—Deep learning tools for urban ecoacoustic assessment. Methods Ecol. Evol. https://doi.org/10.1111/2041-210X.13114 (2018).
Salamon, J. & Bello, J. P. Deep convolutional neural networks and data augmentation for environmental sound classification. IEEE Signal. Process. Lett. 24, 279–283 (2017).
Changsong, Y., Barsim, K. S., Kong, Q. & Yang, B. Multi-level attention model for weakly supervised audio classification. ArXiv Prepr arXiv doi:180302353. (2018).
Hong, K. Y., Pinheiro, P. O. & Weichenthal, S. Predicting outdoor ultrafine particle number concentrations, particle size, and noise using street-level images and audio data. Environ. Int. 144, 106044 (2020).
Arup & Cities Alliance. Future proofing cities: Ghana - metropolitan cities. (2016).
Nathvani, R. et al. Measurement of urban vitality with time-lapsed street-view images and object-detection for scalable assessment of pedestrian-sidewalk dynamics (In review). ISPRS J. Photogramm. Remote Sens.
Kang, J. Urban noise mitigation. In Urban Sound Environment 175–199 (Routledge, (2017).
Kotzen, B. & English, C. Environmental noise barriers: Guide to their visual and acoustic design. (1999).
Jia, Y., Ma, H. & Kang, J. Characteristics and evaluation of urban soundscapes worthy of preservation. J. Environ. Manage. 253, 109722 (2020).
Buxton, R. T., Pearson, A. L., Allou, C., Fristrup, K. & Wittemyer, G. A synthesis of health benefits of natural sounds and their distribution in National parks. Proc. Natl. Acad. Sci. U S A. 118, 6–11 (2021).
Nathvani, R. et al. Beyond here and now: evaluating pollution Estimation across space and time from street view images with deep learning. Sci. Total Environ. 903, 166168 (2023).
Nathvani, R. et al. Characterisation of urban environment and activity across space and time using street images and deep learning in Accra. Sci. Rep. 12, 20470 (2022).
Ghana Statistical Service. Greater Accra Population: Population by sex and district 2010 and 2019. (2019). https://statsghana.gov.gh/nationalaccount_macros.php?Stats=MTA1NTY1NjgxLjUwNg==/webstats/s679n2sn87
Clark, S. N. et al. High-resolution Spatiotemporal measurement of air and environmental noise pollution in sub-Saharan African cities: pathways to equitable healthy cities study protocol for accra, Ghana. BMJ Open. 1–10. https://doi.org/10.1136/bmjopen-2019-035798 (2020).
World Bank. 2014 Land Cover Classification of Accra, Ghana. (2014).
U.S Geological Survey. Landsat products. https://www.usgs.gov/core-science-systems/nli/landsat
Lopez Garcia, A. et al. A cloud-based framework for machine learning workloads and applications. IEEE Access. 8, 18681–18692 (2020).
ISO 12913-2. 2018 Acoustics - Soundscape - Part 2: Data collection and reporting requirements.
Price, R. & Hallas, M. IN41A-02 Mapping every building and road in sub-Saharan Africa. in AGU Fall Meeting (2019).
Barrington-Leigh, C. & Millard-Ball, A. The world’s user-generated road map is more than 80% complete. PLoS One. 12, e0180698 (2017).
Verdin, K. L. Digital Elevation Model (DEM) from the Hydrologic Derivatives for Modeling and Analysis (HDMA) database -- Africa. (2017). https://www.sciencebase.gov/catalog/item/591f6d02e4b0ac16dbdde1c7
Agent, K. R. & Zegeer, C. V. Propagation of Traffic Noise. (1981).
Ghinet, S. et al. Atmospheric propagation of aircraft acoustic signature from high altitude. INTER-NOISE 2019 MADRID – 48th Int. Congr. Exhib. Noise Control Eng. (2019).
Corbisier, C. Living with noise. U S Department Transportation (2003). https://www.fhwa.dot.gov/publications/publicroads/03jul/06.cfm
Aguilera, I. et al. Application of land use regression modelling to assess the Spatial distribution of road traffic noise in three European cities. J. Expo Sci. Environ. Epidemiol. 25, 97–105 (2015).
Ragettli, M. S. et al. Statistical modeling of the Spatial variability of environmental noise levels in montreal, canada, using noise measurements and land use characteristics. J. Expo Sci. Environ. Epidemiol. 26, 597–605 (2016).
Lee, M. et al. Land use regression modelling of air pollution in high density high rise cities: A case study in Hong Kong. Sci Total Environ 592, (2017).
Hoek, G. et al. A review of land-use regression models to assess Spatial variation of outdoor air pollution. Atmos. Environ. 42, 7561–7578 (2008).
Liu, Y. et al. Comparison of land use regression and random forests models on estimating noise levels in five Canadian cities. Environ. Pollut. 256, 113367 (2020).
Harouvi, O., Ben-Elia, E., Factor, R., de Hoogh, K. & Kloog, I. Noise Estimation model development using high-resolution transportation and land use regression. J. Expo Sci. Environ. Epidemiol. 28, 559–567 (2018).
Wang, V. S. et al. Temporal and Spatial variations in road traffic noise for different frequency components in metropolitan taichung, Taiwan. Environ. Pollut. 219, 174–181 (2016).
Walker, E. D. et al. Spatial and Temporal determinants of A-weighted and frequency specific sound levels—An elastic net approach. Environ. Res. 159, 491–499 (2017).
Biau, G. & Scornet, E. A random forest guided tour. Test 25, 197–227 (2016).
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
OECD. Handbook on constructing composite indicators: Methodology and user guide. (2008).
Acknowledgements
This work was supported by the Pathways to Equitable Healthy Cities grant from the Wellcome Trust [209376/Z/17/Z]. For the purpose of open access, the authors have applied a CC BY public copyright license to any Author Accepted Manuscript version arising from this submission. SC was supported by a Canadian Institutes for Health Research (CIHR) PhD scholarship as well as an Imperial College Presidents PhD scholarship. RA, JB and ME are also supported by Climate Change Resilient Equitable Healthy Cities in Africa (CLARITY-Africa) grant (227779/Z/23/Z) from the Wellcome Trust.
Author information
Authors and Affiliations
Contributions
SNC, MT, and MB conceptualised the manuscript. SNC, ME, JB, and MB developed the methodology and SNC conducted the analysis, and visualisations. AH, SAM, and GO provided resources and ME and SNC funding acquisition. Data was curated by SNC, RA, RN, ASA, JN, JMB, SB, and AH. Supervision was provided by RA, ME, JB, MT, and MB. SNC wrote the manuscript and all authors reviewed and provided edits.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Clark, S.N., Arku, R.E., Ezzati, M. et al. Moving beyond the noise: geospatial modelling of urban sound environments in a sub-Saharan African city. Sci Rep 15, 21403 (2025). https://doi.org/10.1038/s41598-025-06537-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-025-06537-1