Abstract
The study of bird vocal behavior provides insights into ecology, evolution, and conservation. Prior research has mostly focused on bird songs, with limited attention to calling behavior. Although the origins and functions of these vocalizations differ, no detailed annotated acoustic datasets are currently available including both bird songs and calls. Here, we present an acoustic dataset with standardized recordings and detailed annotations of singing and calling behavior in the Dupont’s lark (Chersophilus duponti), a passerine with spatially variable vocal behavior. Recordings were collected across a large spatial scale (20 populations) to capture geographic variation in vocalizations. The dataset includes 4,297 annotated songs from 191 singing males, representing 401 song types, and 795 annotated calls from 97 calling males, representing 80 call types. Annotations provide precise categorization of song and call types, enabling comparisons of individual and population vocal repertoires, geographic variation, and potential effects of habitat fragmentation on vocal behavior. This dataset is intended for reuse in studies of bird vocal behavior, comparative analyses of song and call patterns, and research on passerine acoustic communication.
Similar content being viewed by others
Background & Summary
Given the multiple implications, bird vocalizations have attracted the interest of scientists and ornithologists since the late 19th and early 20th centuries1,2. Among other aspects, the study of avian vocal behavior provide valuable insights into: (i) individuals’ reproductive status, including its role in mate attraction and courtship signaling3,4, (ii) physical condition and individual identity5,6, (iii) the processes driving local adaptation and population differentiation7,8, (iv) group cohesion and social structure, as seen in contact calls used by flocking species9, (v) the provision of parental care, including nestling begging calls10, (vi) the location and sharing of food resources, such as food calls in corvids or cooperative breeders11, and (vii) alarm and predator-related communication, including threat, mobbing or distress calls aimed at deterring predators or warning conspecifics12. Despite this interest from disciplines as ecology, evolution, behavior, and neurobiology, much remains to be discovered regarding the origin, evolution, and function of bird vocalizations13. Indeed, the evolution and function of bird vocalizations are context dependent, varying among species, populations, and studies. This variation among research is partly explained by the wide diversity of vocalizations produced by birds, which include songs, territorial, warning, group, distress, and begging calls, among others14. Two of the commonest vocalizations produced by birds, and thus the most studied, are songs and territorial calls (hereafter “calls”).
Songs are typically complex and, in many bird species, learned from tutors. In males, and in some species also in females, songs are often linked to several aspects of sexual selection, including mate attraction, intra- and inter-sexual competition, and communication within and between pairs, including duetting species. In contrast, calls, which can also be produced by both sexes, are commonly described as innate and tend to be shorter than songs, and often associated with territorial defense or immediate communicative purposes14,15,16. It is worth noting that the distinction between what is a song and a call is not always clear may vary among species17. Comparative studies of both vocalization types offer valuable insights into both the ecology and behavior of the species and the origin and function of songs and calls. For example, analysing diel and monthly variation in songs and calls for the same species can clarify the distinct functions of each vocalization type18,19. Similarly, seasonal changes in vocal output may also provide important information about the breeding period and migration timing of birds20,21 and behavioral responses to environmental changes, such as reactions to intruders or predators22. Consequently, although still rare in the literature, comparative studies of both songs and calls recorded in the field hold potential to deepen our understanding of avian vocal behavior and spatial organisation, laying potential for interpreting population dynamics in changing ecological contexts.
In addition to behavioral and ecological applications, in recent decades, the study of bird vocalizations has also become an important tool in conservation biology23. Vocal learning occurs on many bird species, including passerines, parrots and hummingbirds24, which facilitates the development of regional dialects (i.e., distinct spatial vocalization patterns) that offer crucial insights into the adult dispersal, gene flow and connectivity between populations25,26. Likewise, analysing the variations in the degree of vocalization sharing among groups of neighbouring and non-neighbouring individuals can reveal patterns of social interactions and habitat fragmentation, as disruptions in vocal exchange often indicate barriers to adult movement and increased isolation among groups and populations27,28. In this sense, acoustic parameters such as the diversity of vocalizations within and among individuals and the vocal activity rate serve as indicators of population status and viability29,30, as well as reproductive status, with higher complexity and frequency of vocalizations often linked to active breeding and better condition of individuals3,4. Thus, by studying birds’ vocal behavior, researchers can non-invasively detect early-warning signs of environmental change, assess population viability and reproductive success, and guide conservation monitoring and management efforts23.
In recent years, several large-scale collaborative initiatives have substantially increased the availability of avian acoustic data, including publicly accessible repositories such as Xeno-Canto (www.xeno-canto.org), Avibase31, and the Macaulay Library (www.macaulaylibrary.org). These platforms have facilitated global access to thousands of crowd-sourced recordings, enabling research on geographic variation in bird vocalizations32,33, global analyses on bird song frequency34, and other large-scale ecological patterns35. In addition to these repositories, numerous annotated acoustic datasets have emerged as valuable resources for studying avian vocal behavior. Such datasets are typically generated within the context of regional monitoring programs, species-specific research projects, or conservation initiatives. Therefore, most of these datasets remain constrained in geographic and taxonomic coverage and often rely on opportunistic sampling, limiting their suitability for broader ecological applications13,36,37. A notable exception in terms of scale is the recently published World Annotated Bird Acoustic Database (WABAD), which contains fine-scale annotations of over 1000 bird species from 27 countries and distributed across 13 biomes38. While WABAD represents a major step forward in the curation of global acoustic datasets, its annotations do not explicitly distinguish the different vocalization types detected (e.g., song, territorial call, contact call, alarm call), thereby limiting its applicability to fine-scale behavioral studies. In parallel, the development of smartphone applications powered by deep learning algorithms—most prominently BirdNET39 and Merlin40—has greatly expanded ornithological citizen science participation worldwide. These mobile applications have generated tens of millions of georeferenced acoustic detections that have been used to examine song dialect boundaries, migratory movements, and seasonal vocal activity at continental scales41. Nonetheless, these data also lack standardised sampling protocols and detailed annotations distinguishing vocalization types, which reduces their utility for behavioral and ecological analyses requiring precise vocalization-type categorisation.
To address the lack of acoustic datasets with detailed annotations of different song and call types, we present a fully curated and annotated acoustic dataset on the vocal behavior of the Dupont’s lark (Chersophilus duponti) covering most of the species’ distribution area in Europe. We selected the Dupont’s lark as a target species because it has been the subject of extensive acoustic research19,28,29,42,43,44,45, providing a strong background for further comparisons. Moreover, the Dupont’s lark utters songs and calls at a high rate and exhibits extraordinary spatial variability among populations, often including distinct dialects among groups of males inhabiting the same habitat patch28, which makes the Dupont’s lark an ideal model for investigating differences in song and call usage and assessing the effects of habitat fragmentation on birds’ vocal behavior. To enhance reproducibility and facilitate further use of the dataset, we: (i) followed a standardized field protocol ensuring the complete recording of individuals’ song and call repertoires, and (ii) applied a strict annotation protocol categorizing every recorded song and call recorded at the type level. Additionally, our dataset is accompanied by two metadata files. One contains vocalization level information, including the onset and offset times of each vocalization within the recording, minimum and maximum frequency, vocalization type labels, and the ID of the recorded individual. The other metadata file provides detailed information for each recorded male (date and population where recorded, geographic coordinates) and for each surveyed population, enabling further analyses relating individual or population vocal behavior to habitat metrics. We hope that our dataset—currently the largest publicly available acoustic dataset with detailed annotations of both song and call types for a bird species—will significantly contribute to advancing research on avian ecology, vocal behavior, and conservation.
Methods
The Dupont’s lark (Chersophilus duponti) is a resident, territorial, and socially monogamous passerine, although extra-pair copulations may occur46. The species is distributed through North Africa and Spain, the only European country where the species occurs. It is a diurnal species with high vocal activity during the hour before sunrise. The Dupont’s lark uses many vocalization types, mainly songs and territorial calls, which have been extensively studied28,42,43,45, but its repertoire also includes distress calls, warning calls, and clucking calls, which are more rarely uttered42,45. Songs and territorial calls (calls hereafter), the two vocalization types considered in this study, are only uttered by males and both uttered at high output during the dawn chorus of the species19. Songs, which are learned after summer dispersal from nearby adult males29, typically includes 3–8 discrete song types, which are largely shared and repeated in the same order by neighbouring males29. A song type consists of a small number of song units (usually 2–4) occurring together in a specific sequence (Fig. 1). The territorial call of the Dupont’s lark consists of discrete, short whistles — typically between one and three — that are regularly repeated in the same order29 (Fig. 1). Territorial calls have been proposed to be learned before dispersal after a short learning phase in summer47.
Sonograms showing the songs (A,B) and calls (C,D) of four Dupont’s larks. Boxes indicate the different song and call types identified within each individual. The classification of songs and calls was based on spectrogram morphology, temporal patterns, frequency characteristics, and number of notes. For example, song type S1 and call type C1 each appear twice in sonograms A and C, respectively. The population where each sonogram was recorded is indicated in brackets after the corresponding capital letter, according to the ID column in Table 1.
Study area
The study was carried out from March to June (main breeding period of the Dupont’s lark48) during 2024 and 2025, covering 20 steppe remnants occupied by the species in Spain (Fig. 2). The study areas (hereinafter referred to as “populations”) were patches of shrub-steppe vegetation, with some scattered trees embedded within a matrix dominated by cereal crops. We selected 20 populations that were well distributed across the European range of the species and differed in patch size (ha, estimated as the total amount of potential vegetation within that area), Dupont’s lark abundance estimated through line transects, and isolation. Two populations were considered distinct when separated by more than 1 km of non-natural vegetation. Populations are considered here to represent spatially discrete areas, although they may not be completely genetically isolated. The patch size of the populations ranged from 80 to 1,903 ha, and the local population abundances, estimated through the line transect method49 varied between 3 and 77 males (Table 1).
Distribution range of the Dupont’s lark in Spain (grey circles). The green circles identify the 20 Dupont’s lark populations included in this study. Numbers correspond to the population id as shown in Table 1.
Bird recording
Each population was surveyed at least once per year during the hour before sunrise, when Dupont’s lark vocal activity reached its peak19. Males were recorded within their territories from the closest possible distance (ranging from 10–70 m) without disturbing their natural vocal behavior. Recordings were made via portable digital flash-memory recorders — a Marantz PMD 661 or a TASCAM DR40 —both equipped with a Sennheiser ME 67 shotgun microphone with a K6 powering unit and wind protection. The frequency response of these microphones is largely flat in the range 20–20,000 Hz. Both recorders were programmed to record in.wav format with a sampling rate of 44.1 kHz and 24-bit resolution. Recordings were obtained only under favourable weather conditions (i.e., no rain and windless days). The location of each recorded male was determined via a GPS device (3–5 m accuracy). Although birds were not individually marked, all recordings of a given male, as well as those males within a 500 m radius, were obtained on the same day to minimise the risk of recording the same individuals on successive days.
Following assessments carried out in prior studies, singing males were recorded for at least two minutes, which has been proven to be sufficient to ensure that the complete individual song repertoire is captured27. Nonetheless, four of the 191 singing males (2.1% of the total) were recorded for less than two minutes, with a mean recording length of 94 seconds. With respect to calling behavior, we recorded a minimum of three calls per individual, which is sufficient to capture the individual’s entire call repertoire42. In total, we recorded songs from 191 males and calls from 97 males. Due to low-light conditions under which recordings were obtained and shy behaviour of the species, it was not possible to reliably determine whether song and call recordings may correspond to the same individual. Therefore, individuals could not be matched between the song and call datasets. A summary table showing the number of individuals recorded singing and calling in each population, as well as summary statistics on the species’ vocal behavior of the species at the population level, is provided in Table 1.
Audio annotation protocol
For data standardisation we developed a detailed annotation protocol. The audio annotation process consisted of opening an audio recording in Raven Pro 1.650, a commercial software commonly used by the scientific community working with wildlife vocalizations. The annotations were made on the spectral view of Raven Pro 1.6, using the default configuration parameters (Window type = Hann, DFT size = 512 samples, brightness = 50, contrast = 50). The display axes were kept constant, with the Y-axis extending to 10 kHz and the X-axis showing approximately 16 s per screen. Nonetheless, the annotator was free to adjust the parameters of the spectral view, especially brightness and contrast, to aid in identifying the target vocalizations.
The audio annotation process was carried out by a single expert ornithologist (C.D.A-M), familiar with the Dupont’s lark vocal behavior, and consisted of visually and aurally checking each audio file in Raven Pro 1.6. The annotator marked with a box the portion of the file where a Dupont’s lark vocalization, song or call, occurred. The box was narrowed to include every Dupont’s lark vocalization in the recording and mark the exact starting and ending time, as well as the maximum and minimum frequency of each vocalization. We did not apply any quality criteria (e.g., considering only vocalizations of high signal-to-noise ratios or high amplitudes) to annotate Dupont’s lark vocalizations in the dataset; the only criterion was that the expert was able to identify the vocalizations in the recording and to ensure that the recorded vocalizations belonged to the target individual (i.e., background vocalizations of neighbouring males were discarded). In total, we identified 401 different song types from 4,297 annotated songs from 191 singing males recorded, and identified 80 different call types from 795 annotated calls recorded from 97 calling males.
Once the annotation of the entire audio file was completed, the annotations were exported as a.txt file with the default Raven Pro format and with the exact same name as the audio file. However, to allow users to open the annotations via a free audio software we also saved the annotations as a.txt file compatible with the Audacity software (https://audacityteam.org/). Both software programs allow users to create, open, and modify annotations of audio files.
Vocalization identification
To characterize the presence of different song types in the audio files we conducted a detailed visual and aural inspection of the sonograms. To ensure consistent classification, a single observer (C.D.A-M) assessed all the audio recordings blindly. Each song type detected was coded with an ordinal number (e.g., S1, S2, S3; Fig. 1). When we found a song type that had been previously catalogued, we labelled it accordingly, and if it was a new song type it was categorized with a new label and added to the catalogue. Following prior research, the classification of song types was based on spectrogram morphology, temporal patterns, frequency characteristics, and number of notes42,43. In summary, two song types were considered the same if they matched in at least 75% of their notes and showed high similarity in note shape, timing, and frequency (Fig. 1). The same methodological procedure was applied to characterize the presence of different call types in audio recordings.
Data Record
The dataset presented in this study is open access and accessible through Zenodo51. Within the Zenodo repository, there are two ZIP folders that follow the following structure:
/Song.zip/
└── DL_S1/
├── DL_S1.wav
└── DL_S1_Raven.txt
└── DL_S1_Audacity.txt
└── DL_S2/
├── DL_S2.wav
├── DL_S2_Raven.txt
├── DL_S1_Audacity.txt
└── --------------------
/Call.zip/
└── DL_C1/
├── DL_C1.wav
├── DL_C1_Raven.wav
└── DL_C1_Audacity.txt
└── --------------------
Song folder
This folder contains 191 subfolders identifying the Dupont’s lark males recorded singing during the study. Each folder is identified by its ID name, which is composed of the code “DL” identifying the species (Dupont’s Lark), the code “S” identifying the vocalization type (Song in that case) and a numeric value identifying each individual (for example, “DL_S1”). Inside the folder of each individual we can find their respective recording (.wav format) files and annotations (.txt files). The annotations are provided in two formats, Raven Pro and Audacity (see Data annotation section). Raven Pro annotations contain eight columns with the following information:
-
Selection: Selection ID
-
View: Whether the information comes from the spectrogram or from the waveform
-
Channel: Channel of origin of the recordings
-
Begin Time (s): Start second of the selection
-
End Time (s): End second of the selection
-
Low Freq (Hz): Lowest frequency of the selection
-
High Freq (Hz): Highest frequency of the selection
-
Song (or call) type: Song (or call) type
Each annotation in Audacity is in two consecutive columns: the first column refers (following the prior nomenclature) to the Begin Time (in s), the End Time (in s), and the song (or call) type, and the next row, which starts with a “\”, is the Low Freq (in Hz) and the High Freq (Hz).
Call folder
This folder follows the same structure and subfolders as those explained above, but contains in that case the calls of the 97 Dupont’s lark recorded calling during the study and using the code “C” (call) to identify the vocalization type (for example, “DL_C1”).
The dataset also includes six additional files to support further analyses: (1) a metadata file (“Metadata.csv”) containing detailed information for each recorded individual, including their ID, date of recording, approximate latitude and longitude (geographic coordinates in decimal degrees; rounded to within ±1 km owing to the threatened status of the species) of recording, population where recorded, and recorder (“Tascam” or “Marantz”) employed; (2) two binary presence-absence matrices (“Song_binary_matrix.csv” and “Call_binary_matrix.csv”) indicating the presence or absence of each song or call type identified; (3) two distance matrices (“Song_distance_matrix.csv” and “Call_distance_matrix.csv”) representing the spatial distances (in m) between all pairs of males and their Acoustic (Jacard) similarity indexes, and 4) an R script (“Technical_validation.R”) detailing the steps implemented to perform the technical validation.
Technical Validation
To ensure the technical quality and reliability of the annotated acoustic dataset, multiple quality-control procedures were implemented throughout all stages, from field recording to annotation, classification, and data processing.
Quality of field recordings
All the recordings were obtained: (i) under adequate weather conditions, (ii) using the same high-quality directional microphone, and iii) with a reduced distance to the bird, which guarantees audio clarity and minimises external noise. Moreover, we followed a strict protocol regarding the minimum recording length for each male and minimized the risk of recording the same individual between successive visits by avoiding revisiting the same territory.
Metadata standardization and file naming consistency
Each audio recording included in the dataset is accompanied by a complete metadata record including rounded geographic coordinates of the recorded male, date and hour of recording, and recording equipment used, among others. This detailed metadata ensures the traceability of each recording and facilitates its further integration into larger-scale ecological and bioacoustic studies. Moreover, to prevent mismatches between recordings and their annotation files, a strict naming convention was enforced to ensure that both files share the same nomenclature. This procedure required renaming the original audio and annotation files following the structure described in the Data record section. Moreover, the annotation files are provided in the format of two of the most common audio software programs (Raven Pro and Audacity), which ensures data traceability, interoperability and facilitates its further use by researchers via different analytical tools.
Consistency and agreement in audio annotation
Audio annotation was performed using professional audio software and by a single expert ornithologist (C.D.A-M) who was highly familiar with the Dupont’s lark vocalizations, following a standardized audio annotation protocol (see the Audio annotation protocol). The decision to have a single primary annotator ensured consistency in the classification process. Nonetheless, all cases in which i) the classification of a song or call type was uncertain, and ii) that a song or call type was annotated to be shared among populations, were reviewed by at least one additional experienced ornithologist to ensure its classification. This verification step involved independent inspection of the relevant spectrograms and audio files by up to two additional experts. Only when agreement was consistently agreed upon by at least two researchers was the vocalization type confirmed. This cross-checking procedure provides an additional safeguard against misclassification, ensuring that the final catalogue of song and call types is reliable and robust. Moreover, after this classification, the dataset underwent a thorough quality-control process to verify that the same vocalization type was never duplicated under different labels, and to confirm that all the annotation files correctly matched to their respective audio recordings.
Repertoire metrics and song sharing analysis
As the dataset was primarily designed for ecological and behavioral studies, we tested its applicability for ecological purposes by evaluating the impact of spatial distance among Dupont’s lark males on the acoustic similarity of songs and calls.
To do this, first, we constructed binary (presence/absence) matrices for each individual, representing the occurrence of each distinct song or call type. From these matrices, we calculated the acoustic similarity among each pair of males, estimated via the Jaccard similarity index. The Jaccard similarity index, which ranges from 0 (no shared elements) to 1 (identical elements), is a metric that quantifies similarity between two sets based on the proportion of shared elements (i.e., vocalizations) relative to the total number of elements present in either set52. Using the real coordinates (not rounded) of each individual we calculated the Euclidean distance (in meters) for each pair of males. For each vocalization type, the comparisons were made across three distance categories: (i) neighbouring males (<500 m), (ii) males inhabiting the same habitat patch but excluding pairs of males in the first group (>500 m), and (iii) those occupying different habitat patches.
The results revealed a clear pattern of decreasing acoustic similarity with increasing distance for both vocalization types, with values close to zero for males inhabiting different populations (Fig. 3). Moreover, for the same spatial categories, calls consistently presented greater acoustic similarity than songs did. Overall, the average acoustic similarity index for males within 500 m was approximately 0.45 for songs and 0.60 for calls, whereas the acoustic similarity index decreased to nearly zero for males separated by more than 500 m within the same population but remained at ~0.25 for calls within the same population.
Acoustic similarity (Jaccard) index of Dupont’s lark songs and calls among males across three spatial categories: neighbouring males (<500 m), males from the same population at >500 m, and males inhabiting different populations.
External validation with prior research
The quantitative measures derived from the dataset are consistent with previously published studies on Dupont’s lark vocal behavior. In particular, the mean individual call repertoire size of 2.46 agrees with prior research on the species (range 1.88 – 2.12)44. Likewise, the patterns of song sharing — including greater sharing among neighbouring males and similarity close to zero for males of different populations — match the species’ known spatial structuring of vocal repertoires28,42. This agreement with established findings provides additional evidence for the reliability and biological validity of the dataset.
Usage Notes
This section provides technical guidance to assist researchers who wish to reuse the dataset. The audio annotations are provided in plain text format compatible with both Raven Pro and Audacity software but can also be converted to CSV files or other readable formats for further processing. We recommend using programming languages such as R to efficiently read, manipulate, and analyze these data tables. An example script for loading and preparing the data for ecological analyses is included in the Code Availability section.
Regarding the further use of the acoustic dataset, given the spatial variability in song and call types among, and even within, populations, we recommend focusing on parameters such as song and call diversity (i.g., repertoire size) and song rate (i.e., vocal activity uttered per unit of time) for within- and among-population comparisons. In contrast, analyses relying on comparison of acoustic features, such as frequency parameters, should be avoided because of the high spatial variability (i.e. dialects) among populations, which limits meaningful comparisons of acoustic features.
Data availability
The dataset presented in this study is open access and accessible through Zenodo (https://doi.org/10.5281/zenodo.17264916). Detailed description regarding the files in the Zenodo repository can be read in the “Data Record” section of this article.
Code availability
The code used to reproduce the analyses in the Technical Validation section is open access and accessible on Zenodo51 with the name “Technical_validation.R”.
References
Wright, H. W. Morning awakening and even-song. Auk 29, 307–327 (1912).
Allen, F. H. More notes on the morning awakening. Auk 30, 229–235 (1913).
Amrhein, V., Körner, P. & Naguib, M. Nocturnal and diurnal singing activity in the nightingale: correlations with mating status and breeding cycle. Anim. Behav. 64, 939–944, https://doi.org/10.1006/anbe.2002.1974 (2002).
Upham-Mills, E. J., Reimer, J. R., Haché, S., Lele, S. R. & Bayne, E. M. Can singing rate be used to predict male breeding status of forest songbirds? A comparison of three calibration models. Ecosphere 11, e03005, https://doi.org/10.1002/ecs2.3005 (2020).
Gil, D. & Gahr, M. The honesty of bird song: multiple constraints for multiple traits. Trends Ecol. Evol. 17, 133–141, https://doi.org/10.1016/S0169-5347(02)02410-2 (2002).
Knight, E. et al. Individual identification in acoustic recordings. Trends Ecol. Evol. 39, 947–960, https://doi.org/10.1016/j.tree.2024.05.007 (2024).
Grant, B. R. & Grant, P. R. Cultural inheritance of song and its role in the evolution of Darwin’s finches. Evolution 50, 2471–2487, https://doi.org/10.1111/j.1558-5646.1996.tb03633.x (1996).
Sebastián-González, E. & Pérez-Granados, C. Geographic variation in acoustic signals in wildlife: a systematic review. J. Biogeogr. 52, e15116, https://doi.org/10.1111/jbi.15116 (2025).
Kondo, N. & Watanabe, S. Contact calls: information and social function. Jpn. Psychol. Res. 51, 197–208, https://doi.org/10.1111/j.1468-5884.2009.00399.x (2009).
Leonard, M. L. & Horn, A. G. Begging calls and parental feeding decisions in tree swallows (Tachycineta bicolor). Behav. Ecol. Sociobiol. 49, 170–175, https://doi.org/10.1007/s002650000290 (2001).
De Kort, S. R., Emery, N. J. & Clayton, N. S. Food sharing in jackdaws, Corvus monedula: what, why and with whom? Anim. Behav. 72, 297–304, https://doi.org/10.1016/j.anbehav.2005.10.016 (2006).
Laiolo, P., Vögeli, M., Serrano, D. & Tella, J. L. Two new calls from the Dupont’s Lark Chersophilus duponti and imitations by other Alaudids: ecological and monitoring implications. Ardeola 52, 167–172 (2005).
Recalde, N. M. et al. A densely sampled and richly annotated acoustic data set from a wild bird population. Anim. Behav. 211, 111–122, https://doi.org/10.1016/j.anbehav.2024.02.008 (2024).
Catchpole, C. K. & Slater, P. J. Bird song: biological themes and variations, 2nd edn (Cambridge Univ. Press, 2008).
Garamszegi, L. Z., Pavlova, D. Z., Eens, M. & Møller, A. P. The evolution of song in female birds in Europe. Behav. Ecol. 18, 86–96, https://doi.org/10.1093/beheco/arl047 (2007).
Amy, M., Salvin, P. & Leboucher, G. The functions of female calls in birds. In Advances in the Study of Behavior 50, 243–271, https://doi.org/10.1016/bs.asb.2017.11.002 (Academic Press, 2018)
Rose, E. M., Prior, N. H. & Ball, G. F. The singing question: Re‐conceptualizing birdsong. Biol Rev 97, 326–342, https://doi.org/10.1111/brv.12800 (2022).
LaZerte, S. E., Otter, K. A. & Slabbekoorn, H. Mountain chickadees adjust songs, calls and chorus composition with increasing ambient and experimental anthropogenic noise. Urban Ecosyst. 20, 989–1000, https://doi.org/10.1007/s11252-017-0652-7 (2017).
Pérez-Granados, C., Osiejuk, T. S. & López-Iborra, G. M. Dawn chorus interpretation differs when using songs or calls: the Dupont’s Lark Chersophilus duponti case. PeerJ 6, e5241, https://doi.org/10.7717/peerj.5241 (2018).
Slagsvold, T. Bird song activity in relation to breeding cycle, spring weather, and environmental phenology. Ornis Scand. 8, 197–222 (1977).
Sanders, C. E. & Mennill, D. J. Acoustic monitoring of nocturnally migrating birds accurately assesses the timing and magnitude of migration through the Great Lakes. Condor: Ornithol. Appl. 116, 371–383, https://doi.org/10.1650/CONDOR-13-098.1 (2014).
Hettena, A. M., Muñoz, N. & Blumstein, D. T. Prey responses to predator’s sounds: a review and empirical study. Ethology 120, 427–452, https://doi.org/10.1111/eth.12219 (2014).
Laiolo, P. The emerging significance of bioacoustics in animal species conservation. Biol. Conserv. 143, 1635–1645, https://doi.org/10.1016/j.biocon.2010.03.025 (2010).
Kroodsma, D. E. Learning and the ontogeny of sound signals in birds. In Acoustic Communication in Birds (eds Kroodsma, D. E., Miller, E. H. & Ouellet, H.) 1–23 (Academic Press, 1982).
Baker, M. C. & Cunningham, M. A. The biology of bird-song dialects. Behav. Brain Sci. 8, 85–133 (1985).
Salinas-Melgoza, A. & Wright, T. F. Evidence for vocal learning and limited dispersal as dual mechanisms for dialect maintenance in a parrot. PLoS ONE 7, e48667, https://doi.org/10.1371/journal.pone.0048667 (2012).
Brown, E. D. & Farabaugh, S. M. What birds with complex social relationships can tell us about vocal learning: vocal sharing in avian groups. In Social Influences on Vocal Development (eds Snowdon, C. T. & Hausberger, M.) 98–127, https://doi.org/10.1017/CBO9780511758843.007 (Cambridge Univ. Press, 1997).
Pérez-Granados, C., Osiejuk, T. & López-Iborra, G. M. Habitat fragmentation effects and variations in repertoire size and degree of song sharing among close Dupont’s Lark Chersophilus duponti populations. J. Ornithol. 157, 471–482, https://doi.org/10.1007/s10336-015-1310-6 (2016).
Laiolo, P., Vögeli, M., Serrano, D. & Tella, J. L. Song diversity predicts the viability of fragmented bird populations. PLoS ONE 3, e1822, https://doi.org/10.1371/journal.pone.0001822 (2008).
Pérez-Granados, C. et al. Vocal activity rate index: a useful method to infer terrestrial bird abundance with acoustic monitoring. Ibis 161, 901–907, https://doi.org/10.1111/ibi.12728 (2019).
Lepage, D. Avibase — The World Bird Database https://avibase.bsc-eoc.org/checklist.jsp (2021).
Otter, K. A., Mckenna, A., LaZerte, S. E. & Ramsay, S. M. Continent-wide shifts in song dialects of white-throated sparrows. Curr. Biol. 30, 3231–3235.e3, https://doi.org/10.1016/j.cub.2020.05.084 (2020).
Calviño-Cancela, M., Piña, L. & Martín-Herrero, J. Bioacoustic differentiation of calls in the chiffchaff complex. PeerJ 10, e14261, https://doi.org/10.7717/peerj.14261 (2022).
Mikula, P. et al. A global analysis of song frequency in passerines provides no support for the acoustic adaptation hypothesis but suggests a role for sexual selection. Ecol. Lett. 24, 477–486, https://doi.org/10.1111/ele.13662 (2021).
Silva-Jr, E. F., Diniz, P. & Macedo, R. H. Song varies with latitude, climate, and species richness in a Neotropical bird. Behav. Ecol. 33, 87–100, https://doi.org/10.1093/beheco/arab112 (2022).
Gómez-Gómez, J., Vidaña-Vila, E. & Sevillano, X. Western Mediterranean Wetland Birds dataset: A new annotated dataset for acoustic bird species classification. Ecol. Inform. 75, 102014, https://doi.org/10.1016/j.ecoinf.2023.102014 (2023).
Jamil, N., Norali, A. N., Ramli, M. I., Shah, A. K. M. K. & Mamat, I. SiulMalaya: an annotated bird audio dataset of Malaysia lowland forest birds for passive acoustic monitoring. Bull. Electr. Eng. Inform. 12, 2269–2281, https://doi.org/10.11591/eei.v12i4.5222 (2023).
Pérez-Granados, C. et al. WABAD: A world annotated bird acoustic dataset for passive acoustic monitoring. Ecology, 107, e70317, https://doi.org/10.1002/ecy.70317 (2026).
Kahl, S., Wood, C. M., Eibl, M. & Klinck, H. BirdNET: A deep learning solution for avian diversity monitoring. Ecol. Inform. 61, 101236, https://doi.org/10.1016/j.ecoinf.2021.101236 (2021).
Cornell Lab of Ornithology. Merlin Bird ID, version 3.7.1. Cornell Lab of Ornithology, Ithaca, NY, USA https://merlin.allaboutbirds.org/ (2025).
Wood, C. M., Kahl, S., Rahaman, A. & Klinck, H. The machine learning–powered BirdNET App reduces barriers to global bird research by enabling citizen science participation. PLoS Biol. 20, e3001670, https://doi.org/10.1371/journal.pbio.3001670 (2022).
Laiolo, P. & Tella, J. L. Habitat fragmentation affects culture transmission: patterns of song matching in Dupont’s lark. J. Appl. Ecol. 42, 1183–1193, https://doi.org/10.1111/j.1365-2664.2005.01093.x (2005).
Laiolo, P. & Tella, J. L. Landscape bioacoustics allow detection of the effects of habitat patchiness on population structure. Ecology 87, 1203–1214, 10.1890/0012-9658(2006)87[1203:LBADOT]2.0.CO;2 (2006).
Gómez-Catasús, J., Barrero, A., Llusia, D., Iglesias-Merchán, C. & Traba, J. Wind farm noise shifts vocalizations of a threatened shrub-steppe passerine. Environ. Pollut. 303, 119144, https://doi.org/10.1016/j.envpol.2022.119144 (2022).
Barrero, A., Gómez-Catasús, J., Pérez-Granados, C., Bustillo-de la Rosa, D. & Traba, J. Conspecific density and habitat quality drive the defence and vocal behaviour of a territorial passerine. Ibis 166, 826–843, https://doi.org/10.1111/ibi.13295 (2024).
Gómez-Catasús, J., Barrero, A., Garza, V. & Traba, J. Alondra ricotí — Chersophilus duponti (Vieillot, 1820). In Enciclopedia Virtual de los Vertebrados Españoles (eds Salvador, A. & Morales, M. B.) Museo Nacional de Ciencias Naturales. http://www.vertebradosibericos.org/ (accessed 10 Aug 2025) (2016).
Laiolo, P., Vögeli, M., Serrano, D. & Tella, J. L. Testing acoustic versus physical marking: two complementary methods for individual‐based monitoring of elusive species. J Avian Biol 38, 672–681, https://doi.org/10.1111/j.2007.0908-8857.04006.x (2007).
Pérez-Granados, C., López-Iborra, G. M., Garza, V. & Traba, J. Breeding biology of the endangered Dupont’s Lark Chersophilus duponti in two separate Spanish shrub-steppes. Bird Study 64, 328–338, https://doi.org/10.1080/00063657.2017.1359232 (2017).
Pérez-Granados, C. & López-Iborra, G. M. Assessment of counting methods used for estimating the number of territorial males in the endangered Dupont’s Lark. Ardeola 64, 75–84, https://doi.org/10.13157/arla.64.1.2017.sc2 (2017).
Cornell Lab of Ornithology. Raven Pro: Interactive Sound Analysis Software, version 1.6.4. Cornell Lab of Ornithology, Ithaca, NY, USA https://www.ravensoundsoftware.com/ (2023)
Pérez-Granados, C., Alonso-Moya, C., Barrero, A., Sáez-Gómez, P. & Traba, J. A large-scale acoustic dataset of a passerine with spatially variable vocal behavior: fine-scale annotations of song and call types. Zenodo. https://doi.org/10.5281/zenodo.17264916 (2025).
Jaccard, P. É. tude comparative de la distribution florale dans une portion des Alpes et des Jura. Bull Soc Vaudoise Sci Nat 37, 547–579 (1901).
Acknowledgements
The results are part of project PID2022-139294NB-I00, funded by the Ministerio de Ciencia e Innovación of Spain (MCIN/AEI/10.13039/501100011033/FEDER/UE). CPG was funded by the Ramón y Cajal 2024 Programme (RYC2024-048830-I) of the Spanish Ministry of Science, Innovation and Universities, funded by MICIU/AEI/10.13039/501100011033 and FSE+.
Author information
Authors and Affiliations
Contributions
C.P.-G. and C.D.A.-M. conceived the study, C.P.-G., C.D.A.-M., A.B. and P.S.-G. conducted the experiments and analysed the results, C.D.A.-M. managed the data collection and annotation, C.P.-G. and J.T. led the project management. All the authors contributed to the data annotation review and manuscript preparation.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Pérez-Granados, C., Alonso-Moya, C.D., Barrero, A. et al. A large-scale acoustic dataset of a passerine with spatially variable vocal behavior: fine-scale annotations of song and call types. Sci Data 13, 770 (2026). https://doi.org/10.1038/s41597-026-07131-4
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41597-026-07131-4





