A large-scale acoustic dataset of a passerine with spatially variable vocal behavior: fine-scale annotations of song and call types

Pérez-Granados, Cristian; Alonso-Moya, Cristina D.; Barrero, Adrián; Sáez-Gómez, Pedro; Bota, Gerard; Lahoz-Monfort, José J.; Osiejuk, Tomasz S.; Rogers, Alex; Schuchmann, Karl-L.; Traba, Juan

doi:10.1038/s41597-026-07131-4

Download PDF

Data Descriptor
Open access
Published: 30 March 2026

A large-scale acoustic dataset of a passerine with spatially variable vocal behavior: fine-scale annotations of song and call types

Cristian Pérez-Granados¹,
Cristina D. Alonso-Moya¹,
Adrián Barrero^2,3,
Pedro Sáez-Gómez^2,4,
Gerard Bota¹,
José J. Lahoz-Monfort⁵,
Tomasz S. Osiejuk⁶,
Alex Rogers⁷,
Karl-L. Schuchmann^8,9,10 &
…
Juan Traba^2,3

Scientific Data volume 13, Article number: 770 (2026) Cite this article

1593 Accesses
1 Altmetric
Metrics details

Abstract

The study of bird vocal behavior provides insights into ecology, evolution, and conservation. Prior research has mostly focused on bird songs, with limited attention to calling behavior. Although the origins and functions of these vocalizations differ, no detailed annotated acoustic datasets are currently available including both bird songs and calls. Here, we present an acoustic dataset with standardized recordings and detailed annotations of singing and calling behavior in the Dupont’s lark (Chersophilus duponti), a passerine with spatially variable vocal behavior. Recordings were collected across a large spatial scale (20 populations) to capture geographic variation in vocalizations. The dataset includes 4,297 annotated songs from 191 singing males, representing 401 song types, and 795 annotated calls from 97 calling males, representing 80 call types. Annotations provide precise categorization of song and call types, enabling comparisons of individual and population vocal repertoires, geographic variation, and potential effects of habitat fragmentation on vocal behavior. This dataset is intended for reuse in studies of bird vocal behavior, comparative analyses of song and call patterns, and research on passerine acoustic communication.

Detection ranges of forest bird vocalisations: guidelines for passive acoustic monitoring

Article Open access 09 January 2024

An Open Dataset for the Acoustic Monitoring of Nocturnal Migratory Birds in Europe

Article Open access 21 April 2026

Global birdsong embeddings enable superior transfer learning for bioacoustic classification

Article Open access 18 December 2023

Background & Summary

Given the multiple implications, bird vocalizations have attracted the interest of scientists and ornithologists since the late 19^th and early 20^th centuries^1,2. Among other aspects, the study of avian vocal behavior provide valuable insights into: (i) individuals’ reproductive status, including its role in mate attraction and courtship signaling^3,4, (ii) physical condition and individual identity^5,6, (iii) the processes driving local adaptation and population differentiation^7,8, (iv) group cohesion and social structure, as seen in contact calls used by flocking species⁹, (v) the provision of parental care, including nestling begging calls¹⁰, (vi) the location and sharing of food resources, such as food calls in corvids or cooperative breeders¹¹, and (vii) alarm and predator-related communication, including threat, mobbing or distress calls aimed at deterring predators or warning conspecifics¹². Despite this interest from disciplines as ecology, evolution, behavior, and neurobiology, much remains to be discovered regarding the origin, evolution, and function of bird vocalizations¹³. Indeed, the evolution and function of bird vocalizations are context dependent, varying among species, populations, and studies. This variation among research is partly explained by the wide diversity of vocalizations produced by birds, which include songs, territorial, warning, group, distress, and begging calls, among others¹⁴. Two of the commonest vocalizations produced by birds, and thus the most studied, are songs and territorial calls (hereafter “calls”).

Songs are typically complex and, in many bird species, learned from tutors. In males, and in some species also in females, songs are often linked to several aspects of sexual selection, including mate attraction, intra- and inter-sexual competition, and communication within and between pairs, including duetting species. In contrast, calls, which can also be produced by both sexes, are commonly described as innate and tend to be shorter than songs, and often associated with territorial defense or immediate communicative purposes^14,15,16. It is worth noting that the distinction between what is a song and a call is not always clear may vary among species¹⁷. Comparative studies of both vocalization types offer valuable insights into both the ecology and behavior of the species and the origin and function of songs and calls. For example, analysing diel and monthly variation in songs and calls for the same species can clarify the distinct functions of each vocalization type^18,19. Similarly, seasonal changes in vocal output may also provide important information about the breeding period and migration timing of birds^20,21 and behavioral responses to environmental changes, such as reactions to intruders or predators²². Consequently, although still rare in the literature, comparative studies of both songs and calls recorded in the field hold potential to deepen our understanding of avian vocal behavior and spatial organisation, laying potential for interpreting population dynamics in changing ecological contexts.

In addition to behavioral and ecological applications, in recent decades, the study of bird vocalizations has also become an important tool in conservation biology²³. Vocal learning occurs on many bird species, including passerines, parrots and hummingbirds²⁴, which facilitates the development of regional dialects (i.e., distinct spatial vocalization patterns) that offer crucial insights into the adult dispersal, gene flow and connectivity between populations^25,26. Likewise, analysing the variations in the degree of vocalization sharing among groups of neighbouring and non-neighbouring individuals can reveal patterns of social interactions and habitat fragmentation, as disruptions in vocal exchange often indicate barriers to adult movement and increased isolation among groups and populations^27,28. In this sense, acoustic parameters such as the diversity of vocalizations within and among individuals and the vocal activity rate serve as indicators of population status and viability^29,30, as well as reproductive status, with higher complexity and frequency of vocalizations often linked to active breeding and better condition of individuals^3,4. Thus, by studying birds’ vocal behavior, researchers can non-invasively detect early-warning signs of environmental change, assess population viability and reproductive success, and guide conservation monitoring and management efforts²³.

In recent years, several large-scale collaborative initiatives have substantially increased the availability of avian acoustic data, including publicly accessible repositories such as Xeno-Canto (www.xeno-canto.org), Avibase³¹, and the Macaulay Library (www.macaulaylibrary.org). These platforms have facilitated global access to thousands of crowd-sourced recordings, enabling research on geographic variation in bird vocalizations^32,33, global analyses on bird song frequency³⁴, and other large-scale ecological patterns³⁵. In addition to these repositories, numerous annotated acoustic datasets have emerged as valuable resources for studying avian vocal behavior. Such datasets are typically generated within the context of regional monitoring programs, species-specific research projects, or conservation initiatives. Therefore, most of these datasets remain constrained in geographic and taxonomic coverage and often rely on opportunistic sampling, limiting their suitability for broader ecological applications^13,36,37. A notable exception in terms of scale is the recently published World Annotated Bird Acoustic Database (WABAD), which contains fine-scale annotations of over 1000 bird species from 27 countries and distributed across 13 biomes³⁸. While WABAD represents a major step forward in the curation of global acoustic datasets, its annotations do not explicitly distinguish the different vocalization types detected (e.g., song, territorial call, contact call, alarm call), thereby limiting its applicability to fine-scale behavioral studies. In parallel, the development of smartphone applications powered by deep learning algorithms—most prominently BirdNET³⁹ and Merlin⁴⁰—has greatly expanded ornithological citizen science participation worldwide. These mobile applications have generated tens of millions of georeferenced acoustic detections that have been used to examine song dialect boundaries, migratory movements, and seasonal vocal activity at continental scales⁴¹. Nonetheless, these data also lack standardised sampling protocols and detailed annotations distinguishing vocalization types, which reduces their utility for behavioral and ecological analyses requiring precise vocalization-type categorisation.

To address the lack of acoustic datasets with detailed annotations of different song and call types, we present a fully curated and annotated acoustic dataset on the vocal behavior of the Dupont’s lark (Chersophilus duponti) covering most of the species’ distribution area in Europe. We selected the Dupont’s lark as a target species because it has been the subject of extensive acoustic research^{19,28,29,42,43,44,45}, providing a strong background for further comparisons. Moreover, the Dupont’s lark utters songs and calls at a high rate and exhibits extraordinary spatial variability among populations, often including distinct dialects among groups of males inhabiting the same habitat patch²⁸, which makes the Dupont’s lark an ideal model for investigating differences in song and call usage and assessing the effects of habitat fragmentation on birds’ vocal behavior. To enhance reproducibility and facilitate further use of the dataset, we: (i) followed a standardized field protocol ensuring the complete recording of individuals’ song and call repertoires, and (ii) applied a strict annotation protocol categorizing every recorded song and call recorded at the type level. Additionally, our dataset is accompanied by two metadata files. One contains vocalization level information, including the onset and offset times of each vocalization within the recording, minimum and maximum frequency, vocalization type labels, and the ID of the recorded individual. The other metadata file provides detailed information for each recorded male (date and population where recorded, geographic coordinates) and for each surveyed population, enabling further analyses relating individual or population vocal behavior to habitat metrics. We hope that our dataset—currently the largest publicly available acoustic dataset with detailed annotations of both song and call types for a bird species—will significantly contribute to advancing research on avian ecology, vocal behavior, and conservation.

Methods

The Dupont’s lark (Chersophilus duponti) is a resident, territorial, and socially monogamous passerine, although extra-pair copulations may occur⁴⁶. The species is distributed through North Africa and Spain, the only European country where the species occurs. It is a diurnal species with high vocal activity during the hour before sunrise. The Dupont’s lark uses many vocalization types, mainly songs and territorial calls, which have been extensively studied^28,42,43,45, but its repertoire also includes distress calls, warning calls, and clucking calls, which are more rarely uttered^42,45. Songs and territorial calls (calls hereafter), the two vocalization types considered in this study, are only uttered by males and both uttered at high output during the dawn chorus of the species¹⁹. Songs, which are learned after summer dispersal from nearby adult males²⁹, typically includes 3–8 discrete song types, which are largely shared and repeated in the same order by neighbouring males²⁹. A song type consists of a small number of song units (usually 2–4) occurring together in a specific sequence (Fig. 1). The territorial call of the Dupont’s lark consists of discrete, short whistles — typically between one and three — that are regularly repeated in the same order²⁹ (Fig. 1). Territorial calls have been proposed to be learned before dispersal after a short learning phase in summer⁴⁷.

Study area

The study was carried out from March to June (main breeding period of the Dupont’s lark⁴⁸) during 2024 and 2025, covering 20 steppe remnants occupied by the species in Spain (Fig. 2). The study areas (hereinafter referred to as “populations”) were patches of shrub-steppe vegetation, with some scattered trees embedded within a matrix dominated by cereal crops. We selected 20 populations that were well distributed across the European range of the species and differed in patch size (ha, estimated as the total amount of potential vegetation within that area), Dupont’s lark abundance estimated through line transects, and isolation. Two populations were considered distinct when separated by more than 1 km of non-natural vegetation. Populations are considered here to represent spatially discrete areas, although they may not be completely genetically isolated. The patch size of the populations ranged from 80 to 1,903 ha, and the local population abundances, estimated through the line transect method⁴⁹ varied between 3 and 77 males (Table 1).

Table 1 Summary table indicating the ID and name of each Dupont’s lark population, along with patch size (ha) and population size (number of males counted).

Full size table

Bird recording

Each population was surveyed at least once per year during the hour before sunrise, when Dupont’s lark vocal activity reached its peak¹⁹. Males were recorded within their territories from the closest possible distance (ranging from 10–70 m) without disturbing their natural vocal behavior. Recordings were made via portable digital flash-memory recorders — a Marantz PMD 661 or a TASCAM DR40 —both equipped with a Sennheiser ME 67 shotgun microphone with a K6 powering unit and wind protection. The frequency response of these microphones is largely flat in the range 20–20,000 Hz. Both recorders were programmed to record in.wav format with a sampling rate of 44.1 kHz and 24-bit resolution. Recordings were obtained only under favourable weather conditions (i.e., no rain and windless days). The location of each recorded male was determined via a GPS device (3–5 m accuracy). Although birds were not individually marked, all recordings of a given male, as well as those males within a 500 m radius, were obtained on the same day to minimise the risk of recording the same individuals on successive days.

Following assessments carried out in prior studies, singing males were recorded for at least two minutes, which has been proven to be sufficient to ensure that the complete individual song repertoire is captured²⁷. Nonetheless, four of the 191 singing males (2.1% of the total) were recorded for less than two minutes, with a mean recording length of 94 seconds. With respect to calling behavior, we recorded a minimum of three calls per individual, which is sufficient to capture the individual’s entire call repertoire⁴². In total, we recorded songs from 191 males and calls from 97 males. Due to low-light conditions under which recordings were obtained and shy behaviour of the species, it was not possible to reliably determine whether song and call recordings may correspond to the same individual. Therefore, individuals could not be matched between the song and call datasets. A summary table showing the number of individuals recorded singing and calling in each population, as well as summary statistics on the species’ vocal behavior of the species at the population level, is provided in Table 1.

Audio annotation protocol

For data standardisation we developed a detailed annotation protocol. The audio annotation process consisted of opening an audio recording in Raven Pro 1.6⁵⁰, a commercial software commonly used by the scientific community working with wildlife vocalizations. The annotations were made on the spectral view of Raven Pro 1.6, using the default configuration parameters (Window type = Hann, DFT size = 512 samples, brightness = 50, contrast = 50). The display axes were kept constant, with the Y-axis extending to 10 kHz and the X-axis showing approximately 16 s per screen. Nonetheless, the annotator was free to adjust the parameters of the spectral view, especially brightness and contrast, to aid in identifying the target vocalizations.

The audio annotation process was carried out by a single expert ornithologist (C.D.A-M), familiar with the Dupont’s lark vocal behavior, and consisted of visually and aurally checking each audio file in Raven Pro 1.6. The annotator marked with a box the portion of the file where a Dupont’s lark vocalization, song or call, occurred. The box was narrowed to include every Dupont’s lark vocalization in the recording and mark the exact starting and ending time, as well as the maximum and minimum frequency of each vocalization. We did not apply any quality criteria (e.g., considering only vocalizations of high signal-to-noise ratios or high amplitudes) to annotate Dupont’s lark vocalizations in the dataset; the only criterion was that the expert was able to identify the vocalizations in the recording and to ensure that the recorded vocalizations belonged to the target individual (i.e., background vocalizations of neighbouring males were discarded). In total, we identified 401 different song types from 4,297 annotated songs from 191 singing males recorded, and identified 80 different call types from 795 annotated calls recorded from 97 calling males.

Once the annotation of the entire audio file was completed, the annotations were exported as a.txt file with the default Raven Pro format and with the exact same name as the audio file. However, to allow users to open the annotations via a free audio software we also saved the annotations as a.txt file compatible with the Audacity software (https://audacityteam.org/). Both software programs allow users to create, open, and modify annotations of audio files.

Vocalization identification

To characterize the presence of different song types in the audio files we conducted a detailed visual and aural inspection of the sonograms. To ensure consistent classification, a single observer (C.D.A-M) assessed all the audio recordings blindly. Each song type detected was coded with an ordinal number (e.g., S1, S2, S3; Fig. 1). When we found a song type that had been previously catalogued, we labelled it accordingly, and if it was a new song type it was categorized with a new label and added to the catalogue. Following prior research, the classification of song types was based on spectrogram morphology, temporal patterns, frequency characteristics, and number of notes^42,43. In summary, two song types were considered the same if they matched in at least 75% of their notes and showed high similarity in note shape, timing, and frequency (Fig. 1). The same methodological procedure was applied to characterize the presence of different call types in audio recordings.

Data Record

The dataset presented in this study is open access and accessible through Zenodo⁵¹. Within the Zenodo repository, there are two ZIP folders that follow the following structure:

/Song.zip/

└── DL_S1/

├── DL_S1.wav

└── DL_S1_Raven.txt

└── DL_S1_Audacity.txt

└── DL_S2/

├── DL_S2.wav

├── DL_S2_Raven.txt

├── DL_S1_Audacity.txt

└── --------------------

/Call.zip/

└── DL_C1/

├── DL_C1.wav

├── DL_C1_Raven.wav

└── DL_C1_Audacity.txt

└── --------------------

Song folder

This folder contains 191 subfolders identifying the Dupont’s lark males recorded singing during the study. Each folder is identified by its ID name, which is composed of the code “DL” identifying the species (Dupont’s Lark), the code “S” identifying the vocalization type (Song in that case) and a numeric value identifying each individual (for example, “DL_S1”). Inside the folder of each individual we can find their respective recording (.wav format) files and annotations (.txt files). The annotations are provided in two formats, Raven Pro and Audacity (see Data annotation section). Raven Pro annotations contain eight columns with the following information:

Selection: Selection ID
View: Whether the information comes from the spectrogram or from the waveform
Channel: Channel of origin of the recordings
Begin Time (s): Start second of the selection
End Time (s): End second of the selection
Low Freq (Hz): Lowest frequency of the selection
High Freq (Hz): Highest frequency of the selection
Song (or call) type: Song (or call) type

Each annotation in Audacity is in two consecutive columns: the first column refers (following the prior nomenclature) to the Begin Time (in s), the End Time (in s), and the song (or call) type, and the next row, which starts with a “\”, is the Low Freq (in Hz) and the High Freq (Hz).

Call folder

This folder follows the same structure and subfolders as those explained above, but contains in that case the calls of the 97 Dupont’s lark recorded calling during the study and using the code “C” (call) to identify the vocalization type (for example, “DL_C1”).

The dataset also includes six additional files to support further analyses: (1) a metadata file (“Metadata.csv”) containing detailed information for each recorded individual, including their ID, date of recording, approximate latitude and longitude (geographic coordinates in decimal degrees; rounded to within ±1 km owing to the threatened status of the species) of recording, population where recorded, and recorder (“Tascam” or “Marantz”) employed; (2) two binary presence-absence matrices (“Song_binary_matrix.csv” and “Call_binary_matrix.csv”) indicating the presence or absence of each song or call type identified; (3) two distance matrices (“Song_distance_matrix.csv” and “Call_distance_matrix.csv”) representing the spatial distances (in m) between all pairs of males and their Acoustic (Jacard) similarity indexes, and 4) an R script (“Technical_validation.R”) detailing the steps implemented to perform the technical validation.

Technical Validation

To ensure the technical quality and reliability of the annotated acoustic dataset, multiple quality-control procedures were implemented throughout all stages, from field recording to annotation, classification, and data processing.

Quality of field recordings

All the recordings were obtained: (i) under adequate weather conditions, (ii) using the same high-quality directional microphone, and iii) with a reduced distance to the bird, which guarantees audio clarity and minimises external noise. Moreover, we followed a strict protocol regarding the minimum recording length for each male and minimized the risk of recording the same individual between successive visits by avoiding revisiting the same territory.

Metadata standardization and file naming consistency

Each audio recording included in the dataset is accompanied by a complete metadata record including rounded geographic coordinates of the recorded male, date and hour of recording, and recording equipment used, among others. This detailed metadata ensures the traceability of each recording and facilitates its further integration into larger-scale ecological and bioacoustic studies. Moreover, to prevent mismatches between recordings and their annotation files, a strict naming convention was enforced to ensure that both files share the same nomenclature. This procedure required renaming the original audio and annotation files following the structure described in the Data record section. Moreover, the annotation files are provided in the format of two of the most common audio software programs (Raven Pro and Audacity), which ensures data traceability, interoperability and facilitates its further use by researchers via different analytical tools.

Consistency and agreement in audio annotation

Audio annotation was performed using professional audio software and by a single expert ornithologist (C.D.A-M) who was highly familiar with the Dupont’s lark vocalizations, following a standardized audio annotation protocol (see the Audio annotation protocol). The decision to have a single primary annotator ensured consistency in the classification process. Nonetheless, all cases in which i) the classification of a song or call type was uncertain, and ii) that a song or call type was annotated to be shared among populations, were reviewed by at least one additional experienced ornithologist to ensure its classification. This verification step involved independent inspection of the relevant spectrograms and audio files by up to two additional experts. Only when agreement was consistently agreed upon by at least two researchers was the vocalization type confirmed. This cross-checking procedure provides an additional safeguard against misclassification, ensuring that the final catalogue of song and call types is reliable and robust. Moreover, after this classification, the dataset underwent a thorough quality-control process to verify that the same vocalization type was never duplicated under different labels, and to confirm that all the annotation files correctly matched to their respective audio recordings.

Repertoire metrics and song sharing analysis

As the dataset was primarily designed for ecological and behavioral studies, we tested its applicability for ecological purposes by evaluating the impact of spatial distance among Dupont’s lark males on the acoustic similarity of songs and calls.

To do this, first, we constructed binary (presence/absence) matrices for each individual, representing the occurrence of each distinct song or call type. From these matrices, we calculated the acoustic similarity among each pair of males, estimated via the Jaccard similarity index. The Jaccard similarity index, which ranges from 0 (no shared elements) to 1 (identical elements), is a metric that quantifies similarity between two sets based on the proportion of shared elements (i.e., vocalizations) relative to the total number of elements present in either set⁵². Using the real coordinates (not rounded) of each individual we calculated the Euclidean distance (in meters) for each pair of males. For each vocalization type, the comparisons were made across three distance categories: (i) neighbouring males (<500 m), (ii) males inhabiting the same habitat patch but excluding pairs of males in the first group (>500 m), and (iii) those occupying different habitat patches.

The results revealed a clear pattern of decreasing acoustic similarity with increasing distance for both vocalization types, with values close to zero for males inhabiting different populations (Fig. 3). Moreover, for the same spatial categories, calls consistently presented greater acoustic similarity than songs did. Overall, the average acoustic similarity index for males within 500 m was approximately 0.45 for songs and 0.60 for calls, whereas the acoustic similarity index decreased to nearly zero for males separated by more than 500 m within the same population but remained at ~0.25 for calls within the same population.

External validation with prior research

The quantitative measures derived from the dataset are consistent with previously published studies on Dupont’s lark vocal behavior. In particular, the mean individual call repertoire size of 2.46 agrees with prior research on the species (range 1.88 – 2.12)⁴⁴. Likewise, the patterns of song sharing — including greater sharing among neighbouring males and similarity close to zero for males of different populations — match the species’ known spatial structuring of vocal repertoires^28,42. This agreement with established findings provides additional evidence for the reliability and biological validity of the dataset.

Usage Notes

This section provides technical guidance to assist researchers who wish to reuse the dataset. The audio annotations are provided in plain text format compatible with both Raven Pro and Audacity software but can also be converted to CSV files or other readable formats for further processing. We recommend using programming languages such as R to efficiently read, manipulate, and analyze these data tables. An example script for loading and preparing the data for ecological analyses is included in the Code Availability section.

Regarding the further use of the acoustic dataset, given the spatial variability in song and call types among, and even within, populations, we recommend focusing on parameters such as song and call diversity (i.g., repertoire size) and song rate (i.e., vocal activity uttered per unit of time) for within- and among-population comparisons. In contrast, analyses relying on comparison of acoustic features, such as frequency parameters, should be avoided because of the high spatial variability (i.e. dialects) among populations, which limits meaningful comparisons of acoustic features.

Data availability

The dataset presented in this study is open access and accessible through Zenodo (https://doi.org/10.5281/zenodo.17264916). Detailed description regarding the files in the Zenodo repository can be read in the “Data Record” section of this article.

Code availability

The code used to reproduce the analyses in the Technical Validation section is open access and accessible on Zenodo⁵¹ with the name “Technical_validation.R”.

References

Wright, H. W. Morning awakening and even-song. Auk 29, 307–327 (1912).
Article Google Scholar
Allen, F. H. More notes on the morning awakening. Auk 30, 229–235 (1913).
Article Google Scholar
Amrhein, V., Körner, P. & Naguib, M. Nocturnal and diurnal singing activity in the nightingale: correlations with mating status and breeding cycle. Anim. Behav. 64, 939–944, https://doi.org/10.1006/anbe.2002.1974 (2002).
Article Google Scholar
Upham-Mills, E. J., Reimer, J. R., Haché, S., Lele, S. R. & Bayne, E. M. Can singing rate be used to predict male breeding status of forest songbirds? A comparison of three calibration models. Ecosphere 11, e03005, https://doi.org/10.1002/ecs2.3005 (2020).
Article Google Scholar
Gil, D. & Gahr, M. The honesty of bird song: multiple constraints for multiple traits. Trends Ecol. Evol. 17, 133–141, https://doi.org/10.1016/S0169-5347(02)02410-2 (2002).
Article Google Scholar
Knight, E. et al. Individual identification in acoustic recordings. Trends Ecol. Evol. 39, 947–960, https://doi.org/10.1016/j.tree.2024.05.007 (2024).
Article PubMed Google Scholar
Grant, B. R. & Grant, P. R. Cultural inheritance of song and its role in the evolution of Darwin’s finches. Evolution 50, 2471–2487, https://doi.org/10.1111/j.1558-5646.1996.tb03633.x (1996).
Article PubMed Google Scholar
Sebastián-González, E. & Pérez-Granados, C. Geographic variation in acoustic signals in wildlife: a systematic review. J. Biogeogr. 52, e15116, https://doi.org/10.1111/jbi.15116 (2025).
Article Google Scholar
Kondo, N. & Watanabe, S. Contact calls: information and social function. Jpn. Psychol. Res. 51, 197–208, https://doi.org/10.1111/j.1468-5884.2009.00399.x (2009).
Article Google Scholar
Leonard, M. L. & Horn, A. G. Begging calls and parental feeding decisions in tree swallows (Tachycineta bicolor). Behav. Ecol. Sociobiol. 49, 170–175, https://doi.org/10.1007/s002650000290 (2001).
Article Google Scholar
De Kort, S. R., Emery, N. J. & Clayton, N. S. Food sharing in jackdaws, Corvus monedula: what, why and with whom? Anim. Behav. 72, 297–304, https://doi.org/10.1016/j.anbehav.2005.10.016 (2006).
Article Google Scholar
Laiolo, P., Vögeli, M., Serrano, D. & Tella, J. L. Two new calls from the Dupont’s Lark Chersophilus duponti and imitations by other Alaudids: ecological and monitoring implications. Ardeola 52, 167–172 (2005).
Google Scholar
Recalde, N. M. et al. A densely sampled and richly annotated acoustic data set from a wild bird population. Anim. Behav. 211, 111–122, https://doi.org/10.1016/j.anbehav.2024.02.008 (2024).
Article Google Scholar
Catchpole, C. K. & Slater, P. J. Bird song: biological themes and variations, 2nd edn (Cambridge Univ. Press, 2008).
Garamszegi, L. Z., Pavlova, D. Z., Eens, M. & Møller, A. P. The evolution of song in female birds in Europe. Behav. Ecol. 18, 86–96, https://doi.org/10.1093/beheco/arl047 (2007).
Article Google Scholar
Amy, M., Salvin, P. & Leboucher, G. The functions of female calls in birds. In Advances in the Study of Behavior 50, 243–271, https://doi.org/10.1016/bs.asb.2017.11.002 (Academic Press, 2018)
Rose, E. M., Prior, N. H. & Ball, G. F. The singing question: Re‐conceptualizing birdsong. Biol Rev 97, 326–342, https://doi.org/10.1111/brv.12800 (2022).
Article PubMed Google Scholar
LaZerte, S. E., Otter, K. A. & Slabbekoorn, H. Mountain chickadees adjust songs, calls and chorus composition with increasing ambient and experimental anthropogenic noise. Urban Ecosyst. 20, 989–1000, https://doi.org/10.1007/s11252-017-0652-7 (2017).
Article Google Scholar
Pérez-Granados, C., Osiejuk, T. S. & López-Iborra, G. M. Dawn chorus interpretation differs when using songs or calls: the Dupont’s Lark Chersophilus duponti case. PeerJ 6, e5241, https://doi.org/10.7717/peerj.5241 (2018).
Article PubMed PubMed Central Google Scholar
Slagsvold, T. Bird song activity in relation to breeding cycle, spring weather, and environmental phenology. Ornis Scand. 8, 197–222 (1977).
Article Google Scholar
Sanders, C. E. & Mennill, D. J. Acoustic monitoring of nocturnally migrating birds accurately assesses the timing and magnitude of migration through the Great Lakes. Condor: Ornithol. Appl. 116, 371–383, https://doi.org/10.1650/CONDOR-13-098.1 (2014).
Article Google Scholar
Hettena, A. M., Muñoz, N. & Blumstein, D. T. Prey responses to predator’s sounds: a review and empirical study. Ethology 120, 427–452, https://doi.org/10.1111/eth.12219 (2014).
Article Google Scholar
Laiolo, P. The emerging significance of bioacoustics in animal species conservation. Biol. Conserv. 143, 1635–1645, https://doi.org/10.1016/j.biocon.2010.03.025 (2010).
Article Google Scholar
Kroodsma, D. E. Learning and the ontogeny of sound signals in birds. In Acoustic Communication in Birds (eds Kroodsma, D. E., Miller, E. H. & Ouellet, H.) 1–23 (Academic Press, 1982).
Baker, M. C. & Cunningham, M. A. The biology of bird-song dialects. Behav. Brain Sci. 8, 85–133 (1985).
Article Google Scholar
Salinas-Melgoza, A. & Wright, T. F. Evidence for vocal learning and limited dispersal as dual mechanisms for dialect maintenance in a parrot. PLoS ONE 7, e48667, https://doi.org/10.1371/journal.pone.0048667 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Brown, E. D. & Farabaugh, S. M. What birds with complex social relationships can tell us about vocal learning: vocal sharing in avian groups. In Social Influences on Vocal Development (eds Snowdon, C. T. & Hausberger, M.) 98–127, https://doi.org/10.1017/CBO9780511758843.007 (Cambridge Univ. Press, 1997).
Pérez-Granados, C., Osiejuk, T. & López-Iborra, G. M. Habitat fragmentation effects and variations in repertoire size and degree of song sharing among close Dupont’s Lark Chersophilus duponti populations. J. Ornithol. 157, 471–482, https://doi.org/10.1007/s10336-015-1310-6 (2016).
Article Google Scholar
Laiolo, P., Vögeli, M., Serrano, D. & Tella, J. L. Song diversity predicts the viability of fragmented bird populations. PLoS ONE 3, e1822, https://doi.org/10.1371/journal.pone.0001822 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Pérez-Granados, C. et al. Vocal activity rate index: a useful method to infer terrestrial bird abundance with acoustic monitoring. Ibis 161, 901–907, https://doi.org/10.1111/ibi.12728 (2019).
Article Google Scholar
Lepage, D. Avibase — The World Bird Database https://avibase.bsc-eoc.org/checklist.jsp (2021).
Otter, K. A., Mckenna, A., LaZerte, S. E. & Ramsay, S. M. Continent-wide shifts in song dialects of white-throated sparrows. Curr. Biol. 30, 3231–3235.e3, https://doi.org/10.1016/j.cub.2020.05.084 (2020).
Article CAS PubMed Google Scholar
Calviño-Cancela, M., Piña, L. & Martín-Herrero, J. Bioacoustic differentiation of calls in the chiffchaff complex. PeerJ 10, e14261, https://doi.org/10.7717/peerj.14261 (2022).
Article PubMed PubMed Central Google Scholar
Mikula, P. et al. A global analysis of song frequency in passerines provides no support for the acoustic adaptation hypothesis but suggests a role for sexual selection. Ecol. Lett. 24, 477–486, https://doi.org/10.1111/ele.13662 (2021).
Article PubMed Google Scholar
Silva-Jr, E. F., Diniz, P. & Macedo, R. H. Song varies with latitude, climate, and species richness in a Neotropical bird. Behav. Ecol. 33, 87–100, https://doi.org/10.1093/beheco/arab112 (2022).
Article Google Scholar
Gómez-Gómez, J., Vidaña-Vila, E. & Sevillano, X. Western Mediterranean Wetland Birds dataset: A new annotated dataset for acoustic bird species classification. Ecol. Inform. 75, 102014, https://doi.org/10.1016/j.ecoinf.2023.102014 (2023).
Article Google Scholar
Jamil, N., Norali, A. N., Ramli, M. I., Shah, A. K. M. K. & Mamat, I. SiulMalaya: an annotated bird audio dataset of Malaysia lowland forest birds for passive acoustic monitoring. Bull. Electr. Eng. Inform. 12, 2269–2281, https://doi.org/10.11591/eei.v12i4.5222 (2023).
Article Google Scholar
Pérez-Granados, C. et al. WABAD: A world annotated bird acoustic dataset for passive acoustic monitoring. Ecology, 107, e70317, https://doi.org/10.1002/ecy.70317 (2026).
Kahl, S., Wood, C. M., Eibl, M. & Klinck, H. BirdNET: A deep learning solution for avian diversity monitoring. Ecol. Inform. 61, 101236, https://doi.org/10.1016/j.ecoinf.2021.101236 (2021).
Article Google Scholar
Cornell Lab of Ornithology. Merlin Bird ID, version 3.7.1. Cornell Lab of Ornithology, Ithaca, NY, USA https://merlin.allaboutbirds.org/ (2025).
Wood, C. M., Kahl, S., Rahaman, A. & Klinck, H. The machine learning–powered BirdNET App reduces barriers to global bird research by enabling citizen science participation. PLoS Biol. 20, e3001670, https://doi.org/10.1371/journal.pbio.3001670 (2022).
Article CAS PubMed PubMed Central Google Scholar
Laiolo, P. & Tella, J. L. Habitat fragmentation affects culture transmission: patterns of song matching in Dupont’s lark. J. Appl. Ecol. 42, 1183–1193, https://doi.org/10.1111/j.1365-2664.2005.01093.x (2005).
Article Google Scholar
Laiolo, P. & Tella, J. L. Landscape bioacoustics allow detection of the effects of habitat patchiness on population structure. Ecology 87, 1203–1214, 10.1890/0012-9658(2006)87[1203:LBADOT]2.0.CO;2 (2006).
Gómez-Catasús, J., Barrero, A., Llusia, D., Iglesias-Merchán, C. & Traba, J. Wind farm noise shifts vocalizations of a threatened shrub-steppe passerine. Environ. Pollut. 303, 119144, https://doi.org/10.1016/j.envpol.2022.119144 (2022).
Article CAS PubMed Google Scholar
Barrero, A., Gómez-Catasús, J., Pérez-Granados, C., Bustillo-de la Rosa, D. & Traba, J. Conspecific density and habitat quality drive the defence and vocal behaviour of a territorial passerine. Ibis 166, 826–843, https://doi.org/10.1111/ibi.13295 (2024).
Article Google Scholar
Gómez-Catasús, J., Barrero, A., Garza, V. & Traba, J. Alondra ricotí — Chersophilus duponti (Vieillot, 1820). In Enciclopedia Virtual de los Vertebrados Españoles (eds Salvador, A. & Morales, M. B.) Museo Nacional de Ciencias Naturales. http://www.vertebradosibericos.org/ (accessed 10 Aug 2025) (2016).
Laiolo, P., Vögeli, M., Serrano, D. & Tella, J. L. Testing acoustic versus physical marking: two complementary methods for individual‐based monitoring of elusive species. J Avian Biol 38, 672–681, https://doi.org/10.1111/j.2007.0908-8857.04006.x (2007).
Article Google Scholar
Pérez-Granados, C., López-Iborra, G. M., Garza, V. & Traba, J. Breeding biology of the endangered Dupont’s Lark Chersophilus duponti in two separate Spanish shrub-steppes. Bird Study 64, 328–338, https://doi.org/10.1080/00063657.2017.1359232 (2017).
Article Google Scholar
Pérez-Granados, C. & López-Iborra, G. M. Assessment of counting methods used for estimating the number of territorial males in the endangered Dupont’s Lark. Ardeola 64, 75–84, https://doi.org/10.13157/arla.64.1.2017.sc2 (2017).
Article Google Scholar
Cornell Lab of Ornithology. Raven Pro: Interactive Sound Analysis Software, version 1.6.4. Cornell Lab of Ornithology, Ithaca, NY, USA https://www.ravensoundsoftware.com/ (2023)
Pérez-Granados, C., Alonso-Moya, C., Barrero, A., Sáez-Gómez, P. & Traba, J. A large-scale acoustic dataset of a passerine with spatially variable vocal behavior: fine-scale annotations of song and call types. Zenodo. https://doi.org/10.5281/zenodo.17264916 (2025).
Article Google Scholar
Jaccard, P. É. tude comparative de la distribution florale dans une portion des Alpes et des Jura. Bull Soc Vaudoise Sci Nat 37, 547–579 (1901).
Google Scholar

Download references

Acknowledgements

The results are part of project PID2022-139294NB-I00, funded by the Ministerio de Ciencia e Innovación of Spain (MCIN/AEI/10.13039/501100011033/FEDER/UE). CPG was funded by the Ramón y Cajal 2024 Programme (RYC2024-048830-I) of the Spanish Ministry of Science, Innovation and Universities, funded by MICIU/AEI/10.13039/501100011033 and FSE+.

Author information

Authors and Affiliations

Biodiversity Management and Conservation Programme, Forest Science and Technology Centre of Catalonia, CTFC, Lleida, Spain
Cristian Pérez-Granados, Cristina D. Alonso-Moya & Gerard Bota
Terrestrial Ecology Group (TEG-UAM), Departamento de Ecología, Universidad Autónoma de Madrid, 28049, Madrid, Spain
Adrián Barrero, Pedro Sáez-Gómez & Juan Traba
Centro de Investigación en Biodiversidad y Cambio Global (CIBC-UAM), Universidad Autónoma de Madrid, 28049, Madrid, Spain
Adrián Barrero & Juan Traba
Department of Ecology and Evolution, Doñana Biological Station (CSIC), 41092, Seville, Spain
Pedro Sáez-Gómez
Instituto Pirenaico de Ecología, CSIC, Jaca, Huesca, España
José J. Lahoz-Monfort
Department of Behavioural Ecology, Institute of Environmental Biology, Faculty of Biology, Adam Mickiewicz University, Poznań, Poland
Tomasz S. Osiejuk
Department of Computer Science, University of Oxford, Oxford, OX1 3QD, UK
Alex Rogers
Institute for Science and Technology in Wetlands (INAU), Federal University of Mato Grosso (UFMT), Computational Bioacoustics Research Unit (CO.BRA), Fernando Correa da Costa Av. 2367, Cuiabá, MT, 78060-900, Brazil
Karl-L. Schuchmann
Zoological Research Museum A. Koenig (ZFMK), Ornithology, Adenauerallee 160, Bonn, 53113, Germany
Karl-L. Schuchmann
Postgraduate Program in Zoology, Institute of Biosciences, Federal University of Mato Grosso, Cuiabá, Brazil
Karl-L. Schuchmann

Authors

Cristian Pérez-Granados
View author publications
Search author on:PubMed Google Scholar
Cristina D. Alonso-Moya
View author publications
Search author on:PubMed Google Scholar
Adrián Barrero
View author publications
Search author on:PubMed Google Scholar
Pedro Sáez-Gómez
View author publications
Search author on:PubMed Google Scholar
Gerard Bota
View author publications
Search author on:PubMed Google Scholar
José J. Lahoz-Monfort
View author publications
Search author on:PubMed Google Scholar
Tomasz S. Osiejuk
View author publications
Search author on:PubMed Google Scholar
Alex Rogers
View author publications
Search author on:PubMed Google Scholar
Karl-L. Schuchmann
View author publications
Search author on:PubMed Google Scholar
Juan Traba
View author publications
Search author on:PubMed Google Scholar

Contributions

C.P.-G. and C.D.A.-M. conceived the study, C.P.-G., C.D.A.-M., A.B. and P.S.-G. conducted the experiments and analysed the results, C.D.A.-M. managed the data collection and annotation, C.P.-G. and J.T. led the project management. All the authors contributed to the data annotation review and manuscript preparation.

Corresponding author

Correspondence to Cristian Pérez-Granados.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Pérez-Granados, C., Alonso-Moya, C.D., Barrero, A. et al. A large-scale acoustic dataset of a passerine with spatially variable vocal behavior: fine-scale annotations of song and call types. Sci Data 13, 770 (2026). https://doi.org/10.1038/s41597-026-07131-4

Download citation

Received: 16 October 2025
Accepted: 25 March 2026
Published: 30 March 2026
Version of record: 22 May 2026
DOI: https://doi.org/10.1038/s41597-026-07131-4

A large-scale acoustic dataset of a passerine with spatially variable vocal behavior: fine-scale annotations of song and call types

Abstract

Similar content being viewed by others

Detection ranges of forest bird vocalisations: guidelines for passive acoustic monitoring

An Open Dataset for the Acoustic Monitoring of Nocturnal Migratory Birds in Europe

Global birdsong embeddings enable superior transfer learning for bioacoustic classification

Background & Summary