Abstract
EU directives (e.g. MSFD, Habitats Directive), along with OSPAR guidelines, mandate sustainable marine resource management across national borders. Benthic organisms are crucial for assessing marine ecosystem health, but their morphological identification is time-consuming and costly. High-throughput sequencing, particularly DNA metabarcoding, offers an alternative. However, DNA-based monitoring requires substantial investment in high-quality DNA reference libraries. The GEANS project (Genetic Tools for Ecosystem Health Assessment in the North Sea Region) aimed to develop efficient DNA-based tools for benthic biomonitoring. GEANS created a curated DNA reference library (COI) for species relevant to North Sea macrobenthos monitoring, using new sequences, non-public barcode sequences, and mined sequences from GenBank and BOLD. The library, stored in a dedicated BOLD project with photographs and metadata, includes DNA barcodes for 4005 specimens from 715 species, representing over 29% of North Sea macrobenthos species. Arthropoda is the most represented, while Bryozoa and Annelida have the lowest coverage. This DNA library is expected to facilitate fast, cost-effective environmental health assessments in the North Sea for public authorities and academics.
Similar content being viewed by others
Background & Summary
Macrobenthic invertebrates, animals larger than 1 mm, are key components in environmental monitoring and are extensively used for ecological status assessment of marine ecosystems worldwide because of their sensitivity to natural and anthropogenic disturbances1,2,3,4,5,6,7. Anthropogenic disturbances such as fisheries, sand extraction, pollution and shipping can impact growth, mortality, dispersal and recruitment of macrobenthic invertebrates, which in turn will affect ecosystem structure and function, along with their resilience8,9,10. Macrobenthic invertebrates are among the key obligatory components of biological monitoring surveys implemented in numerous countries in support of environmental directives, such as the European Union’s Marine Strategy Framework Directive (MSFD 2008/56/EC) and the Environmental Impact Assessment Directive (EIAD 2014/52/EU). Furthermore, for ecosystem health assessment, often ecological indices, such as the AZTI’s Marine Biotic Index (AMBI) and Shannon-Wiener diversity index (H’), are applied to macrobenthic communities, and these mostly require species-level identifications2.
Taxonomic identifications of macrobenthic invertebrates for routine assessments in marine areas, including the North Sea, have been carried out, using almost entirely morphology-based methodologies up to this day. This is a time- and cost-consuming, as well as a skill-dependent approach, which can result in a bottleneck (sampled vs processed specimens) in processing benthic samples for e.g. status assessments11,12. Moreover, species-level identifications can often be hindered, either because morphology-based identifications are difficult and require specialised expertise (especially true in groups such as Bryozoa, Hydrozoa, and Nemertea), or because during sampling and processing specimens get damaged and are missing key taxonomic characteristics. Furthermore, species-level identifications can be extremely difficult or nearly impossible when dealing with non-adult stages such as juveniles, larvae or eggs12. The increasing reports of cryptic species, even among many common macrobenthic species, complicates the morphology-based species identifications even further13. Finally, non-indigenous species resembling native species can be overlooked in routine rapid monitoring assessments14.
DNA-based approaches, such as DNA metabarcoding, have the potential to help tackle many of the limitations encountered by the morphology-based approach15,16,17,18,19. DNA metabarcoding appears to be more cost- and time-effective, does not require taxonomic experts for species identification and can detect non-indigenous, rare, or even undiscovered species that can go unnoticed with conventional methods14,20,21. Instead of the specimens being identified one by one morphologically, DNA is extracted from the total community, and a small fragment of the genome is amplified through PCR22. The resulting amplicons are sequenced using high-throughput sequencing and the sequences produced are processed through bioinformatic pipelines23. However, the potential power of DNA metabarcoding is currently limited mainly by the considerable endeavour needed to build comprehensive and reliable taxonomic sequence reference libraries that are required for matching DNA sequences to species names15,24,25. To ensure a high quality reference library, sequences must have a priori curated taxonomic information, and are preferably restricted to a list of species of the study area as taxonomic misassignment increases with geographic distance25,26. Most studies re-use sequences obtained from public sequence repositories, with the most common being NCBI GenBank (https://www.ncbi.nlm.nih.gov/genbank/)27,28 and Barcode of Life Data System (BOLD; http://www.boldsystems.org/)29. Although the databases’ importance is unquestionable, there is a significant percentage of sequence data without quality control and taxonomic validation that could lead to misleading results25,30,31,32.
The North Sea is amongst the most heavily human-impacted marine areas worldwide33,34,35 (Fig. 1). At the same time, the North Sea is also one of the most well studied and data-rich marine areas in the world34, and it is routinely monitored by several countries organized in OSPAR (Convention for the Protection of the Marine Environment of the North-East Atlantic) and ICES (International Council for the Exploration of the Sea). The management of such a system with many anthropogenic pressures requires timely and efficient monitoring approaches. Therefore, the Interreg project GEANS (Genetic Tools for Ecosystem Health Assessment in the North Sea Region), a transnational project among nine institutions across the North Sea aimed to implement accurate, fast, cost-effective DNA-based tools in routine biomonitoring of the North Sea. To this end, GEANS deemed it essential to develop a curated DNA reference library based on mitochondrial cytochrome c oxidase subunit I (COI) for the North Sea macrobenthos (mainly soft bottom) in support of the routine monitoring programs in the North Sea. The choice of COI marker was driven by (i) the marker’s taxonomic resolution which permits species discrimination, identification and discovery in most of the marine invertebrate groups15,36; (ii) the vast amount of data already available as reference in the collaborators’ labs and in public repositories27,29 that could be used for cross-checking; (iii) the consistent use of COI in barcoding species and especially the 5′ end (COI-5P), the region that can be amplified using universal DNA-barcoding primers, such as LCO1490/HCO219837 and their variations developed in recent years38. Furthermore, prior national and international initiatives have demonstrated the general effectiveness of DNA barcoding for various marine invertebrate groups in the North Sea, such as Mollusca39, Echinodermata40, Crustacea41,42.
The aim of the present work was to create a curated DNA barcode reference library for the macrobenthic invertebrates of the North Sea (with priority to soft bottoms) by: (i) producing new COI reference sequences from a targeted region-defined species list; (ii) assessing and curating COI data already available to the collaborators’ labs; and (iii) providing a workflow for the creation of a curated reference library (Fig. 2).
Simplified overview of curation workflow for the GEANS reference library. Inspired by Collins et al. 2021 (logos and images are public domain and were acquired from https://www.phylopic.org, https://www.ncbi.nlm.nih.gov, https://boldsystems.org, https://www.keyence.eu, https://www.deutsche-meeresforschung.de).
The GEANS reference library numbers a total of 4005 COI-5P barcode sequences from 732 (715 identified to species level) macrobenthic taxa and assigned to 764 BINs (Barcode Index Number43), which in turn were distributed over 15 phyla, 29 classes, 92 orders, 333 families, and 537 genera (Fig. 3; GEANS Reference Library44). The reference library, when compared to the number of macrobenthic species (2514 species, North Sea species list44) present in the North Sea covers over 29% of macrobenthos species diversity. Of the total number of taxa barcoded and identified to species level (715), 77 correspond to NIS (GEANS targeted species list44). A total of 1714 new DNA barcodes were generated through this study, of which 173 belonged to 62 species barcoded for the first time (Fig. 4; Table 1). The number of individuals per species ranged from 1 to 158, with 346 species (48%) represented by less than three individuals, 272 of which were represented by only a single specimen. Arthropoda was the most well represented taxon in number of sequences in the library with 1886 (47%; Fig. 3; Table 1) belonging to 246 species (Figs. 4, 5). Annelida, although recorded by a low number of sequences (358), were well represented in the reference library (126 species, 18% of the total number of species barcoded within GEANS, Fig. 5). Among all sequenced groups, Echinodermata had the highest barcode coverage with 93% (corresponding to 40 species; Fig. 5) of all Echinodermata species included in the GEANS target checklist being barcoded and 48% (40 species) when compared to the whole North Sea fauna (84 species; Fig. 4). In contrast, Bryozoa had the lowest barcode coverage with 50% of the total number of Bryozoa species in the checklist but only 8% when compared with the total North Sea fauna (Figs. 4, 5).
Methods
The curated DNA barcode reference library (GEANS Reference Library44) presented here for North Sea macrobenthos was constructed based on a seven-step workflow (Fig. 2) that generated a diverse set of validated data starting with a targeted species checklist (GEANS targeted species list) restricted mainly to the south North Sea (Fig. 1).
Targeted species checklist and North Sea species list
The nine GEANS partners (originating from seven countries: Belgium, Denmark, Germany, Netherlands, Norway, Sweden, United Kingdom) provided regional species lists based on species encountered in their long-term morphology-based monitoring data. This resulted in a concatenated target list (GEANS targeted species list) of 1016 marine macrobenthic species (119 non-indigenous species, NIS), that were checked for taxonomy (e.g. synonyms removed, checking validity of species names). As such, the majority of these species occur in areas of the North Sea, where the GEANS partners performed the case studies for testing the effectiveness of metabarcoding for specific monitoring questions14,45,46. The targeted checklist served as the basis of the GEANS reference library. To put the targeted list in a wider North Sea perspective, a North Sea macrobenthic species list44 was generated. This North Sea species list was created after extracting macrobenthic data from EurOBIS in a similar manner as Herman et al.47. Additionally, it was completed by the list of Zettler et al.48, and cross-checked by the list provided by WORMS for the North Sea which in turn was verified based on the relevant literature.
Specimen collection, and identification of specimens
Specimens were collected from the North Sea during various research expeditions that took place in the years 2019–2021 using Van Veen grabs, boxcorers, and dredges (ring dredge, Triple-D dredge). Sampling was conducted by three GEANS partners, the German Centre for Marine Biodiversity Research (DZMB), Naturalis Biodiversity Center (Naturalis), and Flanders Research Institute for Agriculture, Fisheries and Food (ILVO) with research vessels RV Senckenberg, RV Pelagia, RV Belgica A956, RV Simon Stevin and GeoSurveyor XI. Subsequently, the same partners performed the morphological and genetic analyses. Following their collection, bulk samples or separated animals were fixed in precooled 96% or 99.8% ethanol. For all samples and specimens, DZMB collected, the ethanol was decanted after 24 hours and replaced with new 96–99.8% EtOH to guarantee sufficient ethanol concentration for preservation of high-quality DNA, and subsequently stored at −20 °C in one or more of the collaborative laboratories. In the laboratory, samples were sorted and identified at species level by taxonomic experts. The taxonomic status of each species was validated based on the World Register of Marine Species (www.marinespecies.org). For each species, when possible, at least three voucher specimens were archived. Additional specimens were provided by the Gothenburg Natural History Museum, Sweden as well as by the German authority Landesbetrieb für Küstenschutz Nationalpark und Meeresschutz Schleswig-Holstein.
Barcoding data collection
DNA extraction, amplification, and sequencing
Total genomic DNA was extracted from animal tissue. At DZMB for samples where DNA of high quality was expected, the DNA extractions were carried out using 30 μL Chelex (InstaGene Matrix, Bio-Rad) according to the protocol of Estoup et al.49 and directly using it as DNA template for PCR. For samples where DNA of low quality was expected the Monarch Genomic DNA Purification Kit was used following manufacturer’s instructions. At ILVO, DNA was extracted using the DNeasy Blood & Tissue kit (Qiagen) following the manufacturer’s protocol. The concentration of the DNA was determined using the Quantus Fluorometer with the QuantiFluor dsDNA System (Promega). At Naturalis, DNA was extracted using the NucleoMag 96 Tissue kit (Macherey-Nagel) on the KingFisher (Thermo Scientific) according to the manufacturer’s protocol. DNA extractions were stored at −20 °C. A fragment of 658 bp of the mitochondrial cytochrome c oxidase subunit (COI), which is the standard barcoding marker for animals, was amplified by polymerase chain reaction (PCR). Amplifications in DZMB were performed using AccuStart PCR SuperMix (ThermoFisher Scientific) in a 25-μL volume using a standardised protocol (Table 2). All PCR products were purified using ExoSap-IT (ThermoFisher Scientific). Amplifications in Naturalis were performed using Phire II Hotstart (Thermo Scientific) in a 25 μL volume (Table 2). For the COI amplification the degenerate forward primers jgLCO1490 and reverse primer jgHCO219838, tailed with M13F and M13R-pUC, respectively, were used both by DZMB and Naturalis. DZMB also used the Echinodermata specific forward primer, LCOech1aF150, a polychaeta specific primer pair51 whereas, a universal pair that amplifies a shorter barcode region was also tested11. Amplifications in ILVO were performed with LCO1490 and HCO2198 primers in a 40 μL volume (Table 2). PCR products produced by ILVO were purified using the Wizard® SV Gel and PCR Clean-Up System (Promega). Purified PCR products from SGN and ILVO were sequenced by Sanger sequencing in both directions at Macrogen Europe BV (Amsterdam, The Netherlands), whereas Naturalis fragments were sequenced at BaseClear BV (Leiden, The Netherlands).
Existing barcodes in collaborators databases
The collaborators’ internal databases were mined for barcode sequences of macrobenthic animals collected from the North Sea. Only barcodes above 500 bp were considered, unless shorter fragments were the only ones available for a targeted species. Specifically, the DZMB completed the GEANS reference library with COI sequences from past barcode initiatives such as the “Molecular taxonomy and DNA barcoding of marine organisms (metazoa) of the North Sea”39,40,41,42. These sequences correspond to specimens or tissue archived in DZMB’s collections. The forward and reverse sequence chromatograms for each specimen were inspected, assembled, and edited using Geneious v.9.1.7 (www.geneious.com52). The COI sequences were aligned using MAFFT v7.30853 under G-INS-I algorithm, while alignments were further manually edited.
Data Records
The GEANS Reference Library (summary information), GEANS Targeted Species List and North Sea species list and Neighbour Joining trees are available in Figshare44. Additionally in Figshare44 are found the DNA barcodes and specimen photos corresponding to the new barcodes produced. Additionally, barcodes produced during GEANS are available in GenBank (BioProject PRJNA123682254). The data are available as well in BOLD through the dataset DS-GEANS155 (dx.doi.org/10.5883/DS-GEANS1). Each COI barcode included in the GEANS reference library is accompanied by the following mandatory information: 1) sample ID; 2) specimen taxonomic identification and classification; 3) collection date; 4) collection coordinates; 5) storing institution; 6) when possible one photograph of the specimen (Fig. 6), when possible photos of the key diagnostic features; 7) name of taxonomic expert; 8) sequence chromatograms; 9) museum ID when specimens are archived in museum collections. Finally, the GEANS reference library also includes: (1) voucher specimens; (2) tissue samples; (3) total DNA extractions. A specimen was considered as a species reference when molecular and morphological assessments agreed. The library follows the barcode data standard requirements29,32,36,56. Samples and extractions are available in the partner institutes (DZMB, Naturalis, ILVO).
GEANS reference library online gallery of photo vouchers of sequenced specimens identified by taxonomic experts. (A) Hippasteria phrygiana (Parelius, 1768); (B) Pagurus bernhardus (Linnaeus, 1758); (C) Macoma balthica (Linnaeus, 1758); (D) Peringia ulvae (Pennant, 1777); (E) Loimia ramzega Lavesque et al. 2017; (F) Psammechinus miliaris (P.L.S. Müller, 1771); (G) Ampelisca brevicornis (A. Costa, 1853); (H) Doris pseudoargus Rapp, 1827; (J) Diastylis bradyi Norman, 1879; (I) Magelona johnstoni Fiege, et al., 2000; (K) Euspira nitida (Donovan, 1803); (L) Pilumnus hirtellus (Linnaeus, 1761); (M) Lekanesphaera rugicauda (Leach, 1814). Scales: 1 cm (A, B, D, F); 1 mm (K, M); 2 mm (G, H, J, I, L); 5 mm (E). Photos by: V. Borges (A); M. Christodoulou (B, D, F); H. Hillewaert (C, E, G, J, I, K); GiMaRIS (H, L); W. Stamerjohanns (M).
Technical Validation
Each institution performed independent morphological identifications prior to the genetic identification. When disagreements were found, they were listed and the voucher specimens or the photos were revised to verify the original identifications. Obvious mistakes in identification or curation (e.g., mixing of photos for example) were corrected, in all other cases the mismatch between genetic and morphological identification was recorded as such. Finally, the species names were updated to the current taxonomy based on the World Register of Marine Species (WoRMS). Curation cycles were performed at regular intervals (Fig. 2). In addition to morphological validation, all barcodes were translated into amino acids to check for stop codons and to detect the presence of nuclear DNA pseudogenes (NUMTs). The obtained COI sequences were initially compared with the GenBank nucleotide database using BLASTN57 to confirm the phylum identity (Fig. 2). Additionally the BOLD database was used for verification once the barcodes were within, since BOLD contains more barcode sequences than GenBank (including unpublished barcodes). For each taxonomic group (phylum or order depending on the number of sequences), Neighbour Joining (NJ) analysis based on p-distances with 1000 non-parametric bootstrap replicates was performed using the software MEGA v.1158 and any irregularities (possible contaminations) were removed from the library (trees are available in Figshare44). Sequences were considered to be the same taxon if sequence identity was ≥97.5%.
Usage Notes
The GEANS DNA reference library offers a comprehensive, publicly available barcode dataset for North Sea macrobenthos available in BOLD (DS-GEANS1 (dx.doi.org/10.5883/DS-GEANS155). This resource enables specimen identification through barcoding and metabarcoding, thereby greatly facilitating macrobenthic biodiversity assessments using molecular tools in the North Sea region. The DNA barcode reference library presented in this study includes around 30% of North Sea macrobenthic species, and aims to complement and facilitate the morphological identification of species through barcoding or metabarcoding.
From the total number of targeted species in the checklist (1016 species, GEANS targeted species list), we were unable to recover barcode sequences for 215 species (21%), and were not successful in finding specimens for an additional 86 species (8%). The phylum with the lowest amplification success was Annelida-Polychaeta (37%, Fig. 5), followed by Arthropoda (18%) and Mollusca (14%).
The majority of BINs allocated to the species within the GEANS dataset were considered concordant (i.e., one BIN = one species) with 684 species corresponding to 96% of the total number of BINs (GEANS Reference Library). A total of 31 species were assigned to more than one BIN (72 BINs, 4% of the species). Although originally a larger number of BINs than the one mentioned above were found to be discordant (BINs shared by more than one species), a subsequent validation revealed that this was due mainly to misidentifications. A small number of shared BINs are most likely due to the presence of unvalidated or erroneously identified data in BOLD and not actually wrong records in our dataset, however some closely related species may not be distinguishable solely by the COI and they may appear sharing BINs. At the same time, a number of species found to hold more than one BIN could indicate the presence of cryptic species (e.g., Astropecten irregularis, Crepidula fornicata, Hediste diversicolor).
The library is expected to significantly expand the reach and accuracy of DNA metabarcoding studies in the North Sea whereas it allows for its continued growth to better understand the diversity of the North Sea fauna.
Code availability
No code was produced in the current study.
References
Dauer, D. M. Biological criteria, environmental health and estuarine macrobenthic community structure. Mar. Pollut. Bull. 26, 249–257 (1993).
Borja, A., Franco, J. & Pérez, V. A marine biotic index to establish the ecological quality of soft-bottom benthos within European estuarine and coastal environments. Mar. Pollut. Bull. 40, 1100–1114 (2000).
Rosenberg, R., Blomqvist, M., Nilsson, H. C., Cederwall, H. & Dimming, A. Marine quality assessment by use of benthic species-abundance distributions: a proposed new protocol within the European Union Water Framework Directive. Mar. Pollut. Bull. 49, 728–739 (2004).
Muxika, I., Borja, Á. & Bonne, W. The suitability of the marine biotic index (AMBI) to new impact sources along European coasts. Ecol. Indic. 5, 19–31 (2005).
Patrício, J., Neto, J. M., Teixeira, H., Salas, F. & Marques, J. C. The robustness of ecological indicators to detect long-term changes in the macrobenthos of estuarine systems. Mar. Environ. Res. 68, 25–36 (2009).
Kröncke, I. et al. Changes in North Sea macrofauna communities and species distribution between 1986 and 2000. Estuar. Coast. Shelf Sci. 94, 1–15 (2011).
Coates, D. A., Deschutter, Y., Vincx, M. & Vanaverbeke, J. Enrichment and shifts in macrobenthic assemblages in an offshore wind farm area in the Belgian part of the North Sea. Mar. Environ. Res. 95, 1–12 (2014).
Worm, B. et al. Impacts of biodiversity loss on ocean ecosystem services. Science 314, 787–790 (2006).
Thrush, S. F. et al. Changes in the location of biodiversity–ecosystem function hot spots across the seafloor landscape with increasing sediment nutrient loading. Proc. Biol. Soc. 284, 1471–2954 (2017).
Lam-Gordillo, O., Baring, R. & Dittmann, S. Ecosystem functioning and functional approaches on marine macrobenthic fauna: A research synthesis towards a global consensus. Ecol. Indic. 115, 106379 (2020).
Lobo, J., Shokralla, S., Costa, M. H., Hajibabaei, M. & Costa, F. O. DNA metabarcoding for high-throughput monitoring of estuarine macrobenthic communities. Sci. Rep. 7, 15618 (2017).
Hering, D. et al. Implementation options for DNA-based identification into ecological status assessment under the European Water Framework Directive. Water Resour. 138, 192–205 (2018).
Grosse, M., Capa, M. & Bakken, T. Describing the hidden species diversity of Chaetozone (Annelida, Cirratulidae) in the Norwegian Sea using morphological and molecular diagnostics. ZooKeys 1039, 139–176 (2021).
Hablützel, P. et al. DNA-based monitoring of Non-Indigenous Species. Pilot Report (Interreg North Sea Region GEANS, EU, 2023).
Bucklin, A., Steinke, D. & Blanco-Bercial, L. DNA barcoding of marine metazoa. Ann. Rev. Mar. Sci. 3, 471–508 (2011).
Taberlet, P., Coissac, E., Pompanon, F., Brochmann, C. & Willerslev, E. Towards next‐generation biodiversity assessment using DNA metabarcoding. Mol. Ecol. 21, 2045–2050 (2012).
Aylagas, E., Borja, A., Irigoien, X. & Rodriguez-Ezpeleta, N. Benchmarking DNA metabarcoding for biodiversity-based monitoring and assessment. Front. Mar. Sci. 3, 96 (2016).
Derycke, S. et al. Detection of macrobenthos species with metabarcoding is consistent in bulk DNA but dependent on body size and sclerotization in eDNA from the ethanol preservative. Front. Mar. Sci. 8 (2021).
Van den Bulcke, L. et al. DNA metabarcoding on repeat: Sequencing data of marine macrobenthos are reproducible and robust across labs and protocols. Ecol. Indic. 150, 110207 (2023).
Rey, A., Basurko, O. C. & Rodriguez-Ezpeleta, N. Considerations for metabarcoding-based port biological baseline surveys aimed at marine nonindigenous species monitoring and risk assessments. Ecol. Evol. 10, 2452–2465 (2020).
Bucklin, A. et al. Toward a global reference database of COI barcodes for marine zooplankton. Mar. Biol. 168, 78 (2021).
Leray, M. et al. A new versatile primer set targeting a short fragment of the mitochondrial COI region for metabarcoding metazoan diversity: application for characterizing coral reef fish gut contents. Front. Zool. 10, 34 (2013).
Pawloski, J. et al. The future of biotic indices in the ecogenomic era: Integrating (e)DNA metabarcoding in biological assessment of aquatic ecosystems. Sci. Total Environ. 637, 1295–1310 (2018).
Cristescu, M. E. & Hebert, P. D. N. Uses and misuses of environmental DNA in biodiversity science and conservation. Annu. Rev. Ecol. Evol. Syst. 49, 209–230 (2018).
Weigand, H. et al. DNA barcode reference libraries for the monitoring of aquatic biota in Europe: Gap-analysis and recommendations for future work. Sci. Total Envir. 678, 499–524 (2019).
Bergsten, J. et al. The effect of geographical scale of sampling on DNA barcoding. Syst. Biol. 61(5), 851–69 (2012).
Clark, K., Karsch-Mizrachi, I., Lipman, D. J., Ostell, J. & Sayers, E. W. GenBank. Nucleic Acids Res. 44 (2016).
Leray, M., Knowlton, N., Ho, S. L., Nguyen, B. N. & Machida, R. J. GenBank is a reliable resource for 21st century biodiversity research. Proc. Natl. Acad. Sci. USA 116(45), 22651–22656 (2019).
Ratnasingham, S. & Hebert, P. D. N. BOLD: The Barcode of Life Data System (www.barcodinglife.org). Mol. Ecol. Notes 7, 355–364 (2007).
Locatelli, N. S., McIntyre, P. B., Therkildsen, N. O. & Baetscher, D. S. GenBank’s reliability is uncertain for biodiversity researchers seeking species-level assignment for eDNA. Proc. Natl. Acad. Sci. USA 117, 32211–32212 (2020).
Collins, R. A. et al. Meta-Fish-Lib: A generalised, dynamic DNA reference library pipeline for metabarcoding of fishes. J. Fish Biol. 99(4), 1446–1454 (2021).
Pentinsaari, M., Ratnasingham, S., Miller, S. E. & Hebert, P. D. N. BOLD and GenBank revisited - Do identification errors arise in the lab or in the sequence libraries? PLoS One 15(4), e0231814 (2020).
Emeis, K. C. et al. The North Sea – A shelf sea in the Anthropocene. J. Mar. Syst. 141, 18–33 (2015).
Moullec, F. et al. Identifying and addressing the anthropogenic drivers of global change in the North Sea: a systematic map protocol. Environ. Evid. 10, 19 (2021).
Daewel, U., Akhtar, N., Christiansen, N. & Schrum, C. Offshore wind farms are projected to impact primary production and bottom water deoxygenation in the North Sea. Commun. Earth Environ. 3, 292 (2022).
Hebert, P. D. N., Cywinska, A., Ball, S. L. & DeWaard, J. R. Biological identifications through DNA barcodes. Proc. Roy. Soc. B: Biol. Sci. 270, 313–321 (2003).
Folmer, O., Black, M., Hoeh, W., Lutz, R. & Vrijenhoek, R. DNA primers for amplification of mitochondrial cytochrome c oxidase subunit I from diverse metazoan invertebrates. Mol. Mar. Biol. Biotechnol. 3(5), 294–299 (1994).
Geller, J. B., Meyer, C. P., Parker, M. & Hawk, H. L. Redesign of PCR primers for mitochondrial Cytochrome c oxidase subunit I for marine invertebrates and application in all-taxa biotic surveys. Mol. Ecol. Res. 13(5), 851–861 (2013).
Barco, A., Raupach, M. J., Laakmann, S., Neumann, H. & Knebelsberger, T. Identification of North Sea molluscs with DNA barcoding. Mol. Ecol. Resour. 16, 288–297 (2016).
Laakman, S., Boos, K., Knebelsberger, T., Raupach, M. J. & Neumann, H. Species identification of echinoderms from the North Sea by combining morphology and molecular data. Helg. Mar. Res. 70, 18 (2016).
Raupach, M. J. et al. The application of DNA barcodes for the identification of marine crustaceans from the North Sea and adjacent regions. PLoS ONE 10(9), e0139421 (2015).
Beermann, J. et al. Cryptic species in a well-known habitat: applying taxonomics to the amphipod genus Epimeria (Crustacea, Peracarida). Sci. Rep. 8, 6893 (2018).
Ratnasingham, S. & Hebert, P. D. N. A DNA-based registry for all animal species: The Barcode Index Number (BIN) System. PLoS ONE 8(7), e66213 (2013).
Christodoulou, M. et al. A taxonomically reliable DNA barcode reference library for North Sea macrobenthos. https://doi.org/10.6084/m9.figshare.28408385.
Obst, M. et al. Marine long-term biodiversity assessment suggests loss of rare species in the Skagerrak and Kattegat region. Mar. Biodivers. 48, 2165–2176 (2017).
Derycke, S. et al. DNA-based monitoring of soft sediments. Pilot 1 Report (Interreg North Sea Region GEANS, EU, 2023).
Herman, P. M. J., Stolte, W. & van der Heijden, L. Summary presence/absence maps of macro-endobenthos in the greater North Sea, based of nearly 100,000 samples from 65 assembled monitoring data sets. EMODNET biology data product, https://www.emodnet-biology.eu/data-catalog?module=dataset&dasid=6617 (2020).
Zettler, M. L. et al. An annotated checklist of macrozoobenthic species in German waters of the North and Baltic Seas. Helgol. Mar. Res. 72, 5 (2018).
Estoup, A., Largiader, C., Perrot, E. & Chouruout, D. Rapid one-tube DNA extraction for reliable PCR detection of fish polymorphic markers and transgenes. Mol. Mar. Biol. Biotechnol. 5(4), 295–298 (1996).
Layton, K. K. S., Corstorphine, E. A. & Hebert, P. D. N. Exploring Canadian echinoderm diversity through DNA Barcodes. PLoS ONE 11(11), e0166118 (2016).
Carr, C. M., Hardy, S. M., Brown, T. M., Macdonald, T. A. & Hebert, P. D. N. A Tri-Oceanic perspective: DNA barcoding reveals geographic structure and cryptic diversity in Canadian polychaetes. PLoS ONE 6(7), e22232 (2011).
Kearse, M. et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28, 1647–1649 (2012).
Katoh, K., Misawa, K., Kuma, K. I. & Miyata, T. MAFFT: a novel method for rapid multiple sequence alignment based 585 on fast Fourier transform. Nucleic Acids Res. 30, 3059–3066 (2002).
Christodoulou, M. et al. GenBank. https://identifiers.org/ncbi/bioproject:PRJNA1236822.
Christodoulou, M. et al. Dataset: DS-GEANS1. GEANS Reference. dx. https://doi.org/10.5883/DS-GEANS1 Library.
Rimet, F. et al. Metadata standards and practical guidelines for specimen and DNA curation when building barcode reference libraries for aquatic life. Metabarcoding Metagenom. 5, e58056, 17–33 (2021).
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Tamura, K., Stecher, G. & Kumar, S. MEGA11: Molecular Evolutionary Genetics Analysis version 11. Mol. Biol. Evol. 38, 3022–3027 (2021).
Acknowledgements
Funding for this research was received through GEANS – Genetic tools for Ecosystem health Assessment in the North Sea region, an Interreg project supported by the North Sea Programme of the European Regional Development Fund of the European Union. Ship time on RV Belgica and GeoSurveyor XI was provided by BELSPO and RBINS-OD Nature. We thank the crew for the logistic support during the sampling campaigns. Also, the crew of RV Simon Stevin and VLIZ are thanked for providing ship time and logistic support. ILVO colleagues Stephie Seghers, Jan Wittoeck, Charles LeFranc, Felien Festjens are acknowledged for helping in collecting, identifying and/or photographing the Belgian specimens. Additional funding for ILVO was provided by the Sand Fund of the Federal Public Service Economy (Belgium).
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
M.C. Participation in collecting trips, data production and curation, project managment, visualization, writing original draft, writing review and editing. S.D. Conceptualization, funding acquisition, project administration, data curation, validation, writing review and editing. K.K.B. Data production, validation, project Administration,Writing review and editing. H.H. Data production, participation in collecting trips, data curation, validation. S.L., K.L., E.K., S.K., S.M., C.U., L.V.D.B. Data production, data curation, validation, writing review and editing. R.H. Writing review and editing. B.V.D.H. Funding acquisition, data production, validation, project administration, writing review and editing A.D.B. Conceptualization, funding acquisition, project administration, participation in collecting trips, writing review and editing. P.M.A. Conceptualization, original investigation design, funding Acquisition, project administration, participation in collecting trips, writing review and editing.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Christodoulou, M., Derycke, S., Beentjes, K.K. et al. A taxonomically reliable DNA barcode reference library for North Sea macrobenthos. Sci Data 12, 1198 (2025). https://doi.org/10.1038/s41597-025-05500-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-025-05500-z