Fig. 3
From: A globally synthesised and flagged bee occurrence dataset and cleaning workflow

Duplicate occurrence summary. Two bar plots showing (a) the total number of records and (b) the proportion of records in each dataset that were duplicates (sand), kept duplicates (light green), and unique (dark green). Duplicates are occurrences that were matched to a record in the same or another dataset and were therefore flagged or discarded. Kept duplicates are also matched occurrences, but they are the version of each matched record that was retained. Unique occurrences are those that were not matched to any other occurrence and were also kept. The included datasets are the Global Biodiversity Information Facility (GBIF), Symbiota Collections of Arthropods Network (SCAN), Integrated Digitized Biocollections (iDigBio), the United States Geological Survey (USGS), the Atlas of Living Australia (ALA), Victorian and Western Australian Museum (VicWam; refs. 5, 39), Elle Pollination Ecology Lab (EPEL; ref. 26), the Connecticut Agricultural Experiment Station (CAES; refs. 30, 31), Ecdysis (Ecd; ref. 28), Allan Smith-Pardo (ASP), Gaiarsia (Gai; ref. 29), Bombus Montana (BMont; ref. 27), Ballare et al. (Bal; ref. 36), Armando (Arm), Bob Minckley (BMin), Elinor Lichtenberg (Lic; ref. 37), Eastern Colorado (EaCO), Texas literature data (SMC), and Dorey literature data (Dor; refs. 3, 4, 38).