Table 1 Common sources of data

From: A quantitative analysis of the use of anonymization in biomedical research

Name

Country

Description

Cerner Corporation (Cerner)

US

Supplier of health information technology platforms acquired by Oracle in 2022. Offers access to longitudinal electronic health record data59,67.

Clinical Practice Research Datalink (CPRD)

UK

Funded by the Medicines and Healthcare products Regulatory Agency and the National Institute for Health and Care Research. Offers primary care data from general practices60.

Flatiron Health (Flatiron)

US

A healthcare technology company focusing on cancer care and research. Offers real-world cancer care data68,69.

IBM Watson Health (IBM)

US

A former division of IBM focusing on medical research and healthcare solutions acquired by Francisco Partners in 2022. Offers multiple datasets including IBM Explorys with routine healthcare data70,71.

Institute for Applied Health Research Berlin (InGef)

DE

Research institute connected to statutory health insurances through its owners. Provides anonymized claims data from multiple German health insurances72.

IQVIA

US

Global provider of health information and clinical research services. Offers real-world data, including electronic health records (EHR) and claims data73.

National Prescribing Service – MedicineWise (NPS)

AU

Funded by the Australian Government. Offers routinely collected health records to improve the surveillance of medicine use and primary care in Australia61.

Optum Incorporated (Optum)

US

A healthcare company. Offers various datasets including administrative data, claims data, and electronic health records74.

Secure Anonymised Information Linkage Databank (SAIL)

UK

Funded by the Welsh Government. Offers health and census datasets46.

South London and Maudsley NHS Foundation Trust (SLaM NHS)

UK

Funded by the UK’s National Health Service (NHS). Offers mental health data47.

TriNetX

US

A company focusing on real-world evidence generation by establishing a global network of healthcare organizations and life sciences companies. Offers access to longitudinal electronic health record and insurance data75.

Vanderbilt University Medical Center (VUMC)

US

Funded by Vanderbilt University. The Synthetic Derivative mirrors Vanderbilt’s electronic health record system and can be combined with samples from Vanderbilt’s BioVU biobank for genome-phenome analysis39,76.