Determining a role for Patient and Public Involvement and Engagement (PPIE) in genomic data governance for cancer care

Sahan, Katherine; Turner, Lesley; Hallowell, Nina; Parker, Michael; Lucassen, Anneke

doi:10.1038/s41431-025-01866-1

Article
Open access
Published: 23 May 2025

European Journal of Human Genetics (2025)

A Comment to this article was published on 18 July 2025

Abstract

Comprehensive collections of cancer data, including genomic data, are needed to improve cancer risk prediction and treatments. A recent government review, Better, Broader, Safer: Using health data for research and analysis, has argued for high-quality Patient and Public Involvement and Engagement (PPIE) for ethical data use. In this paper we determine a role and justification for PPIE to govern uses of genomic data in fields like cancer. First, we analyse two public attitudes studies about the role of PPIE in genomics governance. Second, we characterise two ethically-significant features of the context of governing genomic data: 1) data aggregation leading to novel group formation, and 2) the hybrid territory of genomic cancer data uses. Thirdly, we bring together these aspects to describe a fully determined role for PPIE within an approach to governing cancer genomic data, which is tailored to major areas of ethical consideration. Our account is a novel interpretation of what PPIE is for in governance, how it may foster public support and how its success in so doing depends on it being tailored to context.

Introduction

Approximately 1 in every 2 people in the UK will develop cancer in their lifetime [1]. Comprehensive collections of data about cancer patients enable understanding of the causes, prevention and treatment of cancer. These increasingly include—or link to—genomic data [2, 3]. As well as research, collections may be used for evaluating clinical performance, audit, and education [4]. Genomic data within collections can greatly assist understanding of cancer predisposition, progression and outcomes [5]. Using this data ethically is important for continued public trust and support.

Patient and Public Involvement and Engagement (PPIE) is increasingly part of discussions about public trust and data ethics^{Footnote 1}. For example, Goldacre and Morley’s recent review of using health data for research and analysis has recommended high-quality PPIE for achieving ‘productive and ethical’ data use [6]. Others have associated PPIE with what will secure public interests, trust or support [6, 10, 11, 12, 13, 14, 15, 16]. Still others have emphasised an important role for PPIE within research governance [17, 18], saying that ‘[p]ublic involvement in research governance can help research be more transparent and gain public trust’ [18]. However, it is important to determine how precisely PPIE might function justifiably in particular governance contexts. In this paper, we seek to determine a role for PPIE in the context of governing genomic cancer data uses which is worthy of public support.

Why focus on public support for cancer genomics?

The field of genomic medicine promises substantial future impacts in healthcare outcomes [12] and risk prediction [19]. In this paper, we focus on cancer genomics because of cancer’s high disease burden and public visibility, and concur with others that public opinion about genomic medicine might depend on its successes in improving cancer outcomes [20, 21]. It is also the case that cancer (and rare disease) has been a priority focus for major genomic medicine initiatives to date [3, 22, 23, 24, 25]. These initiatives have seen cancer genomic data collections grow and be called upon to perform increasingly sophisticated analyses, including long-read sequencing and multi-modal data analysis [23, 26], magnifying the scale and complexity of data analysis and linkage. Together this makes cancer genomics a priority area for developing data governance arrangements that are worthy of public support. While our focus here is on cancer genomics, the arguments we develop will have a wider applicability for genomic data governance as a whole.

To determine an appropriate PPIE role, we first identify the scope of such a role (subsection “What ought to be the role of PPIE in the governance of genomics?” of section “Methods”), then consider how to arrange governance appropriately (subsection “Arranging governance” of section “Methods”). This involves identifying and analysing ethically significant features when using genomic data for research and other activities in fields like cancer. Lastly, we describe a fully determined role for PPIE responsive to ethically significant features of the governance context.

Methods

What ought to be the role of PPIE in the governance of genomics?

We begin with an analysis of two recent studies about publics and (genomic) data governance. We go on to situate the role of PPIE within a governance approach which is sensitive to key ethical challenges of genomic data use in fields such as cancer (“Arranging governance” of section “Methods”).

Study 1: The Ipsos Mori dialogue

The Ipsos Mori dialogue was commissioned in 2019 by Genomics England [14]^{Footnote 2}. The study was a response to a 2016 UK Chief Medical Officer (CMO) report which called for a rethink of the ‘social contract’^{Footnote 3} in healthcare in light of genomic medicine [27].

The study recommended a ‘lay expert’ panel to help prevent negative social outcomes from uses of genomic data such as a ‘stratified society which disenfranchises vulnerable members’ p37 [14]:

Ultimately, the public want policy makers to design a system that prevents these things [negative social outcomes] from happening in practice. Rational ignorance means participants are willing to defer its design to the NHS and experts with no vested interests (such as a panel of “lay-experts” e.g. 100,000 Genomes Project Participant Panel) ibid

The concept of ‘rational ignorance’ used here refers to the idea that participants may sometimes judge it reasonable to remain uninformed about the precise details of genomic data governance whilst nonetheless being content to participate in the research it governs. This gives scope to involve publics in a genomic data governance system but does not mandate their involvement [14]. Importantly, governance should be designed so that it supports and protects the interests of marginalised and underserved sections of society.

Study 2: National Data Guardian (NDG) Dialogue

The NDG Dialogue was conducted by Hopkins Van Mil and was commissioned in 2020 by the National Data Guardian for Health and Social Care (NDG) and by Understanding Patient Data [10]. It examines ‘how people assess public benefit in the use of health and adult social care data for purposes beyond individual care’ p5. The report recommends that authentic public engagement is a prerequisite for public benefit, warning that:

‘Public benefit is undermined if authentic public engagement is not integrated into data assessment. This requires engaging people from a cross-section of society in data assessment processes.’ p55.

Data assessment processes are not defined in the study, but the implication is that they include governance or strategic decision-making processes to judge whether data uses are publicly beneficial. Public engagement could include ‘a data assessment jury to be drawn on for complex ‘edge’ assessment cases with, for example an ethical dimension…’ p55.

Analysis of these studies

What can be learned from these cases? Both studies support a role for PPIE in governance on the grounds of reducing inequalities in the delivery of genomic medicine and fostering well-founded public support (either through successful social contracting, or through robustly assessing public benefit). The NDG study emphasises that PPIE should be authentic and non-tokenistic. Notwithstanding these areas of broad agreement, the two studies also leave some aspects of the role of PPIE in governance unclear. We discuss these below to make progress on determining its role.

PPIE in relation to wider publics or society

The Ipsos study uses the concept of rational ignorance (RI) to explain why wider publics may reasonably choose to remain uninformed about the details of genomic data governance, deferring on these matters to narrower groups such as lay expert panels (p37 footnote^{Footnote 4}) [14]. However, care should be taken not to over-interpret this descriptive account of public engagement with genomics towards normative conclusions. Firstly, too much focus on wider publics choosing to remain uninformed will start to look like a reason not to do PPIE, disenfranchising publics or groups who already struggle to engage with genomics or access genomic services. Secondly, a focus on RI runs counter to a social contracting principle, also from the Ipsos study, that publics should ‘understand and appreciate the value of the care given’ in return for using genomic medicine services (p11) [14]. The principle suggests successful social contracting depends on publics making an effort to understand aspects of genomics relevant to their care. These could include both what governance of genomic data entails (and why good data governance is important for their care) and complexities within care delivery itself, e.g. that genomic results are far less deterministic than publics perceive and often have uncertain clinical significance. In light of this, one important role for PPIE within governance could be to judge what constitutes relevant and sufficient information for wider publics about how genomic data is governed and used in care. This would both represent the interests of wider publics within the governance process and help uphold social contracting (and public support).

PPIE constitution and procedural approach

How should PPIE be arranged within governance? The NDG study advocates PPIE that engages a cross-section of society, suggesting that ad hoc data assessment juries be convened in cases with ‘an ethical dimension’ ([10] p.5). As well as the benefit of cross-sectional representation, such jury models are claimed as beneficial because they help publics think critically and give reasons for their views ([28, 29], p110). Finally, convening PPIE as a jury ad hoc might make governance more proportionate [6].

However, convening juries only in certain cases means PPIE is not integral to the governance process. This may be disadvantageous if we think PPIE should decide whether a planned data use requires ethics review, as well as contributing to the review itself. Secondly, developing and improving governance processes could be more complex without the institutional knowledge or memory of a regular or ‘standing’ PPIE input. This suggests that PPIE input should be regular not ad hoc.

Lay (PPIE) vested interests

It is problematic to suggest, as the Ipsos study appears to, that lay experts on a governance panel should not have vested interests. Like other experts, lay experts frequently have a particular interest in and experience of a disease area, joining governance or funding panels because of this. Rather than objecting to vested interests, it seems important that all governance panel members, lay or otherwise, openly acknowledge potential conflicts of interest and that these are managed within governance processes. For example, composition of PPIE panels could be arranged to reflect broadly different attitudes among publics and patients to governing genomic cancer data collections. Patients are said to bring the value of their experience to PPIE activities [30]. Those patients with rare or complex diagnoses or care journeys may be more likely to support data uses, more excited about scientific discovery from these uses, and more minded to reduce barriers to use. In contrast, publics are said to be more ‘disinterested’, socially accountable and hold ‘public or common views rather than special expertise’ [30]. Publics might thus exercise more caution about uses, especially given a backdrop of prior data scandals [15] and discussions around harms from unethical personal data use [31].

Arranging governance

It is important for PPIE to be tailored to the context in which it operates. In this section, we discuss two particularly significant features for how to arrange governance of genomic data usage appropriately in fields like cancer: (1) data aggregation leading to novel group formation; (2) the hybrid (research-clinical) territory often inhabited by genomic data uses. In doing this our aim is both to highlight ethical challenges particular to these kinds of data uses, and to make consideration of those challenges core to a PPIE panel’s role.

Data aggregation leading to novel group formation

The first ethical feature of interest arises out of the fact that certain uses of genomic cancer data will lead to new groups being characterised based on shared genetic characteristics. There are complexities and features of such aggregation to which governance should be sensitive. One example is the possibility that such groups could suffer disadvantage, discrimination (e.g. where large genomic data aggregates ‘reveal health patterns of a certain sub-group’ or perpetuate ‘strong racial biases’) or ‘dignitary harm’ p6 [32]. Such harms could be compounded if groups are already disadvantaged for some other reason, e.g. racism. These harms are not uniquely associated with using cancer genomic data. Even so, the stigma of having cancer might be higher or lead to more harms, e.g. higher insurance premiums.

Caution about novel group formation relates to a wider discourse about ethical challenges of diversifying genomic data [33]. In particular, there are profound complexities in the diversification project, namely attempts to identify and better represent underrepresented groups in genomics and so reduce health inequities. It is possible for diversification attempts to compound biases and assumptions arising from imposed social or political constructs such as race [34]. Some scholars have called for more nuanced approaches to analysis methods like genetic ancestry in genomic analysis, such as not imposing labels or categories on what has been termed the ‘continuous, category-free nature of genetic variation’ [35]. Others, while sharing the caution around such biases and assumptions also describe how genetic knowledge might help support the rights of (disadvantaged) ‘genetic citizens’^{Footnote 5}, and provide ‘leverage for activism and policy initiatives’ to address social and environmental determinants of health inequalities p.39 [34].

PPIE can be used to assess whether proposed data groupings might discriminate, cause dignitary harm or perpetuate inequality, drawing on lived experience of related cancers or the cancer susceptibility genes under study. As part of this, a PPIE role in governance could be to connect and consult with wider publics or communities implicated by the proposed uses.

Hybrid territory inhabited by genomic cancer data uses

The second feature of interest is whether genomic cancer data use needs to be understood as a ‘hybrid’ research-clinical activity [36, 37] for the purpose of arranging and conducting governance.

Data uses for cancer care rely on a hybrid combination of clinical and research activities. Data interpretation necessitates research input and research requires linked clinical details to make useful inferences. Machine learning (ML) is often used to combine and analyse heterogeneous data at scale and is commonly used in fields like cancer genomic data science to develop computational tools for gene detection and variation [38, 39, 40]. When certain types of ML such as adaptive ML are used, data uses also function to help the ML models continuously learn, and may be generating generalizable ‘research’ results for other, future patients [41].

This hybrid activity sits against a backdrop of historically distinct governance mechanisms for clinical practice and research. This is partly due to distinct normative commitments, centrally the pursuit of patient benefit (in clinical practice) versus developing generalizable knowledge (in research). Thus, on the face of it, it seems hard to reconcile such opposing commitments within genomic data governance for cancer care without risking unacceptable compromises on patient benefit and care quality.

One approach to arranging governance would argue that the activities (research and clinical) are so hybridised that we cannot or should not disentangle research and care aspects nor apply separate governance analyses/standards. This would be a ‘hybrid’ account of governance. A second approach would argue that irrespective of this hybridisation, use cases should be presented so as to separate out the research- versus clinical-ethics considerations, facilitating separate ethical analyses, even if occurring within a single governance process. We will call this the ‘separationist’ account.

The hybrid account

Developing this kind of governance account entails re-thinking how we should govern uses productively but with awareness of conventional distinctions. The concept of a Learning Health System (LHS) is helpful in this regard [37, 42]. For example, Faden et al. argue for a more ‘inclusivist’ approach to governance comprising clinical activities, research activities, and presumably work which is classed as ‘other’, e.g. data-driven work [43]. The challenges of this approach are how to tailor governance to a more diverse set of uses, and how to resource extra workload arising from an expanded purview.

Faden et al.’s account also exchanges ‘protectionism’ as a central governance principle for ‘justice’. This entails consideration of the risks and benefits of planned data uses at the group, community or society level, rather than just individual-level risks (p226-7) [44]. This requires a detailed account of distributive and social justice in the context of genomic cancer data uses. The former, broadly speaking, is said to be the ‘distribution of all rights and responsibilities in society’ (p226) [44] and the latter is how to decide whether planned data uses will result in unfair burdens for certain parts of society [45]. In respect of cancer genomics, operationalizing such an account within governance is important in order to address widening socioeconomic inequalities in subgroups such as breast and colorectal cancer, and the overall ‘glaring lack of studies vital to promoting health equity’ [21].

The separationist account

A separationist account entails more recognition of why conventional distinctions in research and clinical ethics might matter for governance. For example, judging whether the social value of the research activity might, over time, be at the expense of care outcomes is an important concern [41]. This phenomenon has been described in the setting of personalised health monitoring in adaptive ML where the learning is argued to happen for the sake of others, not individuals’ own clinical needs [41]. As noted in this description, part of the added challenge for a governance approach is to be able to understand the detailed learning objectives of adaptive ML given the opacity of such ML techniques [41]. Even so, a presumption that ML in uses for clinical care is likely to have a ‘learning’ (research) element gives reason to enquire routinely about the non-clinical objectives of ML-driven data uses within governance.

Understanding the context within which data uses happen and findings are generated is also important in order to distinguish (and judge the ethics of) research and care aspects. While the genomic cancer case is not a direct correlate with the example of personalised health monitoring, clinical scientists, clinicians and patients also have to grapple with the fact that genomic findings cannot always be regarded as clinically significant. As technologies are applied in an agnostic setting (i.e. not driven by particular clinical phenotypes) more genomic variants will be found whose clinical significance is uncertain [46]. Deciding clinical significance therefore becomes not just a technical question of using variant interpretation guides and reference libraries well, but also relying on close links with research that can evaluate the significance of findings in different contexts, including different populations. Additionally, practical ethical questions accompany significance decisions, such as judging when and how to include variants on a clinical report, when to revise them, and how to re-contact patients over time [47]. This shows the importance at least to some stakeholders (patients, clinicians, scientists with clinical responsibilities) of distinguishing the different clinical versus research implications of uses. However, it also shows the inter-dependence of iterated, non-linear research-clinical processes within genomic data uses, and how, unlike the personalised health monitoring example, the aims and intention behind uses will be similarly inter-dependent, and often mutually beneficial.

This is one demonstration of how being too separationist in the governance approach will be unwieldy and will unhelpfully stifle learning from data uses. Additionally, as with most research proposals, one major value of novel use cases will be their exploratory nature, making it unclear how possible it is for applicants to governance to predict all relevant ethical considerations. Nevertheless, any predictions will help, progressively, to characterise the ethical landscape of use. This sees governance itself as a learning entity, a repository for institutional knowledge about what ethical considerations are pertinent to use, what frameworks to use, where to draw the line and make trade-offs etc.

Both hybrid and separationist accounts, then, are useful to consider for governance, the first to characterise the data and describe how practically uses will go, the second to highlight the practical and ethical implications for care which follow from data uses.

Results: a PPIE role within a governance approach tailored to genomic data use in fields like cancer

What does all of this mean for the governance of genomic data uses in fields like cancer, and for PPIE’s role in that? Following Goldacre and Morley, we have argued that effective PPIE is a key factor in well-founded public trust and confidence in health data analysis and research [6]. We have argued that PPIE within the governance of big genomic data repositories or initiatives has the potential to foster legitimate grounding for public support. To this end we think that developing a coherent and effective PPIE role within the governance of cancer genomics is needed. In what follows we recommend a PPIE role within a governance approach which is sensitive to key ethical challenges of genomic data use in fields such as cancer and here set out a number of key governance aims^{Footnote 6}.

Inclusive and representative

The PPIE role should be inclusive and representative of a cross section of publics or society. The latter might necessitate members holding vested interests; indeed this is to be expected and welcomed to an extent. This is because vested interests, when well managed, can allow members to, for example, represent and advocate for certain underserved groups or disease areas. However, members should be self-aware about how such interests may affect their approach to governance (e.g. a broad techno-optimism about science among patients contrasted with more caution among publics) and their corresponding judgements. They should be able to state this to others and have their judgements reasonably challenged as part of a procedurally justified decision-making process [48]. This helps raise awareness of the subtlety and complexity of conflicts of interest (COIs) among members without more standard commercial- or research-based interests.

Working in the public interest and transparency

The PPIE panel should also have ways to connect to and serve the interests of wider publics. This will firstly be part of their inclusion and representation work e.g. connecting to PPIE networks which look to minority group cancer interests [49]. Secondly, it will be in order to make governance more transparent and accountable, and so worthy of public support. This is in so far as PPIE members, through their governance work, can promote relevant and sufficient understanding among wider publics of information about how genomic data is governed and used in their care. The panel should especially comprise or connect to members of underserved communities, in order to gain better understanding of ethical issues pertaining to them, be they risks of harms to groups or other areas of inequality in data uses [50]. In this way, their role also helps social accountability within governance, since they are helping to involve ‘potential data subjects’ in governance considerations (p9) [32].

Addressing inequity

Thirdly, and also addressing an area of profound public interest, a PPIE role should address concerns of inequity in the collection and use of genomic data for cancer care. This should be in the context of governance which is sensitive to data aggregation and diversification attempts. Governance should also develop justice as a principle central to its deliberations in order to support the range of issues arising from the hybrid nature of data uses [42, 51]. This involves weighing the risks and benefits of genomic data uses at the population level and considering whether uses are just, particularly in light of any existing inequities in genomic cancer care or research metrics. It also means recognising that a just provision of cancer care through genomics is about adequate and fair contributions from groups to its societal aims, so balancing protections of groups with societal well-being. This balancing work is a complex part of the role and depends on developing and operationalising an account of distributive and social justice as applied to the case of genomic data uses in cancer, building on similar types of endeavour such as the LHS and research which considers the case for its ethical governance or related checks and balances [52, 53]. (It is envisaged that this starts out as a set of principles which is refined by experience of being on the panel and considering different sets of uses, and which may also be informed by expert opinion, changes in social policy, etc.). Despite recognising the hybrid territory inhabited by genomic data uses, it is important to monitor whether data uses are pursuing generalizable knowledge at the expense of adequately serving the needs of individuals. This recognises the importance of separating research ethics and clinical ethics analyses in order to highlight the practical and ethical implications for care.

Managing risk

The PPIE role may be multifaceted when it comes to questions of risk. As above, it could be valuable in discerning what planned uses might lead to risks, and whether these risks are justified by the aims of the project or projected compensating benefits. Secondly, the role could help to suggest what controls and security measures are possible to mitigate risks. This could be recommending use of a Trusted Research Environment [6, 54] or other secure environment as well as discussing feasible alternatives (e.g. if data projects cannot afford to use these options). Importantly, the emphasis should be on managing risks and burdens (as well as benefits) fairly as part of a justice-based account.

Managing legacy through learning governance

Finally, PPIE panels can more effectively serve a governance approach if they are standing panels which meet regularly. This gives them an ongoing, incremental knowledge of the area of genomic data and cancer care and an opportunity to broaden their governance expertise and value to the process. It also avoids a sense of tokenism since they are integral to the governance process and are not used sporadically. This represents a form of iterative data governance where PPIE panels are authentically involved in the sense they can appreciate over time what their input is achieving, and their iterated contributions count towards the learning of the system. Thinking of governance itself as a learning system reflects its potential as a forum for a dynamic, multidirectional process of knowledge-making and -receiving with PPIE at its core [42, 55]. It may be that more than one panel is needed to manage workload, specialised uses, or uses which involve particular underserved groups. Panels should be appropriately trained prior to their involvement and remunerated especially if taking on longstanding governance commitments.

Conclusion

Including PPIE as a key component of effective governance capable of fostering well-founded public support might help address barriers to the data uses within comprehensive cancer data collections. Such a role for PPIE fosters a more dynamic exchange where researchers, patients and the public contribute to collaborative environments for science and knowledge to flourish. Cultivating such environments is essential to ensure that research not only drives scientific progress but also aligns with societal needs and values. In this paper we have considered how PPIE should function within appropriate governance for genomic data for cancer care. We analysed two public attitudes studies about the role of PPIE in governance of genomics. This was set alongside analysis of two ethically-significant features of genomic research: data aggregation and novel group formation, and issues of hybridisation. This functioned to more fully determine a role for PPIE, situated in a governance approach sensitive and tailored to key areas of ethical consideration.

Our analysis led us to suggest a PPIE role with the following aims:

1. the role should include and represent a cross section of publics or society. Any vested interests of PPIE members (and other members) should be embraced so far as they contribute to a procedurally justified governance process.
2. the role should connect to and serve the interests of wider publics, also helping to promote a basic understanding of genomic data governance to aid public support.
3. it should be oriented to concerns of inequity in data uses, employing a governance framework where social justice plays a central role.
4. the role should help make judgements about risk and benefits, also being aware of appropriate risk mitigation controls.
5. the panel should operate as a standing panel which meets regularly (and is remunerated appropriately), broadening their knowledge, experience and value to the process, and making governance itself a learning system.

This is not to under-estimate the complexity or breadth of the governance approach we propose – it is likely that multiple bodies would be needed, administered by an overarching strategic administration, in order to manage increasing volumes and proliferating categories of use, even within the same data repository or registry. Additionally, further ethics research is needed to tackle complex questions around how to operationalise justice-based accounts of ethical governance effectively, given the proposal to move away from protectionism.

Notes

Definitions and practices of Patient and Public Involvement and Engagement (PPIE) are highly variable and are accompanied by a rich history of patient advocacy and activism. For the purposes of this paper, we use the description of PPIE in reference [6] as a working definition: ‘The most useful, successful, and impactful health data research projects are often those that: design projects with, and for, patients and the public from the outset; involve a diverse range of representatives in every decision, from data definitions, to interpretation and dissemination; listen to (and act on) the advice, feedback, and input of patient representatives; and treat their values, beliefs and experiences as crucial to success…’ (6, p145). For more information about the origins and history of PPIE in England see [7], in Europe see [8], in the US see [9].
A declaration that Michael Parker was a co-author of [14].
The report defines the social contract as follows: ‘where patients, the wider public, health professionals, and researchers have shared expectations – a “social contract” – about what constitutes reasonable and acceptable uses of patient data and samples’.
‘It’s impossible to know everything. Some degree of ignorance is inevitable, and we all must personally decide what is worth knowing and understanding. Rational ignorance helps each of us decide what information would be most useful. Rational ignorance means intentionally choosing to remain uninformed on a topic because the cost of acquiring the information is greater than the estimated potential benefits.’
‘The notion of genetic citizenship refers to the obligations, rights, duties, and forms of care that circulate between citizens and the state’ [35, p. 38].
We see PPIE as central to decision-making within governance but also envisage other, non-lay decision-makers within the governance approach.

References

CRUK. Cancer risk statistics. https://www.cancerresearchuk.org/health-professional/cancer-statistics/risk. Accessed 3 July 2023.
Siesling S, Louwman WJ, Kwast A, van den Hurk C, O’Callaghan M, Rosso S, et al. Uses of cancer registries for public health and clinical research in Europe: Results of the European Network of Cancer Registries survey among 161 population-based cancer registries during 2010-2012. Eur J Cancer. 2015;51:1039–49. https://doi.org/10.1016/j.ejca.2014.07.016.
Sosinsky A, Ambrose J, Cross W, Turnbull C, Henderson S, Jones L, et al. Insights for precision oncology from the integration of genomic and clinical data of 13,880 tumors from the 100,000 Genomes Cancer Programme. Nat Med. 2024;30:279–89. https://doi.org/10.1038/s41591-023-02682-0.
Henson KE, Elliss-Brookes L, Coupland VH, Payne E, Vernon S, Rous B, et al. Data resource profile: National Cancer Registration Dataset in England. Int J Epidemiol. 2020;49:16-h https://doi.org/10.1093/ije/dyz076.
Turnbull C, Sud A, Houlston RS. Cancer genetics, precision prevention and a call to action. Nat Genet. 2018;50:1212–8. https://doi.org/10.1038/s41588-018-0202-0.
Goldacre B, Morley G. Better, broader, safer: using health data for research and analysis. A review commissioned by the Secretary of State for Health and Social Care. Department of Health and Social Care. 2022. https://www.gov.uk/government/publications/better-broader-safer-using-health-data-for-research-and-analysis. Accessed 3 July 2023.
Williams O, Robert G, Martin GP, Hanna E, O’Hara J. Is co-production just really good PPI? Making sense of patient and public involvement and co-production networks. In: Bevir M, Waring J, editors. Decentring health and care networks. Organizational Behaviour in Healthcare. Cham: Palgrave Macmillan; 2020. https://doi.org/10.1007/978-3-030-40889-3_10.
Fredriksson M, Tritter JQ. Disentangling patient and public involvement in healthcare decisions: why the difference matters. Sociol Health Illn. 2017;39:95–111. https://doi.org/10.1111/1467-9566.12483.
Frank L, Forsythe L, Ellis L, Schrandt S, Sheridan S, Gerson J, et al. Conceptual and practical foundations of patient engagement in research at the patient-centered outcomes research institute. Qual Life Res. 2015;24:1033–41. https://doi.org/10.1007/s11136-014-0893-3.
NDG. Putting Good into Practice. A public dialogue on making public benefit assessments when using health and care data. National Data Guardian; 2021. https://www.gov.uk/government/publications/putting-good-into-practice-a-public-dialogue-on-making-public-benefit-assessments-when-using-health-and-care-data. Accessed 24 April 2025.
Milne R, Morley KI, Howard H, Niemiec E, Nicol D, Critchley C, et al. Trust in genomic data sharing among members of the general public in the UK, USA, Canada and Australia. Hum Genet. 2019;138:1237–46. https://doi.org/10.1007/s00439-019-02062-0.
HMGov. Genome UK: the future of healthcare. HM Government; 2020. https://www.gov.uk/government/publications/genome-uk-the-future-of-healthcare. Accessed 24 Apr 2025.
van Staa TP, Goldacre B, Buchan I, Smeeth L. Big health data: the need to earn public trust. BMJ. 2016;354:i3636 https://doi.org/10.1136/bmj.i3636.
Ipsos. A public dialogue on genomic medicine: time for a new social contract? Ipsos Mori. 2019. https://www.ipsos.com/en-uk/public-dialogue-genomic-medicine-time-new-social-contract-report. Accessed 24 Apr 2025.
Carter P, Laurie GT, Dixon-Woods M. The social licence for research: why care.data ran into trouble. J Med Ethics. 2015;41:404–9. https://doi.org/10.1136/medethics-2014-102374.
Aitken M, de St Jorre J, Pagliari C, Jepson R, Cunningham-Burley S. Public responses to the sharing and linkage of health data for research purposes: a systematic review and thematic synthesis of qualitative studies. BMC Med Ethics. 2016;17:73 https://doi.org/10.1186/s12910-016-0153-x.
Health Research Authority. Planning & Improving research > Best Practice > Patient and Public Involvement. 2025. https://www.hra.nhs.uk/planning-and-improving-research/best-practice/public-involvement/.
NIHR. UK Standards for Public Involvement. 2025. https://sites.google.com/nihr.ac.uk/pi-standards/standards/governance.
de Villiers CM S, Moorthie S, Kroese M, Blackburn L. Polygenic scores for cancer. PHG Foundation; 2022. www.phgfoundation.org.
Shendure J, Findlay GM, Snyder MW. Genomic medicine-progress, pitfalls, and promise. Cell. 2019;177:45–57. https://doi.org/10.1016/j.cell.2019.02.003.
Saulsberry L, Olopade OI. Precision oncology: directing genomics and pharmacogenomics toward reducing cancer inequities. Cancer Cell. 2021;39:730–3. https://doi.org/10.1016/j.ccell.2021.04.013.
NHS England 2024. Accelerating genomic medicine in the NHS. A strategy for embedding genomics in the NHS over the next 5 years. 2025. https://www.england.nhs.uk/long-read/accelerating-genomic-medicine-in-the-nhs/.
Genomics England 2024. Genomics England > Our Initiatives > Cancer 2.0. 2025. https://www.genomicsengland.co.uk/initiatives/cancer.
CanGene-CanVar. 2024. https://www.cangene-canvaruk.org/about-cangene-canvar. Accessed 18 Sep 2024.
US Precision Medicine Initiative. 2015. https://obamawhitehouse.archives.gov/precision-medicine.
Rashbass J, Peake M. Editorial. Eur J Cancer Care (Engl). 2014;23:757–9. https://doi.org/10.1111/ecc.12259.
CMO. Annual Report of the Chief Medical Officer 2016 Generation Genome. Chief Medical Officer [Internet]. 2016. https://www.gov.uk/government/publications/chief-medical-officer-annual-report-2016-generation-genome. Accessed 24 April 2025.
Coote A, Lenaghan J. Citizens’ juries: theory into practice. Institute for Public Policy Research; 1997.
Dzur AW. Juries, Juries Everywhere (But Not Inside the Courts). In: Punishment, participatory democracy, and the jury. Oxford University Press; 2012. p. 110.
McCoy MS, Warsh J, Rand L, Parker M, Sheehan M. Patient and public involvement: two sides of the same coin or different coins altogether? Bioethics. 2019;33:708–15. https://doi.org/10.1111/bioe.12584.
McCoy MS, Allen AL, Kopp K, Mello MM, Patil DJ, Ossorio P, et al. Ethical responsibilities for companies that process personal data. Am J Bioeth. 2023;23:11–23. https://doi.org/10.1080/15265161.2023.2209535.
Ferretti A, Ienca M, Sheehan M, Blasimme A, Dove ES, Farsides B, et al. Ethics review of big data research: what should stay and what should be reformed? BMC Med Ethics. 2021;22:51 https://doi.org/10.1186/s12910-021-00616-4.
Hardcastle F, Lyle K, Horton R, Samuel G, Weller S, Ballard L, et al. The ethical challenges of diversifying genomic data: A qualitative evidence synthesis. Camb Prisms Precis Med. 2024;2:e1 https://doi.org/10.1017/pcm.2023.20.
Timmermans S, Shostak S. Gene worlds. Health. 2016;20:33–48. https://doi.org/10.1177/1363459315615394.
Lewis ACF, et al. Getting genetic ancestry right for science and society. Science. 2022;376:250–2. https://doi.org/10.1126/science.abm7530.
Hallowell N, Cooke S, Crawford G, Lucassen A, Parker M. Distinguishing research from clinical care in cancer genetics: theoretical justifications and practical strategies. Soc Sci Med. 2009;68:2010–7. https://doi.org/10.1016/j.socscimed.2009.03.010.
Horton R, Lucassen A. Genomic testing in healthcare: a hybrid space where clinical practice and research need to co-exist. Expert Rev Mol Diagn. 2019;19:963–7. https://doi.org/10.1080/14737159.2019.1672540.
Huang K, Xiao C, Glass LM, Critchlow CW, Gibson G, Sun J. Machine learning applications for therapeutic tasks with genomics data. Patterns. 2021;2:100328 https://doi.org/10.1016/j.patter.2021.100328.
Kang M, Kim S, Lee DB, Hong C, Hwang KB. Gene-specific machine learning for pathogenicity prediction of rare BRCA1 and BRCA2 missense variants. Sci Rep. 2023;13:10478 https://doi.org/10.1038/s41598-023-37698-6.
Cubuk C, Garrett A, Choi S, King L, Loveday C, Torr B, et al. Clinical likelihood ratios and balanced accuracy for 44 in silico tools against multiple large-scale functional assays of cancer susceptibility genes. Genet Med. 2021;23:2096–104. https://doi.org/10.1038/s41436-021-01265-z.
Sparrow R, Hatherley J, Oakley J, Bain C. Should the use of adaptive machine learning systems in medicine be classified as research? Am J Bioeth. 2024:1–12. https://doi.org/10.1080/15265161.2024.2337429.
Faden RR, Kass NE, Goodman SN, Pronovost P, Tunis S, Beauchamp TL. An ethics framework for a learning health care system: a departure from traditional research ethics and clinical ethics. Hastings Cent Rep. 2013;Spec No:S16–27. https://doi.org/10.1002/hast.134.
Kass NE, Faden RR, Goodman SN, Pronovost P, Tunis S, Beauchamp TL. The research-treatment distinction: a problematic approach for determining which activities should have ethical oversight. Hastings Cent Rep. 2013;Spec No:S4-S15. https://doi.org/10.1002/hast.133.
Beauchamp TL, Childress JF. Principles of biomedical ethics. Oxford, United States: Oxford University Press, Incorporated; 2001.
National Commission for the Protection of Human Subjects of Biomedical and Behavioral Research. The Belmont report: Ethical principles and guidelines for the protection of human subjects of research. Washington, D.C.: U.S. Government Printing Office; 1979.
Loong L, Garrett A, Allen S, Choi S, Durkie M, Callaway A, et al. Reclassification of clinically-detected sequence variants: Framework for genetic clinicians and clinical scientists by CanVIG-UK (Cancer Variant Interpretation Group UK). Genet Med. 2022;24:1867–77. https://doi.org/10.1016/j.gim.2022.05.002.
Sahan K, Lyle K, Carley H, Hallowell N, Parker M, Lucassen AM. Ethical preparedness in genomic medicine: how NHS clinical scientists navigate ethical issues. J Med Ethics. 2024;50:517–22.
Daniels N, Sabin J. Limits to health care: fair procedures, democratic deliberation, and the legitimacy problem for insurers. Philos Public Aff. 1997;26:303–50. https://doi.org/10.1111/j.1088-4963.1997.tb00082.x.
BlackinCancer. BlackinCancer: Strengthening networks and highlighting black excellence in cancer research and medicine. 2023. https://www.blackincancer.com/. Accessed 24 Apr 2025.
Parker M. Deliberative bioethics. In: Principles of health care ethics. Wiley. 2006. p. 185–91.
Munung NS, de Vries J, Pratt B. Genomics governance: advancing justice, fairness and equity through the lens of the African communitarian ethic of Ubuntu. Med Health Care Philos. 2021;24:377–88. https://doi.org/10.1007/s11019-021-10012-9.
Kass NE, Faden RR. Ethics and learning health care: the essential roles of engagement, transparency, and accountability. Learn Health Syst. 2018;2:e10066 https://doi.org/10.1002/lrh2.10066.
Asch DA, Joffe S, Bierer BE, Greene SM, Lieu TA, Platt JE, et al. Rethinking ethical oversight in the era of the learning health system. Healthc (Amst). 2020;8:100462 https://doi.org/10.1016/j.hjdsi.2020.100462.
Sudlow C. The what & why of trusted research environments. 2021. https://understandingpatientdata.org.uk/news/what-why-trusted-research-environments. Accessed 24 Apr 2025.
Banner N. A new approach to decisions about data. 2020. https://understandingpatientdata.org.uk/learning-data-governance-new-approach-decisions-about-data. Accessed 24 Apr 2025.

Acknowledgements

We would like to thank the Cangene Canvar Patient Reference Panel for their contribution to this work. Thanks also to Beth Torr from Cangene for supporting the ethics research work package. Finally, we pay tribute to our colleague and co-author, Nina Hallowell, who sadly passed away during the production of this manuscript.

Funding

This work was supported by grants from Cancer Research UK (C61296/A27223) and the Wellcome Trust (Facilitating ethical preparedness in genomic medicine).

Author information

These authors contributed equally: Michael Parker, Anneke Lucassen.

Authors and Affiliations

Ethox Centre, Oxford Population Health, University of Oxford, Big Data Institute, Old Road Campus, Roosevelt Drive, Oxford, UK
Katherine Sahan, Nina Hallowell & Michael Parker
Patient Reference Panel, Cangene Canvar, London, UK
Lesley Turner
Clinical Ethics, Law, and Society (CELS) Oxford, Nuffield Dept of Medicine, University of Oxford, Oxford, UK
Anneke Lucassen
Centre for Personalised Medicine, University of Oxford, Oxford, UK
Anneke Lucassen
Centre for Human Genetics, Oxford, UK
Anneke Lucassen

Authors

Katherine Sahan
Lesley Turner
Nina Hallowell
Michael Parker
Anneke Lucassen

Contributions

KS and NH developed the concept for the paper. KS wrote first drafts of the manuscript with support from NH, MP and AL. LT provided critical insights from the PPIE perspective. MP and AL substantially revised sections of subsequent versions and the final manuscript. All authors (except NH, who died before final manuscript submission) read and approved the final submitted version.

Corresponding author

Correspondence to Katherine Sahan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethical approval

Ethical approval was not required because this paper does not include data from human research studies where consent or ethical approval would be needed.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Sahan, K., Turner, L., Hallowell, N. et al. Determining a role for Patient and Public Involvement and Engagement (PPIE) in genomic data governance for cancer care. Eur J Hum Genet (2025). https://doi.org/10.1038/s41431-025-01866-1

Received: 23 January 2025
Revised: 18 April 2025
Accepted: 30 April 2025
Published: 23 May 2025
Version of record: 23 May 2025
DOI: https://doi.org/10.1038/s41431-025-01866-1
