Introduction

Neoadjuvant (or primary) systemic therapy (NST) is increasingly used in the treatment of early breast cancer as it confers multiple potential benefits for patients1. These include permitting downstaging of locoregional treatments such as surgery2,3 and radiation therapy (RT)4 in those who have a good treatment response and allowing response-adapted tailoring of adjuvant systemic therapies5,6 with escalation or de-escalation of treatments to optimise both oncological outcomes and quality of life. As treatment response is also highly prognostic on an individual level7,8, NST has become the standard of care for specific disease subtypes9,10,11.

Locoregional therapies, however, remain an integral component of the effective management of early breast cancer and accurate information about the surgery and RT performed following completion of NST is essential for the meaningful interpretation of trial results. The quality and consistency of locoregional treatment reporting in NST trials, however, is poor and needs to be improved12,13. This was first highlighted in 2018 by the Early Breast Cancer Trialists Collaborative Group (EBCTCG) meta-analysis of the outcomes of neoadjuvant and adjuvant chemotherapy, which included ten trials involving 5,250 women and suggested a higher rate of locoregional recurrence (LRR) in patients receiving NST12. This difference in LRR did not translate into significant differences in either distant recurrence or overall survival between the groups and the authors commented that the poor quality of locoregional treatment reporting in the included trials precluded meaningful interpretation of the results12. While it could be argued that the trials included in the EBCTCG meta-analysis were largely historic and reporting will have improved over time, a recent systematic review of contemporary NST studies suggests that locoregional treatment reporting remains heterogenous and inconsistent13. This review which included 137 NST trials involving 575,531 women published between 2018 and 2023 showed that most studies continued to report minimal, if any information regarding the surgery and RT performed13. Not only does this hamper accurate interpretation of key oncological outcomes such as LRR, it may also explain the consistent failure to translate breast cancer downstaging into a reduction in the extent of surgery in both trials and real-world cohorts, where significant proportions of patients continue to undergo mastectomy despite achieving a complete pathological response (pCR)2,3,14.

One solution to addressing poor and inconsistent outcome reporting in NST trials may be to develop and implement a core outcome set (COS), ‘an agreed standardized set of outcomes that should be measured and reported, as a minimum, in all clinical trials’15. For this approach to be successful, however, it is vital that the international breast cancer research community is engaged in the development of a COS and committed to future implementation. The aim of the PRECEDENT (imProving REporting of loCoregional therapies in nEoaDjuvant brEast caNcer Trials) project was therefore to work with key international stakeholders to develop a COS for locoregional treatment measurement and reporting in NST trials.

Results

Phase 1: Generation of a list of outcome domains

A total of 747 locoregional outcomes (510 surgical and 237 RT related) identified from the systematic review13 were categorised into 68 surgical and 28 RT outcome domains. Focused discussion with key stakeholders identified one additional surgical and four additional RT outcome domains for inclusion. Following discussion and iterative refinement by the steering group, a total of 101 outcome domains (69 surgical and 32 RT related) were carried forward for survey prioritisation.

Phase 2: Outcome domain prioritisation

The 101 outcome domains grouped thematically into nine surgery and four RT sections and formatted into 101 questionnaire items for inclusion in the international Delphi survey.

A total of 470 healthcare professionals prioritised at least one COS item in Round 1 and there was good multidisciplinary representation from the surgical (n = 206, 43.8%), medical (n = 144, 30.6%) and radiation (n = 98, 20.8%) oncology communities (Table 1). Respondents were predominantly from Europe (n = 274, 58.3%) but there was broad geographical representation reflecting good international engagement with the survey. Most participants were research active (n = 437, 93.0%), experienced clinicians (>10 years at consultant/attending level n = 307, 65.3%) working in Academic/University/Teaching hospital settings (n = 338, 71.9%) and over half (n = 248, 55.8%) identified as female.

Table 1 Demographics of Delphi participants

A total of 336 (71.5%) Round 1 participants (153/206 (74.3%) surgeons, 90/144 (62.5%) medical oncologists and 77/98 (78.6%) radiation oncologists) prioritised at least one COS item in Round 2. Respondent demographics were similar between rounds (Table 1).

Scores for each outcome domain following Round 2 are summarised by stakeholder group in Supplementary Table 1. Applying the revised definitions of consensus (Table 2), 31 outcome domains were scored ‘consensus in’ (scored ‘very important’ by ≥70% surgeons and/or radiation oncologists and/or other professionals). These were reviewed and, where thematically relevant, combined into 15 summary outcome domains that were carried forward for review and ratification at the consensus meeting (Supplementary Table 2). Ten outcome domains were scored as ‘no consensus’ (scored very important by 60–70% of surgeons and/or radiation oncologists) so were carried forward for discussion and voting at the meeting. The remaining 60 outcome domains were scored ‘consensus out’ (scored ‘very important’ (7–9) by <60% surgeons and/or radiation oncologists) and were carried forward to the consensus meeting to ratify their exclusion (Supplementary Table 1).

Table 2 Definitions of consensus

Phase 3: Consensus meeting

Twenty-three experienced professionals including seven surgeons, 13 radiation oncologists, two medical oncologists, a radiologist and five patient advocates with broad international representation (Europe n = 18, North America n = 4; Middle East 2, Asia n = 3; Australia n = 1) attended the in-person consensus meeting. Following discussion, participants agreed to include the 15 ‘consensus in’ summary outcome domains in the final COS but proposed that the ‘side effects of RT’ domain was amended to reflect the morbidity associated with locoregional treatments (surgery and RT) more broadly. Minor revisions of outcome domain wording were discussed and ratified. Following discussions and voting, none of the 10 ‘no consensus’ items met the threshold for inclusion (≥70% meeting participants scoring outcome ‘very important’) in the COS so these and the 60 ‘consensus out’ items were excluded. The final revised 15-item COS for locoregional outcome reporting in NST studies was agreed and ratified (Table 3).

Table 3 Final core outcome set for locoregional treatment outcome reporting in breast cancer neoadjuvant systemic therapy trials

Discussion

This robust international consensus process has developed a core outcome set for locoregional outcome reporting in breast cancer neoadjuvant systemic therapy trials. It includes 15 outcomes that all key stakeholders think are essential to measure and report as a minimum in all future NST studies. Surgical outcomes include the type of breast and axillary surgery performed; how response to NST is assessed; the adequacy of surgery performed, the proportion of patients who had their breast or axillary surgery downstaged and how many avoided surgery due to an exceptional treatment response. Radiation therapy outcomes included receipt of RT and details of target volumes, dose and fractionation. Short and long-term locoregional treatment morbidity (as defined in the protocol) was also a key outcome. Reporting this essential information regarding locoregional therapy in NST trials will improve their quality and value, ensure that results can be translated into practice and important oncological outcomes such as locoregional recurrence can be interpreted with confidence.

Previous attempts have been made to standardise outcome reporting16,17, but these have not been widely implemented into practice. One reason for this may be that these recommendations were developed based on expert opinion without engagement of the wider breast cancer community. The PRECEDENT COS has specifically been developed in collaboration with key professional stakeholders and experienced patient advocates involved in breast cancer trials worldwide using established methods for consensus to promote engagement and future implementation of the COS from the start18. The COS developed using this collaborative, inclusive and methodologically robust approach, therefore includes outcomes that are meaningful to the international breast cancer community, who should then feel empowered to promote the uptake and implementation of the COS in future NST studies.

Complexity may also represent a barrier to implementation. Several international groups have robustly developed detailed recommendations for locoregional therapy reporting19, but collecting significant amounts of additional data is time consuming and expensive. This is a specific concern in the NST setting where many trials are designed and funded by the pharmaceutical industry to meet regulatory requirements, often without the involvement of surgeons or radiation oncologists to emphasise the importance and relevance of reporting locoregional treatments. A COS is, by definition, an essential set of outcomes that are important to all key stakeholders that should be reported as a minimum in all future NST trials. This minimalistic approach to locoregional outcome selection may represent a more acceptable strategy to improving outcome reporting in NST trials but engagement with funders, trialists and regulators including the US Food and Drug Administration and the UK Medicines and Healthcare products Regulatory Agency will be essential to ensure robust locoregional outcome reporting in all future studies of drugs used in the neoadjuvant setting.

It is also important to consider how the purpose of this COS differs from other breast cancer metrics, such as quality indicators (QIs)20. Core outcome sets aim to improve the quality and consistency of outcome reporting in trials, facilitate data synthesis and reduce research waste21. They represent a minimum reporting standard in clinical trials; they do not restrict the number of outcomes that can be measured or imply that other outcomes are not important. Quality indicators (QIs), by contrast, are used to monitor the quality of clinical care in routine practice. Unlike core outcome sets, which are limited to a small number of essential outcomes that must be reported18, there is no restriction on the number of QIs that may be proposed, as these should, by definition, cover all aspects of the treatment pathway. This difference in function explains why the COS and published QIs differ. Outcomes such as ‘time from completion of NST to surgery’ for example are not considered essential to include in the COS for locoregional outcome reporting in NST trials, but are a key quality indicator20 as delays in surgery adversely impact oncological outcomes22. By contrast, the proportion of patients with an exceptional response post NST who are able to omit or avoid surgery is a core outcome that should be reported in all NST trials but is not currently considered a quality indicator used to measure the quality of the care received.

This work has limitations that should be considered. Firstly, given the complexity of the topic, the consensus process was undertaken in English. This may have limited the participation to individuals with high levels of English proficiency. However, the key stakeholders for this COS were experienced breast cancer trialists. English is the primary language for scientific communication so restricting the process to English would be unlikely to significantly impact the results.

A robust international engagement strategy was embedded in the project a priori to optimise future implementation. Central to this strategy was a partnership with the BIG-NCTN network and the recruitment of multidisciplinary representatives of the major international breast cancer research groups to the steering group to provide access to global speciality networks. This strategy was partially successful, with excellent engagement with professionals from Europe and good international participation overall. There were, however, notably fewer Delphi participants from North America (<10%) with less than two thirds of these individuals participating in both survey rounds. Reasons for this are unclear but there was excellent international representation at the consensus meeting to ensure the final COS was acceptable in different healthcare settings. Similarly, despite strong steering group representation from the radiation oncology community, proportionally fewer radiation oncologists participated in the Delphi. This group, however, were the most likely to complete both survey rounds and were well represented at the consensus meeting, providing confidence that the RT outcomes included in the COS were the most important to radiation oncologists more broadly.

Finally, although patients did not participate in the Delphi, every effort was made to ensure that both the project and the final COS were patient-focused. This was achieved from the start, with experienced patient advocate members of the steering group actively participating in discussions about the project and helping to shape the Delphi survey. Strong international patient representation at the consensus meeting ensured the patient perspective was central to discussions. The final COS, therefore, included outcomes that were equally important to patients as well as professionals.

For the newly developed COS to improve the quality and consistency of locoregional outcome reporting in breast cancer NST trials, however, it will need to be accepted and effectively implemented into practice. To facilitate this process, the PRECEDENT group have worked with experienced trialists and data managers from the Austrian Breast and Colorectal Cancer Study Group to operationalise the 15-item COS into an easy-to-use, practical case report form (CRF) that can be embedded in all future NST trials (Supplementary Materials 3). The CRF is available in several electronic formats that can be uploaded into trial databases, minimising the effort required to include the COS in a trial dataset. The CRF includes clear, well-defined outcomes that will allow all trialists to consistently collect the same, minimum locoregional treatment dataset across all future NST studies; promote standardised implementation of the COS in future trials and facilitate data pooling and future meta-analysis. It may also enable streamlined data collection using artificial intelligence-driven real-world datasets as research methods continue to evolve23,24. Work is now needed to engage funders, study sponsors and regulatory agencies worldwide in the need to mandate inclusion of the locoregional reporting COS in all future NST studies, improving their quality and value to patients, trialists and the international breast cancer community as part of our ongoing efforts to improve care.

Methods

The COS was developed using international consensus methods in accordance with Core Outcome Measures in Effectiveness Trials (COMET)18 and Core Outcome Set-STAndards for Development (COS-STAD) guidelines25. The full protocol including key definitions of ‘locoregional therapy’ and ‘NST’ has been published previously26 and the study prospectively registered on the COMET website (https://www.comet-initiative.org/Studies/Details/2854).

The PRECEDENT project comprised three phases: 1) Generation of a list of outcomes from the literature and interviews with key stakeholders; 2) Outcome prioritisation in an international Delphi survey; 3) A consensus meeting to agree the final COS (Fig. 1).

Fig. 1
figure 1

PRECEDENT COS development process.

Stakeholder, patient and public engagement

This project was developed and delivered in conjunction with the Breast International Group (BIG) and the North American National Cancer Institute National Clinical Trials Network (NCI-NCTN) (BIG-NCTN collaborative group). A collaborative approach was chosen to promote engagement with the international breast cancer research community, to ensure the future COS would be broadly applicable across all healthcare settings and to optimise future adoption and implementation. An expert steering group, including representatives from the major breast cancer research networks across the world and patient advocates, was convened to provide overall oversight of the project. The steering group comprised 23 key stakeholders with expertise in breast cancer NST trials including patients, surgeons, medical and radiation oncologists, radiologists, pathologists, trialists and methodologists. Patient advocates advised on all aspects of the project and attended the consensus meeting to ensure the final COS included outcomes that were relevant and meaningful to patients.

Phase I: Generation of a list of outcomes

A systematic review of primary NST studies published between 2018 and 2023 was used to identify locoregional outcome domains for inclusion in the Delphi survey13. Outcomes were defined broadly and included traditional outcomes such as complications following surgery, as well as additional factors which would influence the interpretation of locoregional recurrence, such as the adequacy of surgical resection or target volumes irradiated. All outcomes were extracted verbatim by two reviewers, with discrepancies resolved by discussion. The long list was then categorised into outcome domains by the study team (MJ/SP/SAMcI) and discussed with a purposively selected group of expert stakeholders (surgeons and radiation oncologists) to review the comprehensiveness of the list and identify additionally relevant domains in a series of brief focused interviews. The final long list of outcome domains was reviewed and refined by the steering group prior to outcome prioritisation.

Phase II: Outcome domain prioritisation

Sequential rounds of an international multistakeholder Delphi survey were used to prioritise the outcome domains for inclusion in the COS.

Outcome domains were operationalized into survey questionnaire items for inclusion in the Delphi survey. Questions were written in English and reviewed by the international steering group to ensure the terminology used was appropriate and comprehensible to participants from different geographical regions. The survey was piloted with a small group of surgeons and radiation oncologists who were not involved in the project to ensure face validity prior to launch.

Respondents were asked how important it would be to include each outcome domain in a COS on a nine point Likert scale ranging from 1 (not important) to 9 (extremely important) based on the Grading of Recommendations Assessment, Development and Evaluation (GRADE) scale for including items in the final COS27.

Key stakeholders were professionals involved in breast cancer NST trials worldwide. This included surgeons, radiation and medical oncologists, radiologists and methodologists. Steering group members circulated the Round 1 survey widely through their professional networks, breast cancer groups and social media to promote participation and international engagement.

Following extensive discussion with the steering group, including patient advocates, it was agreed that patients would not be recruited to participate in the Delphi survey. This was due to the highly technical nature of the surgery and radiation oncology outcome domains. An alternative patient involvement strategy was devised. This included active patient advocate involvement in survey development including review of the long list to ensure its comprehensiveness and patient participation in the consensus meeting to ensure the final COS maintained a patient focus. To enable their full participation in the consensus meeting, preparatory briefing meetings were held with attending patient advocates to answer questions and explain the processes/methods that would be involved.

Participants completed two sequential survey rounds. Round 1 was open between 03/28/24 and 06/13/2024 and Round 2 between 06/26/24 and 08/02/2024. The survey was administered online using REDCap28 data capture software to facilitate international engagement and collaboration. Respondents were categorised into three key stakeholder groups: (i) surgeons, (ii) radiation oncologists and (iii) other professionals as the views of locoregional therapists were considered particularly relevant to consensus development. All Round 1 participants were invited to participate in Round 2. In Round 2, each survey item was accompanied by anonymised feedback about how that item was scored in Round 1, with the aim of promoting consensus across groups whose views may differ18,29,30. The feedback included (i) the participant’s own Round 1 score, (ii) a table summarising the median scores for each key stakeholder group and (iii) a histogram showing the distribution of scores by the stakeholder group. All items were retained between survey rounds, allowing participants to review their score and re-score each item, considering the feedback received. It was agreed a priori that a third survey round may be required if sufficient consensus was not achieved following Round 2.

Simple summary statistics were used to describe the participants’ characteristics in each survey round.

Following Round 2, the proportion of participants in each key stakeholder group scoring each item as ‘not important’ (score 1–3); ‘equivocal’ (score 4–6) and ‘very important’ (score 7–9) was calculated and each item categorised as ‘consensus in’, ‘consensus out’ or ‘no consensus’.

Consensus was defined a priori (Table 1). However, review of scoring across stakeholder groups between Round 1 and 2 demonstrated two potential issues; i) that survey respondents were reluctant to score outcomes as ‘not important’ (score 1–3) resulting in an unmanageable number of ‘no consensus items’ that would need to be discussed at the consensus meeting and ii) an impractical number of ‘consensus in’ items for inclusion in a future COS. Following discussion with the steering group, it was considered unlikely that a further survey round would result in any further prioritisation. It was therefore decided post hoc to revise the definition of ‘consensus out’ (Table 1). The definition of ‘consensus in’ was retained, but thematically similar items were merged to create a smaller number of broader ‘summary’ outcome domains for inclusion in the final COS. Both the original and proposed summary outcome domains were reviewed and revised by the steering group prior to the consensus meeting, where the outcomes were presented for review and ratification.

All data were analysed in STATA V18 (www.stata.com).

Participant attrition between rounds was monitored. Weekly automatic reminder e-mails were sent to Round 1 participants who had either not completed or only partially completed the Round 2 survey to optimise participation.

Phase III: Consensus meeting

The in-person consensus meeting was held in Barcelona, Spain on 14th September 2024. Professional participants were purposively selected from those individuals who had completed both survey rounds and expressed an interest in attending the meeting. Sampling was based on stakeholder group, gender and geographical location to ensure a broad representation of views. An international group of experienced patient advocates were also invited to attend to ensure the discussions were patient focused. The meeting was facilitated by an independent chair (KC) who encouraged all participants to openly express their views. All participants were sent a meeting pack prior to the event which included a summary of the process and details of the consensus meeting format together with lists of the i) summary ‘consensus in’ and ii) ‘consensus out’ items for review, and iii) ‘no consensus’ items for discussion and voting at the meeting.

At the consensus meeting, a summary of the Delphi survey results was presented together with the patient and professional perspectives on the importance of a COS for context. ‘Consensus in’ and ‘consensus out’ items were then presented and participants given the opportunity for brief discussion to consider any objections to these items being included or excluded. Participants were then asked to vote to ratify the inclusion of outcome domains classified as ‘consensus in’ and exclusion of items classified as ‘consensus out’. Next, the ‘no consensus’ outcome domains were discussed in smaller multidisciplinary breakout groups. Each group was asked to identify up to three ‘no consensus’ items for inclusion in the COS and to provide a supporting justification. Each breakout group reported back verbally to the main group and all participants then voted on whether each ‘no consensus’ outcome domain was ‘very important’, ‘equivocal’ or ‘not important’ to include in the final COS. Domains voted as ‘very important’ by ≥70% of participants were included, either as separate outcomes or, if considered more appropriate, combined with existing ‘consensus in’ domains. All other items were discarded. Voting was conducted anonymously using MentiMeter software (www.mentimeter.com). The consensus meeting concluded with review and ratification of the final COS by meeting participants.

Sample size

There is no standard sample size for consensus processes18 so the aim was to ensure good representation from key stakeholder groups across all geographical regions. A target sample size of 250–300 professionals for the survey and 20–25 participants for the consensus meeting was therefore agreed upon based on similar research31,32.

Ethics

Ethical approval was obtained from the Queen’s University Belfast Faculty of Medicine, Health and Life Sciences Ethics committee (reference MHLS 23_167). Formal written consent was not obtained, as participation and attendance were taken as implied consent.