Use of artificial intelligence (AI) in haematology is advancing rapidly. Initially, AI was used for pattern recognition, like analysing blood and bone marrow slide images and radiographs. Now, use has expanded to analysing multi-dimensional clinical data to assist in diagnosis, estimating prognosis, guiding therapy decision-making, and prescribing drugs [1,2,3].

There is considerable debate whether AI should be physician-substituting or physician-complementing [4, 5]. Substituting and complementing are economic concepts best illustrated by examples (Fig. 1). Assume, on a scale of 100, the performance level of skilled physicians is 70 and that of less-skilled physicians, 50. An AI technology increasing their performance to 80 and 70 shrinks the performance gap to 10 and is termed physician-substituting because it disproportionally makes up for a shortage of skills in less-skilled physicians. In contrast, an AI technology which increases the level of skilled and less-skilled physicians to 90 and 60 widens their performance gap to 30 and is termed physician-complementing because it disproportionately increases the advantage of skilled physicians.

Fig. 1: Physician-substituting versus -complementing artificial intelligence.
figure 1

A \({S}_{1}\) and \({S}_{2}\) are performance levels of less-skilled and skilled physicians, respectively, without artificial intelligence assistance. A technology that raises their performance to \({S}_{1}^{{\prime} }\) and \({S}_{2}^{{\prime} }\) and shrinks the gap between them is physician-substituting. B On the other hand, a technology that raises their levels to \({S}_{1}^{{\prime} {\prime} }\) and \({S}_{2}^{{\prime} {\prime} }\), thereby amplifying their performance gap, is physician-complementing. Our definition of substituting and complementing follows ref. [29].

Physician-substituting and -complementing AI technologies are not mutually exclusive. Without resource constraints, we can use physician-substituting AI for less-skilled haematologists and physician-complementing AI for skilled haematologists. However, if we must choose one AI technology, the decision will depend on whether it is more important to improve the basic level of healthcare quality (i.e., developing physician-substituting AI for less-skilled haematologists) or its maximum height (i.e., developing physician-complementing AI for skilled haematologists). Almost everyone, including physicians, tends to rate themselves as above average, but daily experience is also fraught with a sense of inadequacy and insecurity [6]. This raises the difficult question: What is the goal of AI in haematology? Is it to raise the performance of less-skilled haematologists to near that of skilled haematologists, or something else? Different people have different answers, but a rising tide lifts all boats [7]. Physician-substituting AI has the potential to elevate healthcare standards in resource-constrained areas. However, it is equally important to push the performance frontier forward, and for this we argue that physician-complementing AI can play an important role.

Physician-complementing AI empowers skilled physicians to do more and makes them even better at what they do. This does not imply that a physician-complementing AI technology necessarily engages in direct patient care. Consider ambient documentation technology, which focuses on other tasks [8, 9]. Haematologists, regardless of their level of empathy, spend considerable time during a patient encounter on non-patient-facing tasks like documenting the visit in the electronic medical record (EMR) and (re)-ordering laboratory tests and drugs. AI scribes can assume these tasks, allowing haematologists with more empathetic skills to increase their interaction with patients.

We anticipate that many physician-complementing AI technologies will directly contribute to therapy decision-making and drug prescribing. One area where even highly-skilled haematologists may need help is precision medicine, wherein every person’s therapy should consider their unique features, such as genetic variant topography and co-morbidities. As more and more covariates are considered, the curse of dimensionality quickly arises. Even the most intelligent and skilled person cannot consciously process many variables simultaneously because conscious thoughts have a limited bandwidth [10, 11]. The low capacity for conscious deliberation forces decision-makers to focus on a subset of variables with imperfect weighting, distorted by anchoring and recall bias heuristics. In our RAND-Delphi study of breast cancer therapy, haematologists/oncologists claimed they needed data on 15 covariates before opining whether an autotransplant is appropriate therapy for a specific patient. However, when we analysed their recommendations using recursive partitioning, we found they used only 3 or 4 covariates to guide their decision-making [12].

With AI assistance, haematologists will be empowered to consider a much greater number of covariates compared with those they can currently process. Skilled haematologists would be able to process all available clinical and laboratory data when deciding whether to prescribe a drug, because an AI agent is embedded in the EMR to assist them. One example is a recent prospective trial of using autonomous AI to monitor a very large number (>140) of dynamic clinical covariates and assist transplant experts in making complex decisions on pre-emptive drug intervention to prevent grade III–IV acute graft-versus-host disease in people receiving a human leucocyte antigen-haplotype-matched haematopoietic cell transplant [3]. Most physicians and patients agreed to let AI prescribe the drug, even though they were free to modify or decline the AI model’s  recommended prescriptions.

Another potential use case for physician-complementing AI might be deciding whether to recommend intensive induction chemotherapy (cytarabine/daunorubicin) or less intensive therapy (azacitidine/venetoclax) to someone >65 years old with acute myeloid leukaemia, especially if they are scheduled to subsequently receive a haematopoietic cell transplant [13]. In addition, if they achieve a histological complete remission, would they benefit from more chemotherapy pretransplant? [14] Should results of measurable residual disease- and/or genetic variant topography-testing influence this decision? [15,16,17,18,19] There are many covariates the haematologist should consider in recommending a therapy strategy [20]. AI might assist the haematologist and improve his/her therapy recommendation, an assumption testable in a randomised controlled trial (RCT).

Another example is deciding whether to prescribe an immune therapy drug for someone with Hodgkin lymphoma and a pre-existing autoimmune disorder, wherein it is important to balance benefits and risks but there are few data to inform decision-making [21, 22]. Physician-complementing AI might help haematologists by summarising evidence from diverse data sources and EMRs of similar people with Hodgkin lymphoma receiving immune therapy, thereby providing structure in considering huge numbers of covariates.

In addition to clinical literature data (expert consensus statements, clinical practice guidelines, results of RCTs, etc.), skilled haematologists consider many more covariates, including their experience with similar patients and therapies (recall heuristic), concern for quality-of-life (QoL), doctor-patient interactions, and confidence in their ability to decide on a therapy plan [23]. However, efficacy endpoints in haematology trials are often progression-free survival (PFS) and/or survival. Expert consensus statements and clinical practice guidelines usually assume these endpoints are what people value most, shifting the burden of evidence review from patients and physicians to experts [24]. Nevertheless, people may rank other outcome attributes above survival [25]. For example, some may prefer a therapy with less time spent in healthcare facilities compared with another therapy associated with longer survival but more time in healthcare facilities [26]. Yet others may prioritise avoiding pain and/or avoiding adverse events over longer PFS or survival. Haematologists should adjust therapy decisions based on these considerations [27]. However, currently there is no established best practice for this task [28].

Patient preference-sensitive decision-making can be broken down into 2 steps: (1) identifying a patient’s preferences; and (2) prioritising therapy options based on their preferences. For the 1st step, AI can assist the haematologist and patient to take part in a structured conversation to discover the patient’s weighting of outcome attributes, including survival, treatment response, risk of recurrence, therapy-related adverse events, QoL, modes of care delivery, burden on caregiver(s), and stress related to time and cost. Discovered patient preferences can be cross-checked with previous EMRs. For instance, AI perusal of EMRs might reveal frequent prior hospital and/or clinic visits related to therapy-related adverse events, perhaps corroborating a patient’s stated low tolerance of intensive therapy. Subsequently, in the 2nd step, another AI module ranks available therapy options based on the patient’s preferences. However, currently, expert consensus statements and clinical practice guidelines prioritise PFS and/or survival over other outcome attributes and do not support preference-sensitive decision-making because they are not structured in a way that metadata of therapy options beyond PFS and survival are easily queryable. Responsibility of future task forces for drafting clinical practice guidelines will include systematic review and compilation of benefits and harms of therapy options, rather than prescribing one optimised patient care pathway assuming a universal preference for PFS or survival.

There are caveats to our forecast of the future of AI in haematology. 1st, we write from the perspective of high-resource, tertiary medical centres with sophisticated EMRs with ready interoperability with AI systems. Our forecast might not operate at all hospitals and clinics. 2nd, presently, most medical AI studies are not done in prospective settings, and performance is often measured on highly-curated datasets using metrics not directly connected to patient well-being. Prospective testing of model performance using credible benchmarks is needed to persuade sceptics. 3rd, throughout our discussion, we assume AI is a co-pilot partnering with haematologists rather than a fully autonomous arbitrator of medical decision-making. Whatever recommendations AI makes have to be ratified by physicians in accordance with current regulations in most jurisdictions. The more complex a decision-making setting, the more necessary the requirement for physicians to have the competence to evaluate recommendations generated by AI. A natural corollary to this is that the most sophisticated AI models are necessarily physician-complementing rather than -substituting because they can be safely used only by skilled physicians. And 4th, for rare diseases and/or new therapies, there might be insufficient data to support preference-sensitive decision-making even if we can reliably measure patients’ outcome preferences. Under these circumstances, decision-making needs to rely on skilled physicians’ judgement.

There is consensus that AI will transform haematology. Exactly how and when are being debated. With modern cooking techniques (e.g., sous vide, flash freezing, and ultrasonic homogenising), master chefs create new, previously unthinkable dishes. We suggest that physician-complementing AI will allow haematologists to take patient care to new heights.