Quantum speedup in the identification of cause–effect relations

Chiribella, Giulio; Ebler, Daniel

doi:10.1038/s41467-019-09383-8

Download PDF

Article
Open access
Published: 01 April 2019

Quantum speedup in the identification of cause–effect relations

Giulio Chiribella^1,2,3 &
Daniel Ebler^1,4

Nature Communications volume 10, Article number: 1472 (2019) Cite this article

9305 Accesses
43 Citations
2 Altmetric
Metrics details

Subjects

Abstract

The ability to identify cause–effect relations is an essential component of the scientific method. The identification of causal relations is generally accomplished through statistical trials where alternative hypotheses are tested against each other. Traditionally, such trials have been based on classical statistics. However, classical statistics becomes inadequate at the quantum scale, where a richer spectrum of causal relations is accessible. Here we show that quantum strategies can greatly speed up the identification of causal relations. We analyse the task of identifying the effect of a given variable, and we show that the optimal quantum strategy beats all classical strategies by running multiple equivalent tests in a quantum superposition. The same working principle leads to advantages in the detection of a causal link between two variables, and in the identification of the cause of a given variable.

Experimental aspects of indefinite causal order in quantum mechanics

Article 19 July 2024

A versatile single-photon-based quantum computing platform

Article Open access 26 March 2024

Quantum switch instabilities with an open control

Article Open access 19 November 2024

Introduction

Identifying causal relations is a fundamental primitive in a variety of areas, including machine learning, medicine, and genetics^1,2,3. A canonical approach is to formulate different hypotheses on the cause–effect relations characterizing a given phenomenon, and test them against each other. For example, in a drug test some patients are administered the drug, while others are administered a placebo, with the scope of determining whether or not the drug causes recovery. Traditionally, causal discovery techniques have been based on classical statistics, which effectively describes the behavior of macroscopic variables. However, classical techniques become inadequate when dealing with quantum systems, whose response to interventions can strikingly differ from that of classical random variables^4,5.

Recently, there has been a growing interest in the extension of causal reasoning to the quantum domain. Several quantum generalizations of the notion of causal network have been proposed^{6,7,8,9,10,11,12,13,14,15} and new algorithms for quantum causal discovery have been designed^{16,17,18,19,20}. Besides its foundational relevance, the study of quantum causal discovery algorithms is expected to have applications in the emerging area of quantum machine learning^21,22, in the same way as classical causal discovery algorithms have previously impacted classical artificial intelligence.

An intriguing possibility is that quantum mechanics may provide enhanced ways to identify causal links. A clue in this direction comes from refs. ^17,18, where the authors show that certain quantum correlations are witnesses of causal relationships, in apparent violation of the classical tenet “correlation does not imply causation”. This observation suggests that quantum setups for testing causal relationships could overcome some of the limitations of existing classical setups. However, the type of advantage highlighted in refs. ^17,18 only concerns a limited class of setups, where the experimenter is constrained to a subset of the possible interventions. If arbitrary interventions are allowed, this particular type of advantage disappears. A fundamental open question is whether quantum setups can offer an advantage over all classical setups, without any restriction on the experimenter’s interventions.

Here, we answer the question in the affirmative, proving that quantum features like superposition and entanglement can significantly speed up the identification of causal relations. We start from the task of deciding which variable, out of a list of candidates, is the effect of a given variable. We first analyze the problem in the classical setting, determining the performance of the best classical strategy. Then, we construct a quantum strategy that reduces the error probability by an exponential amount, doubling the decay rate of the error probability with the number of accesses to the relevant variables. Remarkably, the decay rate of our strategy is the highest achievable rate allowed by quantum mechanics, even if one allows for exotic setups where the order of operations is indefinite^23,24. The key ingredient of the quantum speedup is the ability to run multiple equivalent experiments in a quantum superposition. The same working principle enables quantum speedups in a broader set of tasks, including, e.g., the task of deciding whether there exists a causal link between two given variables, and the task of identifying the cause of a given variable.

Results

Theory-independent framework for testing causal hypotheses

Here, we outline a framework for testing causal hypotheses in general physical theories^{25,26,27,28,29,30}. In this framework, variables are represented as physical systems, each system with its set of states. The framework applies to theories satisfying the Causality Axiom²⁸, stating that the probability of an event at a given time should not depend on choices of settings made at future times.

A causal relation between variable A and variable B is represented by a map describing how the state of B responds to changes in the state of A. If the map discards A and outputs a fixed state of B, then no causal influence can be observed. In all the other cases, some change of A will lead to an observable change of B. Hence, we say that A is a cause for B.

In general, the set of allowed causal relationships depends on the physical theory, which determines which maps can be implemented by physical processes. In classical physics, cause–effect relations can be represented by conditional probability distributions of the form p(b|a), where a and b are the values of the random variables A and B, respectively. In quantum theory, cause–effect relations are described by quantum channels, i.e., completely positive trace-preserving maps transforming density matrices of system A into density matrices of system B.

Given a set of variables, one can formulate hypotheses on the causal relationships among them. For example, consider a three-variable scenario, where variable A may cause either variable B or variable C, but not both. The causal relation is described by a process ${\cal C}$, with input A and outputs B and C. Here, we consider two alternative causal hypotheses: either A causes B but not C; or A causes C but not B. The problem is to distinguish between these two hypotheses without having further knowledge of the physical process responsible for the causal relation. This means that the process ${\cal C}$ is unknown, except for the fact that it must compatible with one and only one of the two hypotheses. Mathematically, the two hypotheses correspond to two sets of physical processes, and the problem is to determine which set contains the process ${\cal C}$.

In order to decide which hypothesis is correct, we assume that the experimenter has black box access to the physical process ${\cal C}$. The experimenter can probe the process for N times, intervening between one instance and the next, as illustrated in Fig. 1. In the end, a measurement is performed and its outcome is used to guess the correct hypothesis.

An important question is how fast the probability of error decays with N. The decay is typically exponential, with an error probability vanishing as p_err(N) ≈ 2^−RN for some positive constant R, which we call the discrimination rate. The operational meaning of the discrimination rate is the following. Given an error threshold ε, the error probability can be made smaller than ε using approximately N > log ε⁻¹/R calls to the unknown process. The bigger the rate, the smaller the number of calls needed to bring the error below the desired threshold.

Since the explicit form of the process ${\cal C}$ is unknown, we take p_err(N) to be the worst-case probability over all processes compatible with the two given causal hypotheses. If prior information over ${\cal C}$ is available, one may also consider a weaker performance measure, based on the average with respect to some prior. In the following, we stick to the worst case scenario, as it provides a stronger guarantee on the performance of the test.

Identifying causal intermediaries

A variable B is a causal intermediary for variable A if all the influences of A propagate through B. Physically, one can think of B as a slice of the future light cone of A, so that all causal influences of A must pass through B, as illustrated in Fig. 2. Mathematically, the fact that B is a causal intermediary means that there exists a process ${\cal C}$ from A to B such that for every other variable B′ and for every process ${\cal C}^\prime$ with input A and output B′ one can decompose ${\cal C}^\prime$ as ${\cal C}^\prime = {\cal R}\circ {\cal C}$, where ${\cal R}$ is a suitable process from B to B′.

The condition that a variable is a causal intermediary of another has a simple characterization in all physical theories where processes are fundamentally reversible, meaning that they can be modeled as the result of a reversible evolution of the system and an environment²⁸. The reversibility condition is captured by the expression ${\cal C} = ({\cal I}_B \otimes {\mathrm{Tr}}_{E\prime }){\cal U}({\cal I}_A \otimes \eta _E)$, where variables E and E′ represent the environment (before and after the interaction), η is the initial state of the environment, Tr_E′ is the operation of discarding system E′²⁸, and ${\cal U}$ is a reversible process from AE to BE′.

When the reversibility condition is satisfied, the variable A can be recovered from variables B and E′. If variable B is to be a causal intermediary of A, then the process ${\cal C}$ must be correctable, in the sense that its action can be undone by another process ${\cal R}$. In addition, if the state spaces of variables A and B are finite dimensional and of the same dimension, then the process ${\cal C}$ must be reversible. In classical theory, this means that ${\cal C}$ is an invertible function. In quantum theory, this means that ${\cal C}$ is a unitary channel, of the form ${\cal C}(\rho ) = U\rho U^\dagger$ for some unitary operator U.

In the following, we will consider the task of identifying which variable, out of a given set of candidates, is the causal intermediary of a given variable A. An important feature of this task is that it admits a complete analytical treatment, allowing us to rigorously prove a quantum advantage over all classical strategies. Besides its fundamental interest, this advantage could have applications to the task of monitoring the information flow in future quantum communication networks, allowing an experimenter to determine which node of a quantum network receives information from a given source node.

Optimal classical strategy

Suppose that A, B, and C are random variables with the same alphabet of size d < ∞. In this case, the fact that X ∈ {B, C} is a causal intermediary for A means that the map from A to X is a permutation. The first (second) causal hypothesis is that B (C) is a permutation of A, while C (B) is uniformly random. Other than this, no information about the functional relation between the variables is known to the experimenter. In particular, the experimenter does not know which permutation relates the variable A to its causal intermediary X.

Let us determine how well one can distinguish between the two hypotheses with a finite number of experiments. In principle, we should examine all sequential strategies as in Fig. 1. However, in classical theory the problem can be greatly simplified: the optimal discrimination rate can be achieved by a parallel strategy, wherein the N input variables are initially set to some prescribed set of values³¹.

The possibility of an error arises is when the randomly fluctuating variable accidentally takes values that are compatible with a permutation, so that the outcome of the test gives no ground to discriminate between the two hypotheses. The probability of such inconclusive scenario is equal to P(d, v)/d^N, where v is the number of distinct values of A probed in the experiment and P(d, v) = d!/(d − v)! is the number of injective functions from a v-element set to a d-element set. The probability of confusion is minimal for v = 1, leading to the overall error probability

$$p_{{\mathrm{err}}}^{\mathrm{C}} = \frac{1}{{2d^{N - 1}}}.$$

(1)

As a consequence, the rate at which the two causal hypotheses can be distinguished from each other is

$$R_{\mathrm{C}} = {\mathrm{log}}\,d.$$

(2)

A first quantum advantage

Classical systems can be regarded as quantum systems that lost coherence across the states of a fixed basis, consisting of the classical states. But what if coherence is preserved? Could a coherent superposition of classical states be a better probe for the causal structure?

If the causal relations are restricted to reversible gates that permute the classical states, coherence offers an immediate advantage. The experimenter can prepare N probes, each in the superposition $|e_0\rangle = \mathop {\sum}\nolimits_{i = 0}^{d - 1} |i\rangle /\sqrt d$. Since the superposition is invariant under permutations, the unknown process will produce either N copies of the state |e₀⟩⟨e₀|⊗I/d or N copies of the state I/d⊗|e₀⟩⟨e₀|, depending on which causal hypothesis holds. Using Helstrom’s minimum error measurement³², the error probability is reduced to

$$p_{{\mathrm{err}}}^{{\mathrm{coh}}} = \frac{1}{{2d^N}}.$$

(3)

Compared with the classical error probability (1), the error probability of this simple quantum strategy is reduced by a factor d, which does not change the rate, but could be significant when the size of the alphabet is large.

Let us consider the full quantum version of the problem. Three quantum variables A, B, and C, corresponding to d-dimensional quantum systems, are promised to satisfy one of two causal hypotheses: either (i) the state of B is obtained from the state of A through an arbitrary unitary evolution and the state of C is maximally mixed, or (ii) the state of C is obtained from the state of A through an arbitrary unitary evolution and the state of B is maximally mixed.

Despite the fact that now the cause–effect relation can be one of the infinitely many unitary gates, it turns out that the error probability (3) can still be attained. A universal quantum strategy, working for arbitrary unitary gates, is to prepare d particles in the singlet state

$$\left| {S_d} \right\rangle = \frac{1}{{\sqrt d !}}\mathop {\sum}\limits_{k_1,k_2, \cdots ,k_d} {\epsilon _{k_1k_2 \ldots k_d}} \left| {k_1} \right\rangle \left| {k_2} \right\rangle \cdots \left| {k_d} \right\rangle$$

(4)

where $\epsilon _{k_1k_2 \ldots k_d}$ is the totally antisymmetric tensor and the sum ranges over all vectors in the computational basis. Then, each of the d particles is used as an input to one use of the channel. Repeating the experiment for t times, and performing Helstrom’s minimum error measurement one can attain the error probability $p_{{\mathrm{err}}}^{{\mathrm{coh}}} = (2d^N)^{ - 1}$, with N = td, independently of the unitary gate representing the cause–effect relationship. In summary, the quantum error probability is at least d times smaller than the best classical error probability, even if the cause–effect relationship is described by an arbitrary unitary gate.

Optimality among simple parallel strategies

We now show that the value (3) is optimal among all simple strategies where the unknown process is applied N times in parallel on N identical input systems, as in Fig. 3.

Optimality follows from a complementarity relation between the information about the causal structure and the information about the functional dependence between cause and effect. Suppose that the cause–effect dependence amounts to a unitary gate U in some finite set U. The ability of a state |Ψ〉 to probe the cause–effect dependence can be quantified by the probability $p_{{\mathrm{guess}}}^{\mathrm{U}}$ of correctly guessing the unitary U from the state U^⊗N|Ψ〉. When the set of possibly unitaries has sufficient symmetry, we find that the probability of error in identifying the causal structure satisfies the lower bound

$$p_{{\mathrm{err}}} \ge \frac{1}{{2d^N}}\left\{ {1 + \frac{1}{{2(d^N - 1)}}\left( {\frac{{p_{{\mathrm{guess}}}^{\mathrm{U}} - \frac{1}{{|{\mathrm{U}}|}}}}{{\frac{1}{{|{\mathrm{U}}|}}}}} \right)^2} \right\}$$

(5)

(Supplementary Note 1). The higher the probability of success in guessing the cause–effect dependence, the higher the probability of error in identifying the causal structure. A consequence of the bound (5) is that the minimum error probability in identifying the causal intermediary is (2d^N)⁻¹, and is attained when the success probability $p_{{\mathrm{guess}}}^{\mathrm{U}}$ is equal to the random guess probability 1/|U|.

Exponential reduction of the error probability

The bound (5) shows that the discrimination rate of simple parallel strategies cannot exceed the classical discrimination rate log d. We now show that that the rate can be doubled by entangling the N probes with an additional reference system.

The working principle of our strategy is to build a quantum superposition of equivalent experimental setups. If no reference system is used, we know that the optimal strategy is to divide the N probes into N/d groups (assuming for simplicity that N is a multiple of d), and to entangle the probes within each group. Clearly, different ways of dividing the N inputs into groups of d are equally optimal: it does not matter which particle is entangled with which, as long as all each particle is part of a singlet state. Still, we can imagine a machine that partitions the particles according to a certain configuration i if a control system is in the state |i〉. When the control system is in a superposition, the machine will probe the unknown process in a superposition of configurations, as pictorially illustrated in Fig. 4. Explicitly, the optimal input state is

$$\left| {\mathrm{\Psi }} \right\rangle = \frac{1}{{\sqrt {G_{N,d}} }}\mathop {\sum}\limits_{i = 1}^{G_{N,d}} {\left( {\left| {S_d} \right\rangle ^{ \otimes N/d}} \right)_i} \otimes \left| i \right\rangle ,$$

(6)

where i labels the different ways to partition N identical objects into groups of d elements, G_N,d is the number of such ways, $\left( {\left| {S_d} \right\rangle ^{ \otimes N/d}} \right)_i$ is the product of N/d singlet states arranged according to the i-th configuration, and {|i〉, i = 1, …, G_N,d} are orthogonal states of the reference system.

Classically, there would be no point in randomizing optimal configurations, because mixtures cannot reduce the error probability. But in the quantum case, the coherent superposition of equivalent configurations brings the error probability down to

$$p_{{\mathrm{err}}}^{\mathrm{Q}}(r) = \frac{r}{{2d^N}}\left( {1 - \sqrt {1 - r^{ - 2}} } \right)\mathop{\longrightarrow}\limits^{{r \gg 1}}\frac{1}{{4rd^N}}{\kern 1pt} ,$$

(7)

where r is the number of linearly independent states of the form $\left( {\left| {S_d} \right\rangle ^{ \otimes N/d}} \right)_i$ (Supplementary Note 2).

To determine how much the error probability can be reduced, we only need to evaluate the number of linearly independent states. It turns out that this number grows as d^N, up to a polynomial factor (Supplementary Note 2 again). Taking the logarithm, we obtain the discrimination rate

$$R_{\mathrm{Q}} = - \mathop {{\lim }}\limits_{N \to \infty } \frac{{{\mathrm{log}}\,p_{{\mathrm{err}}}^{\mathrm{Q}}}}{N} = 2\,{\mathrm{log}}\,d,$$

(8)

which is twice the classical discrimination rate (2). In fact, the asymptotic regime is already reached with a small number of interrogations, of the order of a few tens. For example, the causal relation between two quantum bits can be determined with an error probability smaller than 10⁻⁶ using with 12 interrogations, whereas 20 interrogations are necessary for classical binary variables.

The above strategy is universal, in that it applies to causal relationships described by arbitrary unitary gates. In particular, it applies to gates that permute the classical states. Hence, the ability to maintain coherence across the classical states and to generate entanglement with a reference system offers an exponential speedup with respect to the best classical strategy. In passing, we note that the universal quantum strategy is insensitive to the presence of perfectly correlated noise, such as the noise due to the lack of a reference frame³³, where each of the N input variables is subjected to the same unknown unitary gate.

The ultimate quantum limit

So far, we examined strategies where the unknown process is applied in parallel to a large entangled state. Could a general sequence of interventions achieve an even better rate?

Finding the optimal sequential strategy is generally a hard problem. To address this problem, we introduce the fidelity divergence of two quantum channels ${\cal C}_1$ and ${\cal C}_2$, defined as

$$\partial F({\cal C}_1,{\cal C}_2) = \mathop {{\inf }}\limits_R \mathop {{\inf }}\limits_{\rho _1,\rho _2} \frac{{F[({\cal C}_1 \otimes {\cal I}_R)(\rho _1),({\cal C}_2 \otimes {\cal I}_R)(\rho _2)]}}{{F(\rho _1,\rho _2)}},$$

(9)

where ρ₁ and ρ₂ are joint states of the channel’s input and of the reference system R. It is understood that the infimum in the right-hand side is taken over pairs of states (ρ₁, ρ₂) for which the fidelity F(ρ₁, ρ₂) is non-zero, so that the expression on the right-hand side of Eq. (9) is well-defined.

The fidelity divergence quantifies the ability of channels ${\cal C}_1$ and ${\cal C}_2$ to move two states apart from each other. In the Methods section, we show that the error probability in distinguishing between ${\cal C}_1$ and ${\cal C}_2$ with N queries is lower bounded as

$$p_{{\mathrm{err}}}^{{\mathrm{seq}}}({\cal C}_1,{\cal C}_2;N) \ge \frac{{\partial F({\cal C}_1,{\cal C}_2)^N}}{4}.$$

(10)

In particular, suppose that the two channels ${\cal C}_1$ and ${\cal C}_2$ have the form ${\cal C}_1 = {\cal U} \otimes I/d$ and ${\cal C}_2 = I/d \otimes {\cal U}$, where ${\cal U}$ is a fixed unitary channel. In this case, we find that the fidelity divergence is 1/d². Hence, the error probability satisfies the bound

$$p_{{\mathrm{err}}}^{{\mathrm{seq}}}({\cal C}_1,{\cal C}_2;N) \ge \frac{1}{{4d^{2N}}}.$$

(11)

In the causal intermediary problem, the unitary gate ${\cal U}$ is unknown, and therefore the error probability can only be larger than $p_{{\mathrm{err}}}^{{\mathrm{seq}}}({\cal C}_1,{\cal C}_2;N)$. Hence, the identification of the causal intermediary cannot occur at a rate faster than 2 log d.

Equation (11) limits all sequential quantum strategies. But in fact quantum theory is also compatible with scenarios where physical processes take place in an indefinite order^23,24. Could the rate be increased if the experimenter had access to exotic phenomena involving indefinite order?

The answer is negative. In the Methods section, we develop the concepts and methods needed to answer this question, and we show that the minimum error probability in distinguishing between the two channels ${\cal C}_1 = {\cal I} \otimes I/d$ and ${\cal C}_2 = I/d \otimes {\cal I}$ using arbitrary setups with indefinite order satisfies the bound

$$p_{{\mathrm{err}}}^{{\mathrm{ind}}}({\cal C}_1,{\cal C}_2;N) \ge \frac{{1 - \sqrt {1 - \frac{1}{{d^{2N}}}} }}{2}{\kern 1pt} .$$

(12)

Clearly, this bound applies to the causal intermediary problem, which is harder than the discrimination of the two specific channels ${\cal C}_1 = {\cal I} \otimes I/d$ and ${\cal C}_2 = I/d \otimes {\cal I}$. Hence, the rate R_Q = 2 log d represents the ultimate quantum limit to the identification of a causal intermediary.

Extension to arbitrary numbers of hypotheses

The quantum advantage demonstrated in the previous sections can be extended to the identification of the causal intermediary among an arbitrary number k of candidate variables. The best classical strategy still consists of initializing all variables to the same value. Errors arise when the values of two or more output variables are compatible with an invertible function. In the limit of many repetitions, the minimum error probability is $p_{{\mathrm{err}},{\mathrm{k}}}^{\mathrm{C}} = (k - 1)/(2d^{N - 1}) + O\left( {d^{ - 2N}} \right)$. (Supplementary Note 3). For quantum strategies, the best option among simple parallel strategies is still to divide the input particles into N/d groups of d particles and to initialize each group in the singlet state. In Supplementary Note 4, we show that this strategy reduces the error probability to $p_{{\mathrm{err}},{\mathrm{k}}}^{{\mathrm{coh}}} = (k - 1)/(2d^N) + O\left( {d^{ - 2N}} \right)$, for causal relations represented by arbitrary unitary gates.

An exponentially smaller error probability can be achieved using the input state (6). The evaluation of the error probability is more complex than in the two-hypothesis case, but the end result is the same: when the causal dependency is probed N times, the quantum error probability decays at the exponential rate R_Q = 2 log d, twice the rate of the best classical strategy (see Supplementary Note 5 for the technical details).

Applications to other tests of causal hypotheses

The strategies developed in the previous sections can be applied to the identification of causal relations in a variety of scenarios. For example, they can be used to decide whether there is a causal link between two variables A and B. More specifically, they can be used to determine whether variable B is a causal intermediary for variable A or whether B fluctuates at random independently of A. Also in this case, the error probability of the best classical strategy is 1/(2d^N−1), whereas preparing N/d copies of the singlet yields error probability 1/(2d^N).

By superposing all possible partitions of the N inputs into groups of d, one can boost the discrimination rate from log d to 2 log d. One could speculate that, in the future, such a fast identification could be useful as a quantum version of the ping protocol, capable of establishing whether there exists a quantum communication link between two nodes of a quantum internet³⁴.

Another application of our techniques is in the problem of identifying the cause of a given variable. Suppose that one of k variables A₁, A₂, …, A_k is the cause for a given variable B. An example of this situation arises in genetics, when trying to identify the gene responsible for a certain characteristic. Here, the interesting scenario is when the number of candidate causes is large.

Classically, the problem is to find the variable A_x such that B is a function of A_x. For simplicity, we first assume that all variables have the same d-dimensional alphabet, and that the function from A_x to B is the identity, namely b = a_x. In this case, the cause can be identified without any error by probing the unknown process for $\left\lceil {{\mathrm{log}}_dk} \right\rceil$ times. The identification is done by a simple search algorithm, where one divides the candidate variables in d groups and initializes the input variables in the i-th group to the value i. In this way, d − 1 groups can be ruled out, and one can iterate the search in the remaining group. Using a decision tree argument³⁵, it is not hard to see that $\left\lceil {{\mathrm{log}}_dk} \right\rceil$ is the minimum number of queries needed to identify the unknown process in the worst case scenario.

In the quantum version of the problem, we find that the number of queries can be cut down by approximately a half when the number of hypotheses is large. The trick is to prepare k maximally entangled states, and to apply the unknown process to the first system of each pair. Repeating this procedure for N times and using results on port-based teleportation³⁶ we find that the error probability is p_err = (k − 1)/(d^2N + k − 1). Hence, $N = \left\lceil {(1 + \epsilon)({\mathrm{log}}_dk)/2} \right\rceil$ queries are sufficient to identify the cause with vanishing error probability in the large k limit.

In Supplementary Note 6, we consider the more complex scenario where the functional dependence between the cause and effect is unknown, and the only assumption is that the effect is a causal intermediary of the cause. Despite the lack of information about the functional dependence, we show that the correct cause can be still identified with high probability using $N = \left\lceil {(1{\mathrm{ }} + \epsilon)({\mathrm{log}}_dk)/2} \right\rceil$ calls to the unknown process. The fast identification of the cause is achieved by dividing the N copies of each input variable A_i into groups of d copies, preparing each group in the singlet state, and entangling the configuration of the groupings with an external reference system. Once again, the superposition of multiple equivalent setups leads to a quantum speedup over the best classical strategy.

Discussion

We showed that quantum mechanics enhances our ability to detect direct cause–effect links. This finding motivates the exploration of more complex networks of causal relations, including intermediate nodes and global causal dependences between groups of variables^1,2,3. The development of new techniques for testing causal relations could find applications to future quantum communication networks, providing a fast way to test the presence of communication links. It could also assist the design of intelligent quantum machines, in a similar way as classical causal discovery algorithms have been useful in classical artificial intelligence. In view of such applications, it is important to go beyond the noiseless scenario considered in this paper, and to address scenarios where the cause–effect relationships are obfuscated by noise. The techniques developed in our work already provide some insights in this direction. Quite interestingly, one can show that the quantum advantage persists in the presence of depolarizing noise, provided that the noise level is not too high (see Supplementary Note 7). A complete study of the noisy scenario, however, remains an open direction of future research.

Another direction of future investigation is foundational. Given the advantage of quantum theory over classical theory, it is tempting to ask whether alternative physical theories could offer even larger advantages. Interesting candidates are theories that admit more powerful dense coding protocols than quantum theory³⁷, as one might expect super-quantum advantages to arise from the presence of stronger correlations with the reference system. In a similar vein, one could explore physical theories with higher dimensional state spaces, such as Zyczkowski’s quartic theory³⁸, or quantum theory on quaternionic Hilbert spaces³⁹. Indeed, it is intriguing to observe that the classical rate R^C = log d and the quantum rate R^Q = 2 log d are equal to the logarithms of the dimensions of the classical and quantum state spaces, respectively. In general, one may expect a relationship between the dimension of the state space and the rate. Should super-quantum advantages emerge, it would be natural to ask which physical principle determines the causal identification power of quantum mechanics. An intriguing possibility is that one of the hidden physical principles of quantum theory could be a principle on the ability to distinguish alternative causal hypotheses.

Methods

Properties of the fidelity divergence

Here, we derive two properties of the fidelity divergence defined in Eq. (9). First, the fidelity divergence provides a lower bound on the probability of misidentifying a channel with another:

Proposition 1 The probability of error in distinguishing between two quantum channels ${\cal C}_1$ and ${\cal C}_2$ with N queries is lower bounded as $p_{{\mathrm{err}}}^{{\mathrm{seq}}}({\cal C}_1,{\cal C}_2;N) \ge \partial F({\cal C}_1,{\cal C}_2)^N/4$.

The bound can be obtained in the following way. Let $\rho _x^{(N)}$ be the output state of a circuit as in Fig. 1. Then, we have the bound

$$\begin{array}{*{20}{l}} {p_{{\mathrm{err}}}^{{\mathrm{seq}}}({\cal C}_1,{\cal C}_2;N)} \hfill & = \hfill & {\frac{1}{2}\left( {1 - \frac{1}{2}\left\| {\rho _1^{(N)} - \rho _2^{(N)}} \right\|_1} \right)} \hfill \\ {} \hfill & {} \hfill & { \ge \frac{1}{2}\left( {1 - \sqrt {1 - F(\rho _1^{(N)},\rho _2^{(N)})} } \right)} \hfill \\ {} \hfill & {} \hfill & { \ge \frac{1}{2}\left[ {1 - \sqrt {1 - \partial F^N({\cal C}_1,{\cal C}_2)} } \right]} \hfill \\ {} \hfill & {} \hfill & { \ge \frac{1}{2}\left[ {1 - \left( {1 - \frac{{\partial F^N({\cal C}_1,{\cal C}_2)}}{2}} \right)} \right]} \hfill \\ {} \hfill & = \hfill & {\frac{{\partial F({\cal C}_1,{\cal C}_2)^N}}{4}.} \hfill \end{array}$$

(13)

The first line follows from Helstrom’s theorem³², and the second line follows from the Fuchs–Van De Graaf Inequality⁴⁰. The third line follows from the definition of the fidelity divergence (9), which implies that the fidelity between the states right after the (t + 1)-th use of the unknown channel ${\cal C}_x$, denoted by ρ_x,t+1, satisfies the bound

$$\begin{array}{*{20}{l}} {F(\rho _{1,t + 1},\rho _{2,t + 1})} \hfill & { \ge \partial F({\cal C}_1,{\cal C}_2)F({\cal U}_{t + 1}\rho _{1,t},{\cal U}_{t + 1}\rho _{2,t})} \hfill \\ {} \hfill & { \ge \partial F({\cal C}_1,{\cal C}_2)F(\rho _{1,t},\rho _{2,t}),} \hfill \end{array}$$

(14)

where ${\cal U}_{t + 1}$ is the (t + 1)-th operation in Fig. 1. The fourth line follows from the elementary inequality $\sqrt {1 - t} \le 1 - t/2$.

Another important property is that the fidelity divergence can be evaluated on pure states. The proof is simple: let ρ₁ and ρ₂ be two arbitrary states of the composite system AR, where R is an arbitrary reference system. By Uhlmann’s theorem⁴¹, there exists a third system E and two purifications $|\Psi _1\rangle ,|\Psi _2\rangle \in {\cal H}_A \otimes {\cal H}_R \otimes {\cal H}_E$, such that F(Ψ₁, Ψ₂) = F(ρ₁, ρ₂). On the other hand, the monotonicity of the fidelity under partial trace⁴², ensures that the fidelity between the output states $({\cal C}_1 \otimes {\cal I}_{RE})({\mathrm{\Psi }}_1)$ and $({\cal C}_2 \otimes {\cal I}_{RE})({\mathrm{\Psi }}_2)$ cannot be larger than the fidelity between the states $({\cal C}_1 \otimes {\cal I}_R)(\rho _1)$ and $({\cal C}_2 \otimes {\cal I}_R)(\rho _2)$. Hence, the minimization on the right-hand side of Eq. (9) can be restricted without loss of generality to pure states.

Fidelity divergence for the identification of the causal intermediary

Let us see how the fidelity divergence can be applied to our causal identification problem. The two channels are of the form ${\cal C}_{1,U}(\rho ) = U\rho U^\dagger \otimes I/d$ and ${\cal C}_{2,V} = I/d \otimes V\rho V^\dagger$, where U and V are two unknown unitary gates. Since we are interested in the worst case scenario, every choice of U and V will give an upper bound to the discrimination rate. In particular, we pick U = V.

Proposition 2 The fidelity divergence for the two channels ${\cal C}_{1,U}$ and ${\cal C}_{2,U}$ is $\partial F({\cal C}_{1,U},{\cal C}_{2,U}) = 1/d^2$.

By the unitary invariance of the fidelity, $\partial F({\cal C}_{1,U},{\cal C}_{2,U})$ is independent of U. Without loss of generality, let us pick U = I. For a generic reference system R and two generic pure states $|{\mathrm{\Psi }}_1,\rangle |{\mathrm{\Psi }}_2\rangle \in {\cal H}_A \otimes {\cal H}_R$, the two output states are

$$\begin{array}{*{20}{l}} {\rho _{1 }^{\prime}} \hfill & = \hfill & {({\cal C}_{1,I} \otimes {\cal I}_R)({\mathrm{\Psi }}_1) = ({\mathrm{\Psi }}_1)_{BR} \otimes \frac{{I_C}}{d}} \hfill \\ {\rho _{2 }^{\prime}} \hfill & = \hfill & {({\cal C}_{2,I} \otimes {\cal I}_R)({\mathrm{\Psi }}_2) = \frac{{I_B}}{d} \otimes ({\mathrm{\Psi }}_1)_{CR}{\kern 1pt} ,} \hfill \end{array}$$

(15)

up to reordering of the Hilbert spaces. The fidelity can be computed with the relation

$$F(\rho _{1 }^{\prime},\rho _{2 }^{\prime}) = \frac{{\left| {{\mathrm{Tr}}\left[ {\sqrt {({\mathrm{\Psi }}_1)_{BR}({\mathrm{\Psi }}_2)_{CR}({\mathrm{\Psi }}_1)_{BR}} } \, \right]} \right|^2}}{{d^2}}{\kern 1pt} ,$$

(16)

where we omitted the identity operators for the sake of brevity. Let us expand the input states as

$$\begin{array}{*{20}{l}} {\left| {{\mathrm{\Psi }}_x} \right\rangle = \mathop {\sum}\limits_n {\left| {\phi _{xn}} \right\rangle } \otimes \left| n \right\rangle ,\qquad x \in \{ 0,1\} } \hfill \end{array}$$

(17)

where {|n⟩} is an orthonormal basis for the reference system, and {|ϕ_xn⟩} is a set of unnormalized vectors. Inserting Eq. (17) into Eq. (16), we obtain the expression

$$F(\rho _1^\prime ,\rho _2^\prime ) = \frac{{\left| {{\mathrm{Tr}}\left[ {\sqrt {C^\dagger C} } \right]} \right|^2}}{{d^2}} = \frac{{|{\kern 1pt} Tr|C|{\kern 1pt} |^2}}{{d^2}}{\kern 1pt} ,$$

(18)

with $C = \mathop {\sum}\nolimits_n {\kern 1pt} |\phi _{1n}\rangle \langle \phi _{2n}|$. On the other hand, the fidelity between the input states is

$$F(\rho _1,\rho _2) = |\langle {\mathrm{\Psi }}_1|{\mathrm{\Psi }}_2\rangle |^2 = |{\mathrm{Tr}}[C]|^2.$$

(19)

Hence, the fidelity divergence satisfies the bound

$$\begin{array}{*{20}{l}} {\partial F({\cal C}_1,{\cal C}_2)} \hfill & = \hfill & {\mathop {{\inf }}\limits_R \mathop {{\inf }}\limits_{\rho _1,\rho _2} \frac{{F(\rho _1^\prime ,\rho _2^\prime )}}{{F(\rho _1,\rho _2)}}} \hfill \\ {} \hfill & = \hfill & {\frac{1}{{d^2}}\mathop {{\inf }}\limits_C \left| {\frac{{{\mathrm{Tr}}|C|}}{{{\mathrm{Tr}}[C]}}} \right|^2} \hfill \\ {} \hfill & {} \hfill & { \ge \frac{1}{{d^2}},} \hfill \end{array}$$

(20)

having used the inequality |Tr[C]| ≤ Tr|C|, valid for every operator C. The inequality holds with the equality sign whenever C is positive. This condition is satisfied, e.g., when the input states |Ψ₁〉 and |Ψ₂〉 are identical.

Quantum strategies with indefinite causal order

In principle, quantum mechanics is compatible with situations where multiple processes are combined in indefinite order^23,24. This suggests that an experimenter could devise new ways to probe quantum channels, allowing the relative order among different uses of the same channel to be indefinite. We call such strategies indefinite testers.

Consider the problem of identifying a channel ${\cal C}_x$ from N uses. The input resource is the channel ${\cal C}_x^{ \otimes N}$, representing N identical black boxes that can be arranged in any desired order. Besides the product of N independent channels, the most general class of channels with this property is the class of no-signaling channels with N pairs of input/output systems.

Mathematically, an indefinite tester is a linear map from the set of no-signaling channels to the set of probability distributions over a given set of outcomes. Equivalently, the tester can be described by a set of operators {T_x}, where each operator T_x acts on the Hilbert space $\otimes _i({\cal H}_i^{{\mathrm{in}}} \otimes {\cal H}_i^{{\mathrm{out}}})$, where ${\cal H}_i^{{\mathrm{in}}}$ and ${\cal H}_i^{{\mathrm{out}}}$ are the Hilbert spaces of the input and output system in the i-th pair, respectively. When the test is performed on a no-signaling channel ${\cal C}$, the probability of the outcome x is given by the generalized Born rule p_x = Tr[T_xC], where C is the Choi operator of the channel ${\cal C}$⁴³. The normalization of the probabilities

$$\mathop {\sum}\limits_x {{\mathrm{Tr}}[T_x\,C]} = 1$$

(21)

is required to hold for every no-signaling channel ${\cal C}$.

Consider the problem of distinguishing between a set of no-signaling channels $\{ {\cal C}_x\}$ using an indefinite tester. For every probability distribution {π_x}, the worst-case probability of error satisfies the bound

$$p_{{\mathrm{err}}}^{{\mathrm{ind}}} \ge 1 - \mathop {\sum}\limits_x \pi _x\,{\mathrm{Tr}}[T_xC_x]{\kern 1pt} .$$

(22)

Now, suppose that there exists a constant λ and a no-signaling channel ${\cal C}$ such that

$$\lambda {\kern 1pt} C \ge \pi _xC_x$$

(23)

for every x. Substituting Eq. (23) into Eq. (22) one obtains the bound

$$p_{{\mathrm{err}}}^{{\mathrm{ind}}} \ge 1 - \lambda \mathop {\sum}\limits_x {{\mathrm{Tr}}[T_xC]} = 1 - \lambda {\kern 1pt} ,$$

(24)

having used the normalization condition (21). The bound (24) can be seen as a generalization of the classical Yuen–Kennedy–Lax bound for quantum state discrimination⁴⁴.

We now apply the bound (24) to the task of distinguishing between the two channels ${\cal C}_{1,I} = ({\cal U} \otimes I/d)^{ \otimes N}$ and ${\cal C}_{2,I} = (I/d \otimes {\cal U})^{ \otimes N}$. To this purpose, we consider the universal cloning channel⁴⁵

$${\cal C}_ \pm : = \frac{2}{{d^N + 1}}P_ + (\rho \otimes I^{ \otimes N})P_ + ,$$

(25)

and the universal NOT channel⁴⁶

$${\cal C}_ \pm : = \frac{2}{{d^N - 1}}P_ - (\rho \otimes I^{ \otimes N})P_ - ,$$

(26)

with P_± = (I ± SWAP)/2, and SWAP being the unitary operator that swaps between the even and odd output spaces. It is easy to verify that both channels are no-signaling. Moreover, we find that the convex combination ${\cal C} = p_ + {\cal C}_ + + p_ - {\cal C}_ -$ with $p_ \pm = \sqrt {\frac{{d^N \pm 1}}{{2d^N}}} /\left( {\sqrt {\frac{{d^N + 1}}{{2d^N}}} + \sqrt {\frac{{d^N - 1}}{{2d^N}}} } \right)$ satisfies the condition (23) with $\lambda = \frac{1}{2}\left( {\sqrt {\frac{{d^N + 1}}{{2d^N}}} + \sqrt {\frac{{d^N - 1}}{{2d^N}}} } \right)^2$ (see Supplementary Note 8 for technical details). Hence, the bound (24) becomes

$$p_{{\mathrm{err}}}^{{\mathrm{ind}}} \ge 1 - \lambda = \frac{{1 - \sqrt {1 - \frac{1}{{d^{2N}}}} }}{2} \ge \frac{1}{{4d^{2N}}}{\kern 1pt} .$$

(27)

The above bound implies that the discrimination rate of quantum strategies with indefinite order cannot exceed 2 log d.

Data availability

The authors declare that the data supporting the findings of this study are available within the paper and in the Supplementary Information files.

References

Spirtes, P., Glymour, C. N. & Scheines, R. Causation, Prediction, and Search (MIT Press, Cambridge, Massachusetts, United States 2000).
Pearl, J. Causality (Cambridge University Press, Cambridge, United Kingdom 2009).
Pearl, J. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference (Morgan Kaufmann, Burlington, Massachusetts, United States 2014).
Chaves, R. et al. Quantum violation of an instrumental test. Nat. Phys. 14, 291–296 (2018).
Article ADS CAS Google Scholar
Van Himbeeck, T. et al. Quantum violations in the instrumental scenario and their relations to the Bell scenario. Preprint at: https://arxiv.org/abs/1804.04119 (2018).
Leifer, M. S. Quantum dynamics as an analog of conditional probability. Phys. Rev. A 74, 042310 (2006).
Article ADS Google Scholar
Chiribella, G., D’Ariano, G. M. & Perinotti, P. Theoretical framework for quantum networks. Phys. Rev. A 80, 022339 (2009).
Article ADS MathSciNet Google Scholar
Coecke, B. & Spekkens, R. W. Picturing classical and quantum Bayesian inference. Synthese 186, 651–696 (2012).
Article MathSciNet Google Scholar
Leifer, M. S. & Spekkens, R. W. Towards a formulation of quantum theory as a causally neutral theory of Bayesian inference. Phys. Rev. A 88, 052130 (2013).
Article ADS Google Scholar
Henson, J., Lal, R. & Pusey, M. F. Theory-independent limits on correlations from generalized Bayesian networks. New J. Phys. 16, 113043 (2014).
Article ADS Google Scholar
Pienaar, J. & Brukner, Č. A graph-separation theorem for quantum causal models. New J. Phys. 17, 073020 (2015).
Article ADS Google Scholar
Costa, F. & Shrapnel, S. Quantum causal modelling. New J. Phys. 18, 063032 (2016).
Article ADS Google Scholar
Portmann, C., Matt, C., Maurer, U., Renner, R. & Tackmann, B. Causal boxes: quantum information-processing systems closed under composition. IEEE Trans. Inf. Theory 63, 3277–3305 (2017).
MathSciNet MATH Google Scholar
Allen, J.-M. A., Barrett, J., Horsman, D. C., Lee, C. M. & Spekkens, R. W. Quantum common causes and quantum causal models. Phys. Rev. X 7, 031021 (2017).
Google Scholar
MacLean, J.-P. W., Ried, K., Spekkens, R. W. & Resch, K. J. Quantum-coherent mixtures of causal relations. Nat. Commun. 8, 15149 (2017).
Article ADS CAS Google Scholar
Wood, C. J. & Spekkens, R. W. The lesson of causal discovery algorithms for quantum correlations: causal explanations of Bell-inequality violations require fine-tuning. New J. Phys. 17, 033002 (2015).
Article ADS Google Scholar
Fitzsimons, J. F., Jones, J. A. & Vedral, V. Quantum correlations which imply causation. Sci. Rep. 5, 18281 (2015).
Article ADS CAS Google Scholar
Ried, K. et al. A quantum advantage for inferring causal structure. Nat. Phys. 11, 414–420 (2015).
Article CAS Google Scholar
Chaves, R., Majenz, C. & Gross, D. Information–theoretic implications of quantum causal structures. Nat. Commun. 6, 5766 (2015).
Article ADS Google Scholar
Giarmatzi, C. & Costa, F. A quantum causal discovery algorithm. npj Quantum Inf. 4, 17 (2018).
Article ADS Google Scholar
Schuld, M., Sinayskiy, I. & Petruccione, F. An introduction to quantum machine learning. Contemp. Phys. 56, 172–185 (2015).
Article ADS Google Scholar
Biamonte, J. et al. Quantum machine learning. Nature 549, 195 (2017).
Article ADS CAS Google Scholar
Chiribella, G., D’Ariano, G. M., Perinotti, P. & Valiron, B. Quantum computations without definite causal structure. Phys. Rev. A 88, 022318 (2013).
Article ADS Google Scholar
Oreshkov, O., Costa, F. & Brukner, Č. Quantum correlations with no causal order. Nat. Commun. 3, 1092 (2012).
Article ADS Google Scholar
Hardy, L. Quantum theory from five reasonable axioms. Preprint at: https://arxiv.org/abs/quant-ph/0101012 (2001).
Barnum, H., Barrett, J., Leifer, M. & Wilce, A. Generalized no-broadcasting theorem. Phys. Rev. Lett. 99, 240501 (2007).
Article ADS Google Scholar
Barrett, J. Information processing in generalized probabilistic theories. Phys. Rev. A 75, 032304 (2007).
Article ADS Google Scholar
Chiribella, G., D’Ariano, G. & Perinotti, P. Probabilistic theories with purification. Phys. Rev. A 81, 062348 (2010).
Article ADS Google Scholar
Hardy, L. Foliable operational structures for general probabilistic theories. In Deep Beauty: Understanding the Quantum World through Mathematical Innovation (ed. Halvorson, H.) 409–442 (Cambridge University Press, Cambridge, United Kingdom 2011).
Chiribella, G. & Spekkens, R. W. Quantum Theory: Informational Foundations and Foils (Springer, Dordrecht, The Netherlands 2016).
Hayashi, M. Discrimination of two channels by adaptive methods and its application to quantum system. IEEE Trans. Inf. Theory 55, 3807–3820 (2009).
Article MathSciNet Google Scholar
Helstrom, C. W. Quantum detection and estimation theory. J. Stat. Phys. 1, 231–252 (1969).
Article ADS MathSciNet Google Scholar
Bartlett, S. D., Rudolph, T. & Spekkens, R. W. Reference frames, superselection rules, and quantum information. Rev. Mod. Phys. 79, 555–609 (2007).
Article ADS MathSciNet CAS Google Scholar
Kimble, H. J. The quantum internet. Nature 453, 1023–1030 (2008).
Article ADS CAS Google Scholar
Cormen, T. H., Leiserson, C. E., Rivest, R. L. & Stein, C. Introduction to Algorithms (MIT Press, Cambridge, Massachusetts, United States 2009).
Mozrzymas, M., Studziński, M., Strelchuk, S. & Horodecki, M. Optimal port-based teleportation. New J. Phys. 20, 053006 (2018).
Article ADS Google Scholar
Massar, S., Pironio, S. & Pitalúa-Garca, D. Hyperdense coding and superadditivity of classical capacities in hypersphere theories. New J. Phys. 17, 113002 (2015).
Article ADS Google Scholar
Życzkowski, K. Quartic quantum theory: an extension of the standard quantum mechanics. J. Phys. A 41, 355302 (2008).
Article MathSciNet Google Scholar
Barnum, H., Graydon, M. A. & Wilce, A. Some nearly quantum theories. Preprint at: https://arxiv.org/abs/1507.06278 (2015).
Fuchs, C. A. & Van De Graaf, J. Cryptographic distinguishability measures for quantum-mechanical states. IEEE Trans. Inf. Theory 45, 1216–1227 (1999).
Article MathSciNet Google Scholar
Uhlmann, A. The transition probability in the state space of a*-algebra. Rep. Math. Phys. 9, 273–279 (1976).
Article ADS MathSciNet Google Scholar
Wilde, M. M. Quantum Information Theory (Cambridge University Press, 2013).
Choi, M.-D. Completely positive linear maps on complex matrices. Linear Algebra Appl. 10, 285–290 (1975).
Article MathSciNet Google Scholar
Yuen, H., Kennedy, R. & Lax, M. Optimum testing of multiple hypotheses in quantum detection theory. IEEE Trans. Inf. Theory 21, 125–134 (1975).
Article MathSciNet Google Scholar
Werner, R. F. Optimal cloning of pure states. Phys. Rev. A 58, 1827–1832 (1998).
Article ADS CAS Google Scholar
Bužek, V., Hillery, M. & Werner, R. Optimal manipulations with qubits: universal-not gate. Phys. Rev. A 60, R2626–R2629 (1999).
Article ADS MathSciNet Google Scholar

Download references

Acknowledgements

The authors acknowledge Robert Spekkens, David Schmidt, Lucien Hardy, Sergii Strelchuk, Akihito Soeda, and Thomas Gonda for stimulating discussions. This work is supported by the National Natural Science Foundation of China through Grant 11675136, the Croucher Foundation, John Templeton Foundation, Project 60609, Quantum Causal Structures, the Canadian Institute for Advanced Research (CIFAR), the Hong Research Grant Council through Grants 17300317 and 17300918, and the Foundational Questions Institute through Grant FQXi-RFP3-1325. This publication was made possible through the support of a grant from the John Templeton Foundation. The opinions expressed in this publication are those of the authors and do not necessarily reflect the views of the John Templeton Foundation. This research was supported in part by Perimeter Institute for Theoretical Physics. Research at Perimeter Institute is supported by the Government of Canada through the Department of Innovation, Science and Economic Development Canada and by the Province of Ontario through the Ministry of Research, Innovation and Science.

Author information

Authors and Affiliations

Department of Computer Science, The University of Hong Kong, Pokfulam Road, Hong Kong
Giulio Chiribella & Daniel Ebler
Department of Computer Science, University of Oxford, Oxford, OX1 3QD, UK
Giulio Chiribella
Perimeter Institute for Theoretical Physics, Waterloo, ON, N2L 2Y5, Canada
Giulio Chiribella
Department of Physics, Institute for Quantum Science and Engineering, Southern University of Science and Technology, Shenzhen, 518055, China
Daniel Ebler

Authors

Giulio Chiribella
View author publications
Search author on:PubMed Google Scholar
Daniel Ebler
View author publications
Search author on:PubMed Google Scholar

Contributions

Both the authors contributed substantially to the research presented in this paper and to the preparation of the manuscript.

Corresponding author

Correspondence to Giulio Chiribella.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Journal peer review information: Nature Communications thanks Cyril Branciard, Jonatan Bohr Brask and the other anonymous reviewer for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Peer Review File (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chiribella, G., Ebler, D. Quantum speedup in the identification of cause–effect relations. Nat Commun 10, 1472 (2019). https://doi.org/10.1038/s41467-019-09383-8

Download citation

Received: 19 July 2018
Accepted: 08 March 2019
Published: 01 April 2019
Version of record: 01 April 2019
DOI: https://doi.org/10.1038/s41467-019-09383-8

This article is cited by

Quantum causal inference with extremely light touch
- Xiangjing Liu
- Yixian Qiu
- Vlatko Vedral
npj Quantum Information (2025)
Experimental aspects of indefinite causal order in quantum mechanics
- Lee A. Rozema
- Teodor Strömberg
- Philip Walther
Nature Reviews Physics (2024)
RSNET: inferring gene regulatory networks by a redundancy silencing and network enhancement technique
- Xiaohan Jiang
- Xiujun Zhang
BMC Bioinformatics (2022)
Quantum operations with indefinite time direction
- Giulio Chiribella
- Zixuan Liu
Communications Physics (2022)
Quantum causal unravelling
- Ge Bai
- Ya-Dong Wu
- Giulio Chiribella
npj Quantum Information (2022)