Introduction

In the early days of quantum theory, the question of whether there is deeper theory underlying quantum theory was considered “a philosophical question for which physical arguments alone are not decisive”1. Bell’s theorem2,3 made it possible to exclude experimentally some of these deeper theories, called hidden-variable (HV) theories4. Today, Bell tests5,6,7,8,9 have convinced us that some HV theories cannot explain what we see.

In a Bell test, a source of pairs of particles sends each particle to a different laboratory. In the first laboratory, an observer (Alice) chooses to measure x X and obtains a A. In the second laboratory, a different observer (Bob) chooses to measure y Y and obtains b B. After many repetitions, Alice and Bob compute the joint probability of (ab) given (xy), denoted p(abxy). The set {p(abxy)}xX,yY,aA,bB is called a correlation for the Bell scenario (X, A; Y, B), in which Alice can choose between X measurement settings with A possible outcomes and Bob between Y settings with B outcomes.

Bell’s theorem asserts that no HV model satisfying some assumptions can reproduce certain quantum correlations. These models are collectively called “local” HV models and are defined as those satisfying the following assumptions10,11:

(0) Hidden variables. There are hidden variables that associate to each pair of particles a state λ Λ and underlying probability densities p(abλxy) and p(λxy) so

$$p(a,b| x,y)=\int\,d\lambda \,p(\lambda | x,y)p(a,b| \lambda,x,y).$$
(1)

(1) Measurement independence (MI): For every pair of particles, the measurements (xy) are not correlated with λ. That is, p(xyλ) = p(xy), which, through Bayes’s theorem, is equivalent to

$$p(\lambda | x,y)=p(\lambda ).$$
(2)

Therefore, the knowledge of λ gives no information about (xy), and vice versa.

(2) Outcome independence (OI)11 also referred to as completeness10,12: p(aλxyb) is independent of b, and hence may be written

$$p(a| \lambda,x,y,b)=p(a| \lambda,x,y).$$
(3)

Similarly, p(bλxya) = p(bλxy).

(3) Parameter independence (PI)11, initially called locality10: p(aλxy) is independent of y, and hence may be written as

$$p(a| \lambda,x,y)=p(a| \lambda,x).$$
(4)

Similarly, p(bλxy) = p(bλy).

Assumptions (2) and (3) are independent10 and, together, imply that

$$p(a,b| \lambda,x,y)=p(a| \lambda,x)p(b| \lambda,y),$$
(5)

which is Bell’s original assumption2 which is now called local factorizability or local causality.

Assumption (0) is the expression of the belief in a deeper theory underlying quantum theory. MI is motivated by the assumption that each of the observers has freedom of choice13 or, more generally, by the assumption that which specific measurements are actually performed is not governed by the HVs that govern the particles. OI is based on the assumption that, as it happens in deterministic models [i.e., when p(abλxy) {0, 1}], if we would know λ, we would observe that p(abλxy) = p(aλxy)p(bλxy)14. PI is grounded on the assumption that superluminal signalling between one party’s choice and the other party’s spacelike separated outcome is impossible10.

Existing experiments are inconclusive about which assumptions fail. As a consequence, possible explanations include HV theories with different degrees of “measurement dependence”12,14,15,16,17,18,19,20,21 (that may occur due to limitations to freedom of choice22,23 or to retrocausal influences24,25), different amounts of instantaneous “actions at a distance”26,27,28,29,30,31,32, and combinations thereof33. At least one of the four assumptions is false. But which one or which ones?11[pp. 124, 149, 96],34[p. 66]. The prevalent view is that advancing in the resolution of this problem is not possible “on purely physical grounds but it requires an act of metaphysical judgement”35. Here, we challenge this view and present two results. Result 1 shows that there are quantum correlations that cannot be simulated with any HV theory assuming OI but partial (as opposed to complete) measurement dependence (MD) or partial parameter dependence (PD). Result 2 shows that there are quantum correlations that cannot be simulated with any HV theory assuming MI, PI but partial outcome dependence (OD).

In Sec. II A, we introduce the standard ways to quantify MI, PI and OI. Result 1 is presented in Sec.II B, where we also describe an experiment to exclude HV theories with partial MD and PD. Result 2 is presented in Sec. II C, which includes the description of an experiment to exclude HV theories with partial OD. The consequences and applications of the results are discussed in Sec. III.

Results

Relaxing the assumptions

Quantifying measurement dependence

To quantify any lack of MI, and therefore to quantify MD, we have to take into account the distribution of x and y and, therefore, we have to consider the full distribution p(abxy) rather than only p(abxy). The full distribution p(abxy) can be reproduced with an l-measurement dependent (l-MD) HV model36 if it can be reproduced with an HV model such that, for all x X,  y Y, and for all λ,

$$p(x,y| \lambda )\ge l\ge 0.$$
(6)

If there are only two inputs per party, a value l = 1/4 implies that p(xyλ) must be uniform. Therefore, in this case, p(abxy) can be reproduced with an HV model with MI in which there is no correlation between the hidden variables of the particles and the measurement settings. However, this is not the case for 0≤l < 1/4. We say that p(abxy) can be reproduced with an HV model with partial MD if there is an (l-MD) model for some l > 0. We say that p(abxy) can only be reproduced with an HV model with complete MD if p(abxy) cannot be reproduced with any (l-MD) model for any l > 0; the only possible HV models have p(xyλ) = 0 for some pair of settings (xy) and some λ. If there are only two inputs per party, any p(abxy) corresponding to any non-signaling correlation can be reproduced with an HV model with complete MD. The complete relaxation of MI using alternative ways of quantifying MD14,18 matches the above definition of complete MD (see also Supplementary Note1).

Quantifying parameter dependence

A correlation p(abxy) can be reproduced with an (εAεB)-parameter dependent [(εAεB)-PD] HV model14 if it can be reproduced with an HV model such that, for all x, yy′ (y, x, x′), and for all λ,

$$\frac{1}{2}\mathop{\sum }\limits_{a}| p\left(a| \lambda,x,y\right)-p\left(a| \lambda,x,{y}^{{\prime} }\right)| \le {\varepsilon }_{A} \,$$
(7a)
$$\frac{1}{2}\mathop{\sum }\limits_{b}| p\left(b| \lambda,x,y\right)-p\left(b| \lambda,x^{\prime},y\right)| \le {\varepsilon }_{B} \ .$$
(7b)

Therefore, p(abxy) can be reproduced with an HV model with PI if, and only if, it can be reproduced with an (εAεB)-PD HV model with εA = εB = 0. If not, we say that p(abxy) can be reproduced with an HV model with partial PD if it can be reproduced with an (εAεB)-PD HV model for some 0 < εAεB < 1. Finally, p(abxy) can only be reproduced with HV models with complete PD if p(abxy) cannot be reproduced with any (εAεB)-PD HV model for any εAεB < 1. Any non-signaling correlation can be reproduced with HV models with complete PD.

Quantifying outcome dependence

A correlation p(abxy) can be reproduced with a δ-outcome dependent (δ-OD) HV model14 if it can be reproduced with an HV model such that, for all x, y, a, a′, and for any λ,

$$\frac{1}{2}\mathop{\sum }\limits_{b}\left\vert \right.p(b| \lambda,x,y,a)-p(b| \lambda,x,y,a^{\prime} )\left\vert \right.\le \delta .$$
(8)

Therefore, p(abxy) can be reproduced with an HV model with OI if, and only if, it can be reproduced with a δ-OD HV model with δ = 0. We say that p(abxy) can be reproduced with an HV model with partial OD if it can be reproduced with a δ-OD HV model with 0 < δ < 1. We say that p(abxy) can only be reproduced with an HV model with complete OD if it cannot be reproduced with any δ-OD HV model for any δ < 1. Any non-signaling correlation can be reproduced with HV models with complete OD.

Result 1: Quantum correlations that cannot be simulated if there is arbitrarily small MI or PI

Consider the bipartite Bell experiment in which Alice and Bob have two measurement options xy {0, 1}, each of them with 2N possible results which can be expressed as a string of N bits, ab {(0, 0, …, 0), (0, 0, …, 1), …, (1, 1, …, 1)}. Suppose that Alice and Bob share the following 2N × 2N-dimensional entangled state:

$$\left\vert \psi \right\rangle={\left\vert \phi \right\rangle }_{A_1,B_1}\otimes \cdots \otimes {\left\vert \phi \right\rangle }_{A_N,B_N} \,$$
(9)

where

$$\left\vert \phi \right\rangle=a(\left\vert 01\right\rangle+\left\vert 10\right\rangle )+\sqrt{1-2{a}^{2}}\left\vert 11\right\rangle,$$
(10)

with \(a=\frac{\sqrt{5}-1}{2}\), is a two-qubit state with the first qubit in Alice’s side and the second qubit in Bob’s side. Suppose that Alice’s and Bob’s measurements are of the form

$${A}_{{a}_{1},\ldots,{a}_{N}| x}={A}_{{a}_{1}| x}\otimes \cdots \otimes {A}_{{a}_{N}| x} \,$$
(11a)
$${B}_{{b}_{1},\ldots,{b}_{N}| y}={B}_{{b}_{1}| y}\otimes \cdots \otimes {B}_{{b}_{N}| y} \,$$
(11b)

where, here, the tensor product refers to the qubits in each observer’s system and the specific form of the factors is given by

$${A}_{1| x}={\mathbb{1}}-{A}_{0| x} \,$$
(12a)
$${B}_{1| y}={\mathbb{1}}-{B}_{0| y} \,$$
(12b)

where

$${A}_{0| 0}={B}_{0| 0}=\left\vert 0\right\rangle \left\langle 0\right\vert,$$
(13a)
$${A}_{0| 1}={B}_{0| 1}=\left\vert \varphi \right\rangle \left\langle \varphi \right\vert,$$
(13b)

with \(\left\vert \varphi \right\rangle=\frac{1}{\sqrt{1-{a}^{2}}}(\sqrt{1-2{a}^{2}}\left\vert 0\right\rangle -a\left\vert 1\right\rangle )\). That is, each of the 2N-outcome measurements can be seen as N (nonindependent) two-outcome measurements performed simultaneously on a 2N-dimensional quantum system. These state and measurements produce a correlation with the following properties:

$$p(0,1,{a}_{2},{b}_{2},\ldots,{a}_{N},{b}_{N}| 0,1)=\ldots=p({a}_{1},{b}_{1},\ldots,{a}_{N-1},{b}_{N-1},0,1| 0,1)=0,$$
(14a)
$$p(1,0,{a}_{2},{b}_{2},\ldots,{a}_{N},{b}_{N}| 1,0)=\ldots=p({a}_{1},{b}_{1},\ldots,{a}_{N-1},{b}_{N-1},1,0| 1,0)=0,$$
(14b)
$$p(0,0,{a}_{2},{b}_{2},\ldots,{a}_{N},{b}_{N}| 1,1)=\ldots=p({a}_{1},{b}_{1},\ldots,{a}_{N-1},{b}_{N-1},0,0| 1,1)=0,$$
(14c)

for all a1, …, aNb1, …, bN {0, 1}. Eq. (14a) indicates that, if the measurements are x = 0 for Alice and y = 1 for Bob, then, in the N-bit strings that Alice and Bob obtain as outputs cannot be one position where Alice has 0 and Bob has 1. Similarly, for Eqs. (14b) and (14c). These state and measurements are the ones needed for the parallelised version37 of the optimal version of the proof of Bell non-locality proposed by Hardy38.

Let us define

$${p}_{H}^{N}{:}=\sum _{\begin{array}{c} {a}_{1},\ldots,{a}_{N},{b}_{1},\ldots,{b}_{N}\hfill\\ \left({a}_{1},{b}_{1}\right)=(0,0)\vee \ldots \vee \left({a}_{N},{b}_{N}\right)=(0,0)\end{array}}p\left({a}_{1},{b}_{1},\ldots,{a}_{N},{b}_{N}| 0,0\right),$$
(15)

where is the logical OR.

Result 1 can be stated as follows: In any l-MD and (εAεB)-PD HV model satisfying OI and Eqs. (14a), (14b) and (14c), for all l > 0 and all N,

$${p}_{H}^{N}\le {\varepsilon }_{A}+{\varepsilon }_{B}-{\varepsilon }_{A}{\varepsilon }_{B}.$$
(16)

The proof is in the Supplementary Note 2. Therefore, if εA < 1 and εB < 1, then \({p}_{H}^{N}\, < \,1\). In contrast, in quantum theory37, as N tends to infinity,

$${p}_{H}^{N}\mathop{\longrightarrow }\limits^{N\to \infty }1.$$
(17)

Consequently, for any l-MD and (εAεB)-PD HV model with l > 0, εA < 1, εB < 1, satisfying OI, there is N such that quantum theory predicts a value for \({p}_{H}^{N}\) that cannot be simulated.

For example, Table 1 gives the values of ε = εA = εB that cannot be simulated if nature achieves the quantum value for \({p}_{H}^{N}\). Notice that the number of excluded HV models grows with N. As N tends to infinity, the only surviving HV models are those with ε = 1.

Table 1 Relaxation of PI as a function of the number of parallel copies

The correlations defined by Eqs. (10)–(13) are special: they define an extremal non-exposed point of the quantum set of correlations for the two-observer two-setting two-outcome Bell scenario39. Any other correlation with this property can also be used in the experiment. The full characterisation of these points is in40.

A natural question is what conditions a correlation must satisfy to allow for arbitrarily small MI and PI, and whether there are quantum correlations in Bell scenarios with finite number of inputs that allow for such relaxation. In the Supplementary Note 3, we show a necessary condition - the quantum correlation must necessarily lie on or be arbitrarily close to the nonsignaling boundary. We also illustrate by an explicit example that this condition is not sufficient. We leave as an open question whether there is a finite input-output quantum correlation that proves Result 1.

Experimental test to exclude HV theories with partial MD and PD

So far, we have identified a quantum correlation that cannot be simulated by any l-MD and (εAεB)-PD HV model with l > 0, εA < 1, εB < 1, satisfying OI. This correlation is a point in the set of quantum correlations. The problem is that, due to experimental errors, an actual experiment will fail to exactly produce this point. Here, we reformulate Result 1 in a way that the existence of correlations that cannot be simulated by l-MD and (εAεB)-PD HV models with l > 0, εA < 1, εB < 1, and satisfying OI, can be experimentally tested.

It can be proven (see Supplementary Note 4) that, for any l-MD and (εAεB)-PD HV model with l > 0, εA < 1, εB < 1, satisfying OI, the following Bell-like inequality holds:

$${I}_{\kappa }^{N}({p})\le {\tilde{\varepsilon }}_{A}+{\tilde{\varepsilon }}_{B}-{\tilde{\varepsilon }}_{A}{\tilde{\varepsilon }}_{B},$$
(18)

where

$${I}_{\kappa }^{N}({p}_{H}^{N}) \, {:\!\!=} \sum _{\begin{array}{c}{a}_{1},\ldots,{a}_{N},{b}_{1},\ldots,{b}_{N}\\ \left({a}_{1},{b}_{1}\right)=(0,0)\vee \ldots \vee \left({a}_{N},{b}_{N}\right)=(0,0)\end{array}}{p}\left(\left({a}_{1},{b}_{1}\right),\ldots,\left({a}_{N},{b}_{N}\right)| 0,0\right) \\ - \, \kappa \sum _{\begin{array}{c}{a}_{1},\ldots,{a}_{N},{b}_{1},\ldots,{b}_{N}\\ \left({a}_{1},{b}_{1}\right)=(0,1)\vee \ldots \vee \left({a}_{N},{b}_{N}\right)=( 0,1)\end{array}}{p}\left(\left({a}_{1},{b}_{1}\right),\ldots,\left({a}_{N},{b}_{N}\right)| 0,1\right) \\ - \, \kappa \sum _{\begin{array}{c}{a}_{1},\ldots,{a}_{N},{b}_{1},\ldots,{b}_{N}\\ \left({a}_{1},{b}_{1}\right)=(1,0)\vee \ldots \vee \left({a}_{N},{b}_{N}\right)=(1,0)\end{array}}{p}\left(\left({a}_{1},{b}_{1}\right),\ldots,\left({a}_{N},{b}_{N}\right)| 1,0\right) \\ - \, \kappa \sum _{\begin{array}{c}{a}_{1},\ldots,{a}_{N},{b}_{1},\ldots,{b}_{N}\\ \left({a}_{1},{b}_{1}\right)=(0,0)\vee \ldots \vee \left({a}_{N},{b}_{N}\right)=(0,0)\end{array}}{p}\left(\left({a}_{1},{b}_{1}\right),\ldots,\left({a}_{N},{b}_{N}\right)| 1,1\right),$$
(19)

with

$$\kappa \, > \, \frac{{N}^{2}}{l{(1-\varepsilon )}^{2}},$$
(20)

where \(\varepsilon=\max \{{\varepsilon }_{A},{\varepsilon }_{B}\}\), and

$${\tilde{\varepsilon }}_{A}={\varepsilon }_{A}+N\sqrt{\frac{2}{l{\kappa}}} \,$$
(21a)
$${\tilde{\varepsilon }}_{B}={\varepsilon }_{B}+N\sqrt{\frac{2}{l\kappa }} \ .$$
(21b)

This means that, for any l-MD and (εAεB)-PD HV model with l > 0, εA < 1, εB < 1, satisfying OI, for sufficiently large κ, the quantity \({I}_{\kappa }^{N}({p})\) is upper bounded by a value that is always smaller than 1. Furthermore, for fixed N, this bound approaches the bound for (16) when we take large values of κ and is therefore violated by the quantum state and measurements described earlier.

Result 2: Quantum correlations which cannot be simulated if there is arbitrarily small OI

Consider the bipartite Bell experiment in which Alice and Bob have M + 1 measurement options xy {0, 1, …, M}, each of them with 2 possible results, ab {0, 1}. Suppose that Alice and Bob share the following two-qubit entangled state:

$$\left\vert \phi \right\rangle=\frac{1}{\sqrt{1+{t}^{2}}}(t\left\vert 00\right\rangle -\left\vert 11\right\rangle ),$$
(22)

where t [0, 1] is the value that maximises

$$\mathop{\max }\limits_{0\le t\le 1}\frac{{t}^{2}}{1+{t}^{2}}{\left(\frac{1-{t}^{2M}}{1+{t}^{2M+1}}\right)}^{2}.$$
(23)

Alice’s and Bob’s measurements are of the form \({A}_{a| x}=| {\pi }_{a| x}\left.\right\rangle \left\langle \right.{\pi }_{a| x}|\) and \({B}_{b| y}=| {\sigma }_{b| y}\left.\right\rangle \left\langle \right.{\sigma }_{b| y}|\), with

$$| {\pi }_{0| x}\left.\right\rangle =\cos {a}_{x}| 0\left\rangle \right.+\sin {a}_{x}| 1\left.\right\rangle,\forall x\in \{0,\ldots,M\},\\ | {\pi }_{1| x}\left.\right\rangle =-\sin {a}_{x}| 0\left\rangle \right.+\cos {a}_{x}| 1\left.\right\rangle,\forall x\in \{0,\ldots,M\},$$
(24)

and

$$| {\sigma }_{0| y}\left.\right\rangle =\cos {b}_{y}| 0\left.\right\rangle+\sin {b}_{y}| 1\left.\right\rangle,\forall y\in \{0,\ldots,M\},\\ | {\sigma }_{1| y}\left.\right\rangle =-\sin {b}_{y}| 0\left\rangle \right.+\cos {b}_{y}| 1\left.\right\rangle,\forall y\in \{0,\ldots,M\},$$
(25)

with

$${a}_{k}={b}_{k}=\arctan \left[{(-1)}^{k}{t}^{k+1/2}\right]\forall k\in \{0,\ldots,M\}.$$
(26)

These state and measurements produce a correlation with the following properties:

$$p(0,0| 0,0)=0,$$
(27a)
$$p(0,1| k,k-1)=0\quad \forall k\in \{1,\ldots,M\},$$
(27b)
$$p(1,0| k-1,k)=0\quad \forall k\in \{1,\ldots,M\},$$
(27c)

and correspond to the optimal implementation of the “ladder” version of Hardy’s proof21,38,40.

Let us define

$${p}_{H}^{M}:=p(0,0| M,M).$$
(28)

Result 2 can be formulated as follows: In any δ-OD HV model that satisfies MI, PI and Eq. (27),

$${p}_{H}^{M}\le \frac{{\delta }^{q}}{2},$$
(29)

where \(q=\frac{M+1}{2}\). The proof is provided in the Supplementary Note 5. Therefore, if δ < 1, it follows that \({p}_{H}^{M} < \frac{1}{2}\). In contrast, in quantum theory21,40, as M approaches infinity,

$${p}_{H}^{M}\mathop{\longrightarrow }\limits^{M\to \infty }\frac{1}{2}.$$
(30)

Consequently, for any HV model with MI, PI and δ-OD, with δ < 1, there is M such that quantum theory predicts a value for \({p}_{H}^{M}\) that cannot be simulated. In addition, it can be proven (see Supplementary Note 6) that the set of correlations produced by HV with MI, PI and complete OD is the set of nonsignaling correlations.

The above proof is based on the assumption that Eq. (27) hold. In an actual experiment, instead of the zeros Eq. (27), we will obtain small values. Once we have them, we can derive an optimal Bell-like inequality that will allows us to discard any HV model with δ-OD for some δ < 1.

Discussion

The results presented have consequences both for foundations and applications in quantum information processing, communication and computation. For foundations, our results bring us closer to the solution of a problem proposed by Shimony11[pp. 96, 124, 149]34[p. 66] and which can be formulated as follows: “One of these three premises [MI, PI and OI] must be false and it is important to locate the false one”11[p. 96]. If we assume that only one is false (and that it is the same one for all non-local quantum correlations), then

  1. I.

    If the assumption that fails is MI, Result 1 shows that, MI has to fail completely because there are quantum correlations that can only be explained with complete MD.

  2. II.

    If the assumption that fails is PI, Result 1 shows that, PI has to fail completely because the same correlations used in [I] can only be explained with complete PD.

  3. III.

    If the assumption that fails is OI, Result 2 shows that OI has to fail completely because there are quantum correlations that can only be explained with complete OD.

Each of these solutions to Shimony’s problem requires extra causal influences which are not needed if HVs do not exist. These extra causal influences are shown with non-black colours in Fig. 1a–c, respectively. The causal influences if HVs do not exist are shown in Fig. 1d.

Fig. 1: Space-time diagrams of the causal influences needed, in every round of the Bell-like test, for each of the possible solutions to Shimony’s problem in light of our results.
figure 1

In all diagrams, black arrows represent causal influences common to all possibilities. a Complete measurement dependence. It can occur in, essentially, two ways. The first is with complete superdeterminism without retrocausality. In this case, Alice and Bob do not have freedom of choice to choose the measurement settings, X and Y, respectively. Instead, the settings are determined by the distribution of the HVs in the past light cones of X and Y, represented by the lower node Λ. The causal influences between the lower Λ and X and Y are represented by violet continuous arrows. The second way complete MD can occur is with complete retrocausality with freedom of choice. In this case, Alice and Bob have freedom to choose the “nominal” measurement settings X and Y, but the actual measurements are determined by the distribution of the HVs in the future light cones of X and Y, represented by the upper node Λ. The causal influences between the upper Λ and X and Y are represented by violet dashed arrows. b Complete parameter dependence. X is decided by Alice and Y is decided by Bob. However, X does not only influence Alice’s measurement outcome, represented by A, but also Bob’s measurement outcome, represented by B, which is outside the light cone of X. This superluminal influence (or “action at a distance'') is represented by a red arrow. Similarly, Y does not only influence B, but also A. c Complete outcome dependence. X is decided by Alice and Y is decided by Bob. However, A and B are causally connected despite they are space-like separated. These superluminal influences are represented by blue arrows. d No hidden variables. X is decided freely by Alice and Y is decided freely by Bob. A is causally connected only to the quantum state, represented by Ψ, and X. Similarly, B is causally connected only to Ψ and Y.

More generally, our results allow us to experimentally narrow down the possible explanations of Bell non-locality and the whole quantum theory, since they allow to experimentally excluding large subsets of HV models that are not excluded by previous experiments. Specifically, in principle, any l-MD, (εAεB)-PD HV model with l > 0, εA < 1, εB < 1, satisfying OI can be experimentally excluded. Similarly, any δ-OD HV model with δ < 1 and satisfying MI and PI can be, in principle, experimentally excluded. Still, the experiments cannot exclude HV models with complete MD22 or complete PD26,32 or complete OD.

Result 1 extends the observation in36 that there are quantum correlations that cannot be obtained from an l-MD HV model that satisfies OI and PI, for all values of l > 0. In36, the difference between quantum theory and the models with OI, PI and arbitrarily small MI is so small that any relaxation of PI makes the difference to vanish. In this respect, Result 1 makes testable the impossibility of HV models with partial MD and PD.

Result 2 is related to the observation in31 that there are quantum correlations that cannot be simulated with the assumptions of causal models (CM), MI, causal parameter independence (CPI) and the complete relaxation of causal outcome independence (COI). This observation is not made in the framework of the four assumptions of Bell’s theorem (HV, MI, PI, OI) but in the framework of causal models41. In general, neither HV and CM, nor PI and CPI, nor OI and COI, are equivalent.

In addition, Results 1 and 2 confirm that there is some interchangeability between MD and (PD+OD)42,43. However, our results go beyond that as they show that epsilon of each of MD and (PD+OD) is not enough: complete MD or complete PD or complete OD is needed.

One reason why it is important to exclude HV models with partial (but not complete) MD is that these models have been proposed to explain quantum correlations12,14,15,16,17,18,19,20,21. In addition, partial MD or, more precisely partial human’s free will, has been proposed in philosophy to resolve the conflict between the concept of an omniscient God and God’s commandment not to commit sin44.

One reason why it is important to exclude HV models with partial (but not complete) actions at a distance is that these models have been proposed to explain quantum correlations27,28,29,30,31,32. A second reason, which also applies to HV models with partial MD, is that excluding larger sets of HV models facilitates the discussion of the remaining models and, in particular, the discussion of the thermodynamics of the HV models45 that could not be discarded.

Our results are also of practical interest in quantum information processing, quantum communication and quantum computation. In the first place, for a general reason: the results show that quantum correlations do not only offer advantage with respect to local correlations, but also with respect to correlations assisted by partial instantaneous actions at a distance or even assuming the existence of partial constraints to freedom of choice or partial retrocausal influences. This can make a big difference in quantum computational advantage. For example, when mapping quantum non-local correlations into the circuit model, the advantage of quantum theory with respect to local HV theories translates into a non-oracular quantum advantage46,47. Our results show that there is also advantage with respect to non-local correlations with partial MD, PD and OD. This may translate into new forms of quantum computational advantage.

Another reason why our results are of practical interest is device-independent (DI) quantum information processing48,49. DI protocols for random number generation50, quantum key distribution48, state tomography51 and self-testing of quantum devices52 achieve advantage allowing users to monitor the performance of their devices irrespective of noise, imperfections, and lack of knowledge regarding the inner workings - the users simply treat their devices as black boxes with classical inputs and outputs. An obstacle for practical DI protocols is the experimentally challenging requirement of a Bell test with: (I) quantum devices being isolated from each other, (II) with the inputs being chosen with uniform randomness and (III) with the detection loophole53 closed. Experiments in different platforms54,55,56 allow for Bell tests with the detection loophole closed and high DI randomness generation rates. However, in these platforms, the quantum systems are very close, primarily to drive high entanglement generation rates via non-negligible coupling. The problem is that, precisely because the systems are close to one another, they can no longer be regarded as isolated in the sense needed for a Bell test. Sophisticated theoretical techniques have been devised to handle the issues of cross-talk and weak seeds separately. A Bell-like test allowing for arbitrary relaxation of MI and PI provides a simple and elegant solution to the problem of leakage of input information. A Bell-like test allowing for simultaneous relaxation of MI, PI and OI would allow DI randomness generation tolerating weak seeds and cross-talk. We hope that our results will stimulate research in these directions.