Abstract
A consistent theory of quantum gravity will require a fully quantum formulation of the classical equivalence principle. Such a formulation has been recently proposed in terms of the equality of the rest, inertial and gravitational mass operators, and for non-relativistic particles in a weak gravitational field. In this work, we propose a generalization to a fully relativistic formalism of the quantum equivalence principle, valid for all background space-times, as well as for massive bosons and fermions. The principle is trivially satisfied for massless particles. We show that if the equivalence principle is broken at the quantum level, it implies the modification of the standard Lorentz transformations in flat space-time and a corresponding modification of the metric in curved space-time by the different mass ratios. In other words, the observed geometry would effectively depend on the properties of the test particle. Testable predictions of potential violations of the quantum equivalence principle are proposed.
Similar content being viewed by others
Introduction
The Equivalence Principle is the cornerstone of classical general relativity (GR)1,2,3. However, gravity is expected to be fundamentally a quantum force, just as the other three forces in Nature, with various approaches to quantum gravity being proposed4,5,6, it follows that there must exist a Quantum Equivalence Principle (QEP), from which the standard Equivalence Principle follows in the classical limit. Such a QEP was proposed by Zych & Brukner7, which was applied to non-relativistic particles and weak gravitational fields. In this article, a necessary and sufficient generalization of the applicability of the QEP to relativistic bosons and fermions as well as to curved space-times is proposed. It is shown that if the QEP is violated, the Lorentz transformations in flat space-time and the corresponding metrics in curved space-times depend on the properties of the test particle itself and energies and momenta of non-interacting particles do not simply add up. In other words, if these problems are to be avoided, then the QEP should be exactly satisfied in Nature! Furthermore, not only should every theory of quantum gravity take into account this principle, but the latter may even provide useful guidance in the formulation of such theories.
The QEP is proposed in the form of an equality between the quantum operators representing the inertial, gravitational and rest masses of a given particle as7
which in turn can be written in terms of the ‘ground state mass’ mα and the internal interactions of the composite particle, described by the internal energy operator \({\hat{H}}_{int,\alpha }\) as7
where α = I, G, R signify the inertial, gravitational and rest masses of the test particle, and c the speed of light. Eq. (1) encompasses all three features of the Equivalence Principle. \({\hat{M}}_{{{{{{{{\rm{I}}}}}}}}}={\hat{M}}_{{{{{{{{\rm{G}}}}}}}}}\) represents the quantum version of the weak equivalence principle (WEP), \({\hat{H}}_{{{{{{{{\rm{int,R}}}}}}}}}={\hat{H}}_{{{{{{{{\rm{int,I}}}}}}}}}\) implies the quantum version of the local Lorentz invariance (LLI) and \({\hat{H}}_{{{{{{{{\rm{int,R}}}}}}}}}={\hat{H}}_{{{{{{{{\rm{int,G}}}}}}}}}\) implies the quantum version of the local position invariance (LPI). Note that the formalism of Zych & Brukner7, in which Eqs. (1) and (2) are applied, is non-relativistic (i.e. slow-speed particles) and considers a weak gravitational field that can be described by Newton’s gravity. In this article, it is shown, for the first time to the best of the authors’ knowledge, that Eqs. (1) and (2) can be applied in a generalized formalism, which can be used to test the QEP with massive bosons and fermions at relativistic speeds and in strong gravitational fields as well.
Results
Modifications of special relativity
First, as shown in Method Special Relativity, when the three masses are not equal, the standard dispersion relation of a free relativistic particle, namely E2 = p2c2 + m2c4, gets modified to the more general relation
which reduces to the dispersion relation, considered by Zych & Brukner7 in the non-relativistic limit and in the absence of a gravitational field. This results in the following modified Lorentz transformation for any four vector, for a boost v
which is a transformation between reference frames with different velocities, e.g., \({x}^{\mu }={\Lambda }_{\nu }^{\mu }({m}_{\alpha })\,{x}^{{\prime} \nu }\). The corrections in the i0 components appear as such, to preserve the norm VμVμ of an arbitrary 4-vector Vμ under boosts, and to be consistent with the modified Lorentz factor
as explained in Method Special Relativity. Now, it follows from Eq. (4) that
This means that even for two non-interacting particles, their masses do not just simply add up under the modified Lorentz transformations. This is known as the so-called soccer-ball problem8, and the above result shows that if this problem is to be absent in Nature, the LLI must necessarily be satisfied. Note however, that regardless of whether LLI holds or is broken, transformation (4) still forms a group. The generators of this group are still the standard Lorentz generators, since the modifications enter only through the boost parameters.
Modifications of general relativity
Next, one considers an arbitrary curved space-time, given by the (contravariant) metric gμν. For this, as shown in Method General Relativity, the metric gμν now contains the ratios between inertial, gravitational and rest masses, i.e., gμν = gμν(mI, mG, mR) ≡ gμν(mα). As expected, the standard form of the metric is recovered when mI = mG = mR. To arrive at a specific background space-time, one must solve the Einstein equations with the relevant stress-energy tensor and boundary conditions, and evaluate the integration constants, taking into account the masses mα. This is done by re-writing Newton’s second law of motion and Newton’s law of gravity for a freely falling particle with different inertial and gravitational masses as
where ϕ is the classical gravitational potential. A standard derivation of the metric from the Einstein equations follows, with the integration constants left arbitrary at first. To evaluate the integration constants, the weak field limit of the metric, where gμν = ημν + hμν and ∣hμν∣ ≪ 1 (here, ημν is the Minkowski flat space-time metric and hμν the small perturbation term), or equivalently, the weak field limit of the geodesic equation for that metric, is then compared to the above version of Newton’s law of gravity. One must also simultaneously consider the modification of the relativistic energy-momentum dispersion relation due to the different masses (see Eq. (28) in Method Special Relativity). By doing so, the resultant metric obtains modification factors in the form of ratios \({m}_{\alpha }/{m}_{{\alpha }^{{\prime} }}\), where \(\alpha ,{\alpha }^{{\prime} }={{{{{{{\rm{I}}}}}}}},\,{{{{{{{\rm{G}}}}}}}},\,{{{{{{{\rm{R}}}}}}}}\), as for example
for the 00 component of the weak gravitational field metric. For more general cases, where there is more than one integration constant, one must consider classical limits of other quantities, such as angular momentum of a rotating object in the case of the Kerr space-time (see Method Kerr Metric), to obtain the modified integration constants. This may have measurable consequences, as will be shown later. Note that the mass ratios do not change the differential equations of GR, since they modify only the integration constants.
Next, one considers the specific example of the Schwarzschild metric, which as one knows, is obtained by solving the vacuum Einstein equations under the assumptions of staticity, spherical symmetry and asymptotic flatness9. In the standard derivation of the Schwarzschild metric there is only one integration constant \(K=-\frac{2GM}{{c}^{2}}\), where G is the universal gravitational constant and M the mass associated with the static, spherically symmetric metric. This constant is obtained in the asymptotic (weak field) limit, by comparing ∣g00 − 1∣ to the classical Newtonian potential \(\phi =-\frac{GM}{r}\) (see Eq. (8)), where r is the distance from the central object. However, in the case where one distinguishes between the masses mα of a test particle, it turns out (see Method Weak Gravitational Field and the Schwarzschild Metric) that these masses modify the integration constant \(K=-\frac{{m}_{{{{{{{{\rm{G}}}}}}}}}}{{m}_{{{{{{{{\rm{R}}}}}}}}}}\frac{2GM}{{c}^{2}}\) and the metric reads as
where θ the polar angle. In the above, one can see that the mass ratio mG/mR modifies only the temporal and radial components of the metric (in the spherical coordinate system) and leaves the angular parts unchanged, while the inertial mass mI does not affect the components in this case. Therefore, if the ratio mG/mR is universal, i.e., the same irrespective of the composition of the test particles, then it can be set to unity and the above metric becomes independent of the test masses, just as in standard GR. However, if the ratio does depend on the composition, then our result shows that it affects the effective space-time geometry itself, which should have observable consequences. Such effects can be studied in the weak gravitational field10, or a general case of the Kerr space-time, where the modifications are obtained by following the procedure of Papapetrou11 (see Method Kerr Metric for details) and reads as
where \({\Sigma }^{2}({m}_{\alpha })={({r}^{2}+\frac{{m}_{{{{{{{{\rm{I}}}}}}}}}}{{m}_{{{{{{{{\rm{R}}}}}}}}}}{a}^{2})}^{2}\,\,-\frac{{m}_{{{{{{{{\rm{I}}}}}}}}}}{{m}_{{{{{{{{\rm{R}}}}}}}}}}{a}^{2}\Delta ({m}_{\alpha }){\sin }^{2}\theta\), \(\,\,\Delta ({m}_{\alpha })={r}^{2}-\frac{{m}_{{{{{{{{\rm{G}}}}}}}}}}{{m}_{{{{{{{{\rm{R}}}}}}}}}}\frac{2GM}{{c}^{2}}r+\frac{{m}_{{{{{{{{\rm{I}}}}}}}}}}{{m}_{{{{{{{{\rm{R}}}}}}}}}}{a}^{2}\) and \({\rho }^{2}({m}_{\alpha })={r}^{2}+\frac{{m}_{{{{{{{{\rm{I}}}}}}}}}}{{m}_{{{{{{{{\rm{R}}}}}}}}}}{a}^{2}{\cos }^{2}\theta\). From the above modified Kerr metric, one can see that all three masses mα modify the metric in all coordinate directions, i.e., the temporal, radial, polar and azimuthal directions. If the rotation of the central object vanishes, i.e., a ⟶ 0, the modified Schwarzschild solution from Eq. (9) is recovered. Note that for the cases studied here, the inertial mass mI enters the metric only in the rotating (non-static) case as seen from Eq. (10), where there is inertial frame dragging surrounding the central gravitating object. It turns out that mI appears in the metric of any arbitrary rotating object, since in the asymptotic limit of the corresponding gμν, one always obtains Eq. (64), where mI enters through Eq. (48).
Generalized formalism
Equipped with the above building blocks, one now proceeds to construct a formalism of the QEP, which is valid for bosons and fermions, travelling at arbitrary speeds and in arbitrary curved space-times. The corresponding wave equations reduce to the standard Klein-Gordon and Dirac equations in curved space-time in the limit when the QEP is satisfied.
First, one considers a bosonic particle, and writes its effective Hamiltonian in the Feshbach-Villars formalism12,13,14, suitably generalized to curved space-times and incorporating the three types of masses
where τk(k = 1, 2, 3) are the Pauli matrices. It can be easily shown that the above gives rise to the generalization of the Klein-Gordon equation in curved space-time15 incorporating the three masses (see Method Feshbach-Villars Formalism in Curved Space-time). Note that in the above, the gravitational mass enters through the metric gμν(mα). It can also be straightforwardly shown that the above modified Hamiltonian exactly corresponds to the modified Klein-Gordon Lagrangian
where \({\tilde{\partial }}_{\mu }=\left(\frac{1}{c}{\partial }_{0},\sqrt{\frac{{m}_{{{{{{{{\rm{R}}}}}}}}}}{{m}_{{{{{{{{\rm{I}}}}}}}}}}}{{{{{{{\boldsymbol{\partial }}}}}}}}\right)\). The above Lagrangian can be used to study the QEP with bosons in Quantum Field Theory in curved space-times. On variation, it gives rise to the same modified Klein-Gordon equation as obtained from Eq (11).
For fermions, on the other hand, one writes the standard Dirac Hamiltonian in curved space-time16, modified by the different masses as
where \({\underline{\gamma }}^{\mu }({m}_{\alpha })={e}_{a}^{\mu }({m}_{\alpha })\,{\gamma }^{a}\) with \({e}_{a}^{\mu }({m}_{\alpha })\) being the vierbeins (or tetrads), defined by \({g}^{\mu \nu }({m}_{\alpha })={e}_{a}^{\mu }({m}_{\alpha })\,{e}_{b}^{\nu }({m}_{\alpha })\,{\eta }^{ab}\), while γa and ηab are the flat space-time Dirac gamma matrices and Minkowski metric, respectively, of the tangent space, whose basis is denoted by Latin indices. The derivative ∇i(mα) from the momentum operator \({\hat{p}}_{i}({m}_{\alpha })=-i\hslash {\nabla }_{i}({m}_{\alpha })\) takes the form ∇μ(mα) = ∂μ − Γμ(mα), where
is the spinor affine connection. Note that \(\underline{{\gamma }^{\mu }}({m}_{\alpha })\) and Γμ(mα) are functions of mα, since they depend on gμν(mα). In the above \({e}_{\lambda ;\mu }^{b}({m}_{\alpha })={\partial }_{\mu }{e}_{\lambda }^{b}({m}_{\alpha })-{\Gamma }_{\mu \lambda }^{\rho }({m}_{\alpha })\,{e}_{\rho }^{b}({m}_{\alpha })\), where
are the modified Christoffel symbols.
It can be easily shown that Eq. (13) gives rise to the generalization of the Dirac equation in curved space-time incorporating the three masses. It can also be shown that the above modified Hamiltonian corresponds exactly to the modified Dirac Lagrangian
where \({\tilde{\nabla }}_{\mu }({m}_{\alpha })=\left(\frac{1}{c}{\nabla }_{0}({m}_{\alpha }),\sqrt{\frac{{m}_{{{{{{{{\rm{R}}}}}}}}}}{{m}_{{{{{{{{\rm{I}}}}}}}}}}}\,{{{{{{{\boldsymbol{\nabla }}}}}}}}({m}_{\alpha })\right)\). The above Lagrangian can be used to study the QEP with fermions in Quantum Field Theory in curved space-times, and gives rise to the same modified Dirac equation as obtained from Eq. (13). In this case, the gravitational mass enters not only through the metric gμν(mα), but also implicitly through \(\underline{{\gamma }^{\mu }}({m}_{\alpha })\) and Γμ(mα).
To describe the internal quantum effects of the test particle in the above generalized formalism, the masses mα are promoted to quantum operators (see Eq. (2)) in Eqs. (11) and (13), which can be used to study the QEP in any given scenario. Note that the operators \({\hat{M}}_{\alpha }\) describe the total mass-energy of a composite quantum system, with mα being the ground state, and \({\hat{H}}_{{{{{{{{\rm{int,\alpha }}}}}}}}}\) effectively describe the relativistic internal effects of the composite particle, related to its degrees of freedom and drives the non-trivial internal evolution7. This completes the generalization of the applicability of the QEP, introduced Zych & Brukner7, to relativistic bosons and fermions in arbitrary curved space-times.
Experimental bounds
The generalization of the applicability of the QEP proposed in this work may be tested in Earth-based experiments, such as interferometers and particle accelerators, and in strong gravitational fields, such as in the vicinity of stars and black holes. The most common experiments are Earth-based experiments, which corresponds to a non-relativistic limit and a weak gravitational field. Such experiments include the neutron interferometer experiment by Colella, Overhauser and Werner (COW)17 to test the WEP and other proposals7 to test the LLI and the LPI. The COW experiment can be used to measure the effective gravitational mass \(\langle {\hat{M}}_{{{{{{{{\rm{G}}}}}}}}}\rangle ={m}_{{{{{{{{\rm{G}}}}}}}}}+{E}_{{{{{{{{\rm{int,G}}}}}}}}}/{c}^{2}\) of the neutron, since the violation of the QEP modifies the phase shift to
where g is the gravitational acceleration, A = lh is the area, l the length and h the height of the interferometer, v is the velocity of the neutron in the lower branch and Eint,G = 〈Hint,G〉 is the deviation, obtained from the precision of measuring ΔΦ, which is ~ 10−318. The effective inertial mass of the neutron corresponds to the accepted value of the mass of the neutron, since it is measured kinematically. Comparing the above masses provides an upper bound of ≲ 10−3 for the deviation of mG/mI from unity. The above bound corresponds to the upper bound on the difference of eigenvalues \(\frac{1}{m{c}^{2}}({E}_{{{{{{{{\rm{int,G}}}}}}}}}-{E}_{{{{{{{{\rm{int,I}}}}}}}}})\), where one assumes m = mG = mI as the ground state value for both \({\hat{M}}_{{{{{{{{\rm{G}}}}}}}}}\) and \({\hat{M}}_{{{{{{{{\rm{I}}}}}}}}}\). Keeping the masses different would not change the order of magnitude of the correction. This provides a test for the WEP.
To test the QEP at relativistic speeds in the absence of a gravitational field, one can for example consider the mean lifetime τ of a particle in cosmic ray showers. It can easily be shown that the measured lifetime τ in a cosmic ray shower is given by
where τ0 is the mean lifetime of the particle in its rest frame and γ is defined in Eq. (26), containing the mass ratio mI/mR. Considering a charged pion decay, the mean lifetime in its rest frame is τ0 = 26.0231 ± 0.0050 ns19. It can be shown that this measurement provides a speed dependent upper bound of
for the deviation of mI/mR from unity. (One can write the relevant mass ratio as
where δ ≪ 1 is the deviation of mI/mR from unity. By expanding γ in Eq. (18) over δ and comparing the obtained deviation with the measurement uncertainty of τ
where \({\gamma }_{0}=1/\sqrt{1-\frac{{v}^{2}}{{c}^{2}}}\) and \({\sigma }_{{\tau }_{0}}\) the uncertainty of τ0, one obtains the deviation in Eq. (19)). One can see that the upper bound becomes smaller as the speed of the pion increases. Pions with the highest measurable energies travel with speeds close to the speed of light v = 0.9999996c20. For such pions the above upper bound is δ ≲ 3.33 × 10−10. The above bound corresponds to the upper bound on the difference of eigenvalues \(\frac{1}{m{c}^{2}}({E}_{{{{{{{{\rm{int,I}}}}}}}}}-{E}_{{{{{{{{\rm{int,R}}}}}}}}})\), where one assumes m = mI = mR as the ground state value for both \({\hat{M}}_{{{{{{{{\rm{I}}}}}}}}}\) and \({\hat{M}}_{{{{{{{{\rm{R}}}}}}}}}\) for the same reason as in the WEP case. This provides a test for the LLI.
To test the QEP in strong gravitational fields and at relativistic speeds, one can for example consider the perihelion precession of planets. Following the standard procedure2 (p. 194-200) and using the metric in Eq. (39), it turns out that the perihelion precession during one orbit gets modified by the relevant mass ratio as
where a is the semimajor axis and e the eccentricity of the orbit. Considering a number of orbits N around the central object, the above expression is multiplied by N and the deviation should be detectable for a large enough N if the EP is to be violated. It can be shown that the precision of the measured precession of the perihelion of Mercury ΔφMercury = 42.9799 ± 0.0009"cy−1 21 provides an upper bound of ≲ 2.1 × 10−5 for the deviation of mG/mR from unity in a similar manner as seen from Eqs. 20, 21. The above bound corresponds to the upper bound on the difference of eigenvalues \(\frac{1}{m{c}^{2}}({E}_{{{{{{{{\rm{int,G}}}}}}}}}-{E}_{{{{{{{{\rm{int,R}}}}}}}}})\), where one assumes m = mG = mR as the ground state value for both \({\hat{M}}_{{{{{{{{\rm{G}}}}}}}}}\) and \({\hat{M}}_{{{{{{{{\rm{R}}}}}}}}}\), for the same reason as in the WEP case. This provides a test for the LPI.
Conclusions
A precise formulation of the QEP is essential for the formulation of quantum gravity, since gravity is an intrinsically quantum interaction in such a theory. In this work, adapting the formulation of Zych & Brukner7, the most general formalism for the QEP is proposed, valid in generic curved space-times, for bosonic and fermionic particles, and moving with arbitrary velocities. The most important results that one obtains here are that the soccer-ball problem and LLI are related, and that the geometry, probed by a test particle, e.g., via the geodesic motion that it follows, in general depends on the ratios of the inertial, gravitational and rest masses of the test particle. In other words, the observed geometry is no longer just a function of the background metric, but also depends on the properties of the test particle (i.e., observer) itself, unless the QEP holds exactly! While it may be argued that such observer dependence of measurable physical quantities is already a feature of standard quantum mechanics, and that the above mass ratio dependence is present in our QEP, one notes here that the aforementioned test particle dependence should be there even at the purely classical gravity level. Although seemingly counter-intuitive, there is nothing intrinsically impermissible about it, and furthermore such a necessary dependence from quantum mechanics now persists in its classical limit as well.
Methods
Special relativity
One first considers the relativistic dispersion relation and its non-relativistic limit. From the non-relativistic limit of the energy-momentum dispersion relation one finds
where one distinguishes between the inertial and rest masses. One can trace back the steps, and in a straightforward manner identify where masses mα enter the standard relativistic energy-momentum dispersion relation, which then reads
Note that the form of Eq. (24) is necessary to obtain the non-relativistic limit of Eq. (23). Since there is no corresponding non-relativistic relation for massless particles, there is no equivalent of Eq. (24) for photons. Therefore, the issues related to the equivalence principle and its quantum counterpart, as studied in this work, do not apply to massless particles. To satisfy Eq. (24) and the Lorentz scalar for the four-momenta \({p}^{\mu }{p}_{\mu }={m}_{{{{{{{{\rm{R}}}}}}}}}^{2}{c}^{2}\), the four-momentum must take the form of
where p = mIvγ is the relativistic momentum, γ is the Lorentz factor and E = mRc2γ is the energy of the particle, equivalent to Eq. (24). By using the definitions of E and p containing the Lorentz factor γ, and plugging them in the Lorentz scalar product for four-momenta \({p}^{\mu }{p}_{\mu }={m}_{{{{{{{{\rm{R}}}}}}}}}^{2}{c}^{2}\), one can immediately see that the Lorentz factor γ also contains a ratio of inertial and rest masses
The above modified Lorentz factor can be interpreted in terms of a modification to the time dilation and length contraction of a relativistic particle. Note that the above Lorentz factor is also a function of mα. Specifically, a function of the ratio mI/mR. This is the case for all of the following considerations.
To define the four-velocity, one can write the four-momentum from Eq. (25) in terms of the four-velocity as
where factoring out the masses appears as such that \({u}^{\mu }{u}_{\mu }=\frac{{m}_{{{{{{{{\rm{R}}}}}}}}}}{{m}_{{{{{{{{\rm{I}}}}}}}}}}\) is satisfied. From the above, one can read out the zeroth component of the four-velocity as
Using the modified Lorentz transformation from the main paper with the four-velocity defined above, for a particle moving in the x-direction, one finds the composition law for velocities
where v0 is the particle velocity corresponding to γ0 in the initial inertial frame, v is the velocity of the boosted inertial frame corresponding to γ and \({v}^{{\prime} }\) is the particle velocity corresponding to \({\gamma }^{{\prime} }\) as measured in the boosted inertial frame. It is interesting to note that Eq. (29b) does not explicitly depend on mα, and is identical as in the standard case. However, the addition of velocities does depend on mα, as seen in Eq. (29c), due to the dependence of the Lorentz factors on mα.
General relativity
Here, the formulation of the equivalence principle is generalized to curved space-times, using masses mα.
Weak gravitational field and the schwarzschild metric
In GR, the motion of a particle in a background space-time is described by the geodesic equation9
where τ represents the proper time. In the case of the weak field limit of GR, when gμν = ημν + hμν and ∣hμν∣ ≪ 1 (in the following, all terms of second order and higher in hμν are ignored), and the Newtonian limit for which \(\frac{{{{{{{{\rm{d}}}}}}}}{x}^{i}}{{{{{{{{\rm{d}}}}}}}}\tau }\ll \frac{c\,{{{{{{{\rm{d}}}}}}}}t}{{{{{{{{\rm{d}}}}}}}}\tau }\), one has \({\Gamma }_{00}^{i}=-\frac{1}{2}{\partial }_{i}{h}_{00}\) and the geodesic equation reduces to
To obtain h00, one must identify \(\frac{{{{{{{{{\rm{d}}}}}}}}}^{2}{x}^{i}}{{{{{{{{\rm{d}}}}}}}}{t}^{2}}\) and \(\frac{c\,{{{{{{{\rm{d}}}}}}}}t}{{{{{{{{\rm{d}}}}}}}}\tau }\) in the non-relativistic limit, using test particle masses mα. For the former, one considers Newton’s second law applied to gravity, with the inertial and gravitational masses assumed to be different
where ϕ is the gravitational potential and g = − ∇ ϕ is the gravitational field. The acceleration can be written in component form as
Since one must consider the non-relativistic limit to obtain h00, in which case we can set γ = 1, since v2 ≪ c2. To evaluate the small deviation (in this case of the 00 component) of the metric from the flat Minkowski space-time, due to a weak gravitational field, one plugs Eqs. (28) and (33) in Eq. (31) to obtain
Using the same approach, one obtains the modifications for other components of the metric. Because the equations in this work make use of a contravariant metric gμν, one must calculate the inverse of gμν. For the case of a diagonal metric, the inverse simply inverts the diagonal elements. Therefore,
since ϕ ≪ c2 in the weak field limit. These results can then be used to write the weak field metric as
where one can see that the mass ratio mG/mR modifies the metric in the temporal and all spatial components, while the inertial mass mI does not affect it. The off-diagonal elements are of order \({{{{{{{\mathcal{O}}}}}}}}({c}^{-5})\) and can be ignored. Note that the above metric is given in a Cartesian coordinate system for convenience of use in Earth based experiments. It turns out that every p2 obtains a factor 1/mI in front (or equivalently, every v2 obtains a mI in front), every ϕ obtains a factor mG in front and every c2 obtains a factor of mR in front, which can be used as a rule of thumb to enter masses mα in expressions.
For a strong gravitational field, described by a Schwarzschild space-time, the general solution for a covariant metric, before evaluating the integration constant K, has the form
The constant K is obtained in the asymptotic limit, where r ⟶ ∞ and the gravitational field is weak. Therefore, one can use the result from Eq. (34) to compare to the g00 component of the above Schwarzschild metric. It follows that
The inverse of the metric from Eq. (37), using the above constant, exactly corresponds to the modified Schwarzschild metric
In the above one can see that the mass ratio mG/mR modifies the metric only in the temporal and radial components, while the inertial mass mI does not affect it. Note that in the weak field limit of the geodesic equation from Eq. (30) for the above metric, one obtains the second Newton’s law from Eq. (32).
Kerr metric
To obtain the additional integration constant L, due to the rotation of the central object, the procedure of Papapetrou11 is followed. The general form of a Kerr metric with the constants K and L can be written as9
where Δ = r2 + Kr + L2 and \({\rho }^{2}={r}^{2}+{L}^{2}{\cos }^{2}\theta\). Since the procedure of Papapetrou11 makes use of Cartesian coordinates, it is convenient to write the above metric in Cartesian coordinates as a line element9
where \(x=(r\cos \varphi +L\sin \varphi )\sin \theta\), \(y=(r\sin \varphi -L\cos \varphi )\sin \theta\), \(z=r\cos \theta\) and r4 − r2(x2 + y2 + z2 − L2) − L2z2 = 0.
One obtains the metric of a rotating object in the asymptotic limit from the weak field Einstein equations (gμν = ημν + hμν with ∣hμν∣ ≪ 1), which is later compared to the asymptotic limit of the metric in Eq. (41), to obtain constants K and L. The weak field Einstein equations read as11
where
κ is a constant, which takes the value 8πG/c4 in the standard case (for now κ is taken to be arbitrary, because it obtains modifications, as seen in Eq. (61)) and 0Tμν is the special-relativistic energy-momentum tensor which describes the source of the gravitational field for a weak gravitational field. The Kerr metric is stationary, which means that the temporal derivative in Eq. (42) vanishes and one obtains
The general solution to the above equation reads as
where x is an arbitrary location outside the gravitating object, X is the location inside the gravitating object, over which the integral is evaluated, and R is the distance between the two locations, defined as
where the Einstein summation rule is applied and r2 = xixi is the square of the distance of the observer from the central object. Note that in this notation x1 = x, x2 = y, x3 = z, X1 = X, X2 = Y and X3 = Z. Since all quantities must be evaluated in the asymptotic limit, where \(\frac{{X}^{i}}{r}\ll 1\), one can write 1/R (as seen in Eq. (45)) as
The energy-momentum tensor of a distribution of matter is defined as 0Tμν = ρc2uμuν, where uμ = ημνuν is defined through Eq. (27). For the case of a rotating body, which rotates around the z axis in Cartesian coordinates, the components of 0Tμν turn out to be
where ρ is the matter distribution density, v is the rotational velocity of the central object, assumed to be small v ≪ c, and \({}_{0}^{0}{T}_{01}\) and \({}_{0}^{0}{T}_{0}\) are the unmodified tensor components. Note that other tensor components vanish. Using the above energy-momentum tensor and Eq. (47), one can write the two components of the solution from Eq. (45) as
where the axial symmetry of the system ensures that the 1/r terms and terms with other coordinate components vanish. Choosing the z-axis as the axis of rotation, the special relativistic angular momentum reads
The axial symmetry of the system also relates the two terms in the above definition as
Note that the energy-momentum tensor components in the definition of the angular momentum is contravariant. To make this consistent with the previous steps, it is straightforward to show the relations to the covariant energy-momentum tensor components \({}_{0}^{0}{T}^{01}={-\,}_{0}^{0}{T}_{01}\) and \({}_{0}^{0}{T}^{02}={-\,}_{0}^{0}{T}_{02}\). Using these relations, the second and third expressions in Eq. (48), and Eq. (51), the solutions from Eq. (49) become
What remains is to evaluate the constant κ. To do this, one must first consider the first integral of the geodesic equation
To compare this result with the Newtonian limit, one neglects all terms (1/r)2 and higher in the above expression, after plugging in the Kerr metric. One considers a geodesic trajectory for a particle orbiting the central object in the x − y plane. After some algebraic manipulation and differentiating over φ of Eq. (53), one obtains
where \(\ell \equiv {r}^{2}\frac{{{{{{{{\rm{d}}}}}}}}\varphi }{{{{{{{{\rm{d}}}}}}}}s}\) is a constant of motion, related to angular momentum, for an orbiting particle. According to Newtonian mechanics, the energy of an orbiting particle is constant for any distance from the central object r, and is given by
where one can clearly differentiate between mI and mG. Without loss of generality, one can assume that the particle is orbiting the central object in the x − y plane. Then, one can write the particle position vector as \({{{{{{{\bf{r}}}}}}}}=r(\cos \varphi ,\sin \varphi )\) and its angular momentum as \({{{{{{{\bf{J}}}}}}}}={{{{{{{\bf{r}}}}}}}}\times {{{{{{{\bf{p}}}}}}}}=\sqrt{{m}_{{{{{{{{\rm{R}}}}}}}}}{m}_{{{{{{{{\rm{I}}}}}}}}}}\,c\,\ell \,\hat{z}\), where p is defined by the spatial components of Eq. (27). Using the above, \({{{{{{{\bf{J}}}}}}}}=I{{{{{{{\boldsymbol{\omega }}}}}}}}={m}_{{{{{{{{\rm{I}}}}}}}}}{r}^{2}\frac{{{{{{{{\rm{d}}}}}}}}\varphi }{{{{{{{{\rm{d}}}}}}}}t}\hat{z}\) and differentiating with respect to φ, one obtains
By comparing Eqs. (54) and (56), one can identify the constant K, which is exactly the same as in the Schwarzschild and weak field cases (see Eq. (38)). Note that the angular momentum, used to derive Eq. (56) corresponds to the angular momentum of a test particle orbiting the central object and is different from the angular momentum, defined in Eq. (50), which corresponds to the rotation of the central object.
To obtain κ, one uses the first expression in Eq. (48) with Eq. (44) to obtain
which must correspond to the Poisson’s equation for the gravitational potential in the weak field limit
Note that the above Poisson’s equation does not obtain corrections, since the mG/mI ratio would modify it on both sides equally, which would then cancel. By comparing the above two equations, it turns out that
and by using the inverse relation of Eq. (43), the g00 component in the weak field approximation turns out to be
Precisely the same terms appear in components g11, g22 and g33. Since the constant K and potential ϕ are known (see Eq. (38)), the modified constant κ can easily be identified as
thus effectively modifying the Einstein equations as
By plugging the modified Einstein constant from Eq. (61) in the expressions from Eq. (52) and using Eq. (43), one obtains
At this point one can define the constant \(a\equiv \frac{{J}^{z}}{Mc}\). Using a and the above information, one can write the line element of the metric of a rotating object as
Comparing the above line element with the asymptotic limit when r ⟶ ∞ of the line element from Eq. (41), one can identify the remaining constant L as
The obtained constants K and L are then plugged in the metric from Eq. (40), which is then inverted to obtain the contravariant Kerr metric as
where \({\Sigma }^{2}({m}_{\alpha })={\left({r}^{2}+\frac{{m}_{{{{{{{{\rm{I}}}}}}}}}}{{m}_{{{{{{{{\rm{R}}}}}}}}}}{a}^{2}\right)}^{2}\,\,-\frac{{m}_{{{{{{{{\rm{I}}}}}}}}}}{{m}_{{{{{{{{\rm{R}}}}}}}}}}{a}^{2}\Delta {\sin }^{2}\theta\), \(\,\,\Delta ({m}_{\alpha })={r}^{2}-\frac{{m}_{{{{{{{{\rm{G}}}}}}}}}}{{m}_{{{{{{{{\rm{R}}}}}}}}}}\frac{2GM}{{c}^{2}}r+\frac{{m}_{{{{{{{{\rm{I}}}}}}}}}}{{m}_{{{{{{{{\rm{R}}}}}}}}}}{a}^{2}\) and \({\rho }^{2}={r}^{2}+\frac{{m}_{{{{{{{{\rm{I}}}}}}}}}}{{m}_{{{{{{{{\rm{R}}}}}}}}}}{a}^{2}{\cos }^{2}\theta\).
Feshbach-Villars formalism in curved space-time
For convenience the Feshbach-Villars formalism is used for bosons, suitably generalized to background curved space-times. The effective Hamiltonian for bosons in a curved space-time background in the Feshbach-Villars formalism takes the form
where τk (k = 1, 2, 3) are the Pauli matrices, gμν is the metric of the curved space-time background (μ, ν = 0, 1, 2, 3 with 0 representing the time component and i, j = 1, 2, 3 the spatial components), m is the mass of the boson, c the speed of light, \({\hat{p}}_{0}=i\frac{\hslash }{c}{\nabla }_{0}\), \({\hat{p}}_{i}=-i\hslash {\nabla }_{i}\), ℏ the reduced Planck constant and ∇μ the covariant derivative compatible with the background metric. Note that the Feshbach-Villars formalism in curved space-time has been previously studied in other representations, as seen in refs. 13, 14. However, the formulation with the simple representation given in Eq. (67) is introduced, for the first time to the best of the authors’ knowledge, in this work. Also, the Pauli matrices in this formalism are used here simply for convenience and are in no way related to the spin of the particle. They are also not affected by the curvature of space-time, unlike for the Dirac matrices, which are relevant for particles with half-integer spins.
The eigenstates of the effective Hamiltonian in Eq. (67) are represented by a pair of scalar functions φ and χ as
The above definition of scalar functions φ and χ is used for convenience to formulate a Hamiltonian corresponding to relativistic bosons. They are related to the two degrees of freedom of the wavefunction from the Klein-Gordon equation, which correspond to the two possible charge states12. The Hamiltonian in Eq. (67) and the wavefunction in Eq. (68) satisfy a Schrödinger-like equation
In the above, the partial derivative with respect to time is identical to the covariant derivative, i.e., ∂0Ψ = ∇0Ψ, since Ψ consists of scalar functions. This identity is taken into account in the following steps.
The wavefunction from the Klein-Gordon equation in this formalism is defined as \({\Phi }_{{}_{{{{{{{{\rm{KG}}}}}}}}}}=\varphi +\chi\). To see that the above formalism indeed represents the Klein-Gordon equation in curved space-time, one plugs Eqs. (67) and (68) in Eq. (68) to obtain two coupled differential equations
and
The above pair of equations represent a coupled system of equations for φ and χ. However, it is required to obtain the equation of motion for \({\Phi }_{{}_{{{{{{{{\rm{KG}}}}}}}}}}\), since it is defined as the wavefunction from the Klein-Gordon equation. To achieve this, Eqs. (70) and (71) can be used to retrieve additional relations between φ and χ. Eqs. (70) and (71) may then be added together and derived over time to obtain a useful identity
In the above, the identity ∇αgμν = 0 has been used. Furthermore, subtracting Eq. (71) from (70), one obtains
Plugging Eq. (72) in Eq. (73) and with a bit of algebraic manipulation, one obtains the Klein-Gordon equation in curved space-time15
As expected, in the non-relativistic limit and in flat space-time, the standard quantum mechanical results are obtained. This shows that Eq. (67) is indeed the right Hamiltonian for a bosonic particle in an arbitrary gravitational field. This demonstrates that the Klein-Gordon equation, relevant for spin 0 particles can be written as an effective Hamiltonian for a Schrödinger-like equation.
By implementing the different masses to the Hamiltonian in Eq. (67), as seen in the main paper, and following the above procedure, one obtains a modified Klein-Gordon Equation in curved space-time
where \({\tilde{\nabla }}_{\mu }=\left(\frac{1}{c}{\nabla }_{0},\sqrt{\frac{{m}_{{{{{{{{\rm{R}}}}}}}}}}{{m}_{{{{{{{{\rm{I}}}}}}}}}}}{{{{{{{\boldsymbol{\nabla }}}}}}}}\right)\), which is consistent with the quantum version of Eq. (25), when multiplied by iℏ.
References
Einstein, A. The Meaning of Relativity, Princeton University Press: Aberdeen, UK (1922).
Weinberg, S. Gravitation and Cosmology: Principles and Applications of the General Theory of Relativity, John Wiley and Sons: New York, USA (1972).
Will, C. M. The confrontation between general relativity and experiment. Living Rev. Rel. 17, 4 (2014).
Zwiebach, B. A First Course in String Theory, Cambridge University Press: Cambridge, UK (2004).
Nicolai, H., Peeters, K. & Zamaklar, M. Loop quantum gravity: an outside view. Class. Quant. Grav. 22, R193 (2005).
Amelino-Camelia, G. Doubly-special relativity: facts, myths and some key open issues. Symmetry 2, 230–271 (2010).
Zych, M. & Brukner, Č. Quantum formulation of the Einstein Equivalence Principle. Nature Phys. 14, 1027–1031 (2018).
Hossenfelder, S. The soccer-ball problem. SIGMA 10, 074 (2014).
Chandrasekhar, S. The Mathematical Theory of Black Holes, Oxford University Press: New York, USA (1983).
Schwartz, P. K. & Giulini, D. Post-Newtonian Hamiltonian description of an atom in a weak gravitational field. Phys. Rev. A 100, 052116 (2019).
Papapetrou, A. Lectures on General Relativity, D. Reidel Publishing Company: Dordrecht, NL (1974).
Feshbach, H. & Villars, F. Elementary relativistic wave mechanics of spin 0 and spin 1/2 particles. Rev. Mod. Phys. 30, 24–45 (1958).
Cognola, G., Vanzo, L. & Zerbini, S. Relativistic wave mechanics of spinless particles in a curved space-time. Gen. Rel. Grav. 18, 971–982 (1986).
Tagirov, E. A. Field-theoretical approach to quantum mechanics in curved space-times. Class. Quant. Grav. 16, 2165–2185 (1999).
Parker, L. E. & Toms, D. J. Quantum Field Theory in Curved Spacetime, Quantized Fields and Gravity, Cambridge University Press: Cambridge, UK (2009).
Parker, L. One electron atom as a probe of space-time curvature. Phys. Rev. D 22, 1922–1934 (1980).
Colella, R., Overhauser, A. W. & Werner, S. A. Observation of gravitationally induced quantum interference. Phys. Rev. Lett. 34, 1472–1474 (1975).
Littrell, K. C., Allman, B. E. & Werner, S. A. Two-wavelength-difference measurement of gravitationally induced quantum interference phases. Phys. Rev. A 56, 1767–1780 (1997).
Numao, T., Macdonald, J. A., Marshall, G. M., Olin, A. & Fujiwara, M. C. A New pi+ lifetime measurement. Phys. Rev. D 52, 4855–4859 (1995).
Brooke, G., Meyer, M. A. & Wolfendale, A. W. The energy spectrum of cosmic ray pions near sea level in the range 0.7-150 GeV. Proc. Phys. Soc. 83, 871–877 (1964).
Park, R. S. et al. Precession of Mercury’s perihelion from ranging to the messenger spacecraft. Astron. J. 153, 121 (2017).
Acknowledgements
This work was supported by the Natural Sciences and Engineering Research Council of Canada. We thank the anonymous referees for their useful comments which helped in improving the manuscript.
Author information
Authors and Affiliations
Contributions
S.D., M.F. and G.L. have contributed to all aspects of the research, with the leading input from M.F.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Physics thanks Joshua Foo and the other, anonymous, reviewers for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Das, S., Fridman, M. & Lambiase, G. General formalism of the quantum equivalence principle. Commun Phys 6, 198 (2023). https://doi.org/10.1038/s42005-023-01306-w
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s42005-023-01306-w


