Abstract
Quantum electrodynamics (QED), the first relativistic quantum field theory, describes light–matter interactions at a fundamental level and is one of the pillars of the Standard Model (SM). Through the extraordinary precision of QED, the SM predicts the energy levels of simple systems such as the hydrogen atom with up to 13 significant digits1, making hydrogen spectroscopy an ideal test bed. The consistency of physical constants extracted from different transitions in hydrogen using QED, such as the proton charge radius rp, constitutes a test of the theory. However, values of rp from recent measurements2,3,4,5,6,7 of atomic hydrogen are partly discrepant with each other and with a more precise value from spectroscopy of muonic hydrogen8,9. This prevents a test of QED at the level of experimental uncertainties. Here we present a measurement of the 2S–6P transition in atomic hydrogen with sufficient precision to distinguish between the discrepant values of rp and enable rigorous testing of QED and the SM overall. Our result ν2S–6P = 730,690,248,610.79(48) kHz gives a value of rp = 0.8406(15) fm at least 2.5-fold more precise than from other atomic hydrogen determinations and in excellent agreement with the muonic value. The SM prediction of the transition frequency (730,690,248,610.79(23) kHz) is in excellent agreement with our result, testing the SM to 0.7 parts per trillion (ppt) and, specifically, bound-state QED corrections to 0.5 parts per million (ppm), their most precise test so far.
Similar content being viewed by others
Main
The binding energy of atomic hydrogen can be expressed as1
in which n, l and J are, respectively, the principal, orbital and total electronic angular momentum quantum numbers of the energy level of interest. \({f}_{{nJ}}^{{\rm{Dirac}}}\) is the Dirac eigenvalue (∝1/n2 in leading order), whereas the second term \({f}_{{nlJ}}^{{\rm{QED}}}\) (∝1/n3 in leading order) contains the corrections from bound-state QED, such as self-energy and vacuum polarization, including muonic and hadronic contributions1. Both terms depend on the fine-structure constant α and the electron-to-proton mass ratio mp/me, which are known with sufficient accuracy from other experiments that do not require bound-state QED (refs. 1,10,11,12,13,14). The third term is the leading-order nuclear size correction for S-states (l = 0), accounting for the finite root-mean-square (rms) charge radius of the proton, rp. The second and third terms constitute the Lamb shift and contribute about 1 ppm and 100 ppt, respectively, to the 2S–6P transition frequency (Extended Data Table 1). The unitless terms are converted to SI units (International System of Units) using the Rydberg constant R∞.
To compare measured energy levels or transition frequencies with equation (1), rp and R∞ must be known (speed of light in vacuum c and Planck’s constant h are defined). In practice, rp and R∞ are largely determined from such measurements themselves and more than two measurements of distinct transitions are necessary to test equation (1).
A special case is the determination of rp with laser spectroscopy of muonic hydrogen8,9,15. In this exotic atom, the electron is replaced with a negative muon, whose larger mass increases CNS of equation (1) by four orders of magnitude, allowing a precise determination of rp without requiring other high-precision input. Because the nuclear size correction scales as 1/n3 as the other QED corrections, discrepant rp values from atomic and muonic hydrogen can indicate missing or incomplete QED terms (Methods). Notably, the value of rp from the muonic measurement was found to be significantly smaller (>5σ) than the then-established value (CODATA 2014 (ref. 16)).
This proton radius puzzle led to extensive research efforts2,3,4,5,6,7,17. We first addressed it with a precision measurement of the 2S–4P transition in atomic hydrogen2, which favoured the muonic result, but could not conclusively (>5σ) rule out the previous value. Subsequent measurements in atomic hydrogen have followed3,4,5,6,7 but they are partly discrepant with the muonic value and with each other, and none is precise enough to conclusively test the muonic value, as visualized in Fig. 1. Until now, this has prevented a verification of QED at the level of experimental uncertainties.
Previous determinations of rp from atomic hydrogen spectroscopy (refs. 2,3,4,5,6,7; black circles) are partly discrepant with each other and with the value of rp from spectroscopy of muonic hydrogen (refs. 8,9,15; violet square) and therefore could not conclusively resolve the initial 5.6σ discrepancy between the 2010 muonic value and the then-established larger value (as summarized in the CODATA 2014 global adjustment of fundamental constants16; brown hexagon). The value of rp from atomic hydrogen spectroscopy of the 2S–6P transition in this work (blue bar and diamond) is at least 2.5-fold more precise than other atomic hydrogen determinations and in excellent agreement with the muonic value. It disagrees with the fourfold less precise CODATA 2014 value by 5.5σ. The rp values are determined by combining each measurement with the 1S–2S transition frequency18 and equation (1) (Pearson correlation coefficient r < 0.05 between rp values). The most recent CODATA 2022 global adjustment (ref. 1; green triangle) essentially corresponds to the muonic value owing to the exceptionally low uncertainty of the latter. Error bars show one-standard-deviation uncertainties. Electron–proton scattering data are not shown, as different analyses give significantly different values of rp (refs. 1,48). Lattice QCD calculations of rp show promise but are not yet competitive49. See Extended Data Fig. 2 for the Rydberg constant R∞ from atomic hydrogen combined with the muonic value of rp.
Here we report on laser spectroscopy of the 2S–6P transition in atomic hydrogen with sufficient precision to distinguish between the discrepant values of rp. This precision corresponds to finding the transition frequency to one part in 15,000 of the experimental linewidth, to our knowledge unprecedented for laser spectroscopy, requiring a thorough understanding of any asymmetric distortions of the line shape at that level and a large experimental signal-to-noise ratio. By combining our measurement with the precisely known 1S–2S transition frequency18, we determine rp with 2.5-fold higher precision than the previous best determination from atomic hydrogen5 (Fig. 1). Our value of rp is in excellent agreement with the muonic value but fourfold more precise than and in significant disagreement (5.5σ) with the CODATA 2014 value16. Consequently, we use the muonic value of rp as input to equation (1) (along with the 1S–2S transition frequency), which allows us to compare the SM prediction of the 2S–6P transition frequency with our measurement. This constitutes a test of the SM to 0.7 ppt and of bound-state QED corrections to 0.5 ppm.
Principle of the measurement
2S–6P transition
We study the 2S–6P transition in a cryogenic beam of hydrogen atoms using Doppler-free one-photon laser spectroscopy. Although the transition has been previously observed with laser spectroscopy19,20, this work presents a substantial improvement. Using a linearly polarized, 410-nm spectroscopy laser, we alternately examine two dipole-allowed transitions from the metastable initial \({2{\rm{S}}}_{1/2}^{F=0},{m}_{F}=0\) level: the 2S–6P1/2 transition to the \({6{\rm{P}}}_{1/2}^{F=1},{m}_{F}=0\) level and the 2S–6P3/2 transition to the \({6{\rm{P}}}_{3/2}^{F=1},{m}_{F}=0\) level, as shown in Fig. 2a (F, total angular momentum quantum number; mF, magnetic quantum number). The excited 6P levels rapidly decay, directly or through cascades, to the 1S and 2S manifolds, resulting in a Γ = 3.90 MHz natural transition linewidth. These decays, predominantly the direct Lyman-ε decay to the 1S manifold, are the experimental signal and the fluorescence line shape is observed in line scans by recording this signal at different spectroscopy laser detunings. A fraction of γei/Γ = 3.9% or 7.9% of decays from the excited level lead back to the initial 2S level for the 2S–6P1/2 or 2S–6P3/2 transitions, respectively. Notably, quantum interference (QI) between excitation–decay paths that go through either excited level but lead to the same final level can cause substantial distortions of the fluorescence line shape. The associated line shifts are on the order of Γ2/ΔνFS(6P) ≈ Γ/100 (refs. 2,21,22), in which ΔνFS(6P) ≈ 405 MHz is the 6P fine-structure splitting between the excited levels. Because the magnitude and sign of the distortions depend on the detection direction (relative to the laser polarization), here we use a large detection solid angle and a magic polarization angle to strongly suppress the shift22.
a, Relevant level scheme of atomic hydrogen (not to scale). Solid arrows indicate laser-driven transitions and dashed arrows indicate spontaneous decay. Spectroscopy laser is on resonance with 2S–6P1/2 (dark blue) or 2S–6P3/2 (light blue) transitions. Levels shown in grey are not resonantly coupled by lasers. b, Key components of experimental apparatus (to scale, cutaway). Dashed black lines (orthogonal to spectroscopy laser) visualize atomic beam offset angle α0 (exaggerated) and laser polarization angle θL. c, Typical line scan of the 2S–6P1/2 transition (acquired within 40 s; full detuning range is ±50 MHz; Methods). The fluorescence signal (top detector; kcts, kilocounts) is recorded for different velocity groups τi with mean speeds \(\bar{v}\), with three shown here (orange, green and purple circles; τ16 scaled (×3) for visibility). Error bars show expected one-standard-deviation (σ) shot noise. The FWHM linewidth ΓF reduces for slower velocity groups as Doppler broadening reduces. Solid lines show simulated line shape (scaled and offset to match the signal) and the dotted orange line shows the Voigt line shape fit to τ2 (see Extended Data Fig. 1 for fit residuals). P2S–6P = 10 μW spectroscopy laser power and α0 = 0 mrad were used. d, Resonance frequencies ν0 (circles) determined from Voigt line shape fits to velocity groups of scan of panel c versus \(\bar{v}\). Error bars show 1σ fit uncertainty. Extrapolation to zero speed (blue line; blue shading, 1σ confidence interval) gives Doppler-free transition frequency νe and Doppler slope κ. ν0 has been corrected for light force, QI and second-order Doppler shifts and all other corrections. e, Histogram of all 598 detector-averaged Doppler-free transition frequencies νe (data group G3 of Extended Data Table 3) determined with the same experimental parameters as the line scan of panels c and d.
Experimental apparatus
The key in-vacuum components of the experimental apparatus (described in detail in ref. 23) are shown in Fig. 2b. Briefly, a cryogenic beam of hydrogen atoms is formed by a copper nozzle (circular aperture with r1 = 1 mm radius) held at temperature TN = 4.8 K. The atoms are prepared in the initial \({2{\rm{S}}}_{1/2}^{F=0},{m}_{F}=0\) level by Doppler-free two-photon excitation from the 1S ground level with a preparation laser (243 nm wavelength; Fig. 2a,b), collinear with the atomic beam. The divergence of the atomic beam is limited to approximately 10 mrad in the transverse (x) direction by a collimating aperture (1.2 mm width, placed 154 mm after the nozzle).
The atomic beam enters the 2S–6P spectroscopy region inside a cylindrical detector assembly, in which, at a distance L = 204 mm from the nozzle, it crosses counterpropagating (along x) spectroscopy laser beams at an adjustable atomic beam offset angle |α0| = 0–12 mrad from the orthogonal. In the ideal case of laser beams with identical wavefront curvature and power, this excitation scheme produces a line shape whose centre of mass is free of first-order Doppler shifts (but not necessarily free of Doppler broadening), as the interaction with the respective beams results in Doppler shifts of equal magnitude but opposite sign. Here an active fibre-based retroreflector (AFR)24,25,26, consisting of polarization-maintaining fibre, collimator and high-reflectivity mirror, generates the required high-quality, wavefront-retracing beams (2.2 mm 1/e2 intensity radius; P2S–6P = 5–30 μW power in each beam). The AFR is attached to the rotatable detector cylinder, allowing an in situ adjustment of α0 (1 mrad accuracy). We align α0 close to zero to avoid splitting the line shape into two Doppler components from the counterpropagating beams, except when characterizing the light force shift (LFS; see below). The polarization angle θL of the laser beams, relative to the axis of the cylinder (along y), is set to 56.5°, the magic angle at which QI distortions are suppressed, or orthogonal to it (146.5°).
The photons emitted by the 6P decay eject photoelectrons from the cylinder walls, which are drawn to and counted with channel electron multipliers at the top and bottom of the cylinder (top and bottom detectors). Each detector covers >20% of the total solid angle, further suppressing QI distortions. A segmented Faraday cage surrounds the spectroscopy region, shielding external electric fields and allowing the application of bias fields to characterize stray electric fields and the dc-Stark shift caused by such fields (Methods).
Doppler-free one-photon spectroscopy
We investigate different atomic velocity groups τi (i = 1–16; Extended Data Table 2) by periodically blocking the 1S–2S preparation laser, thereby intermittently stopping the production of 2S atoms, and recording the signal as a function of delay time τ = 10–2,560 μs. The longer the τ, the slower the 2S atoms contributing to the signal. The speed distribution of atoms contributing to the signal is well described by a Maxwell–Boltzmann flux distribution with an extra multiplicative factor exp(−vcut-off/v) with a characteristic cut-off speed vcut-off ≈ 50 m s−1, accounting for the loss of slower atoms through collisions (Methods). The mean speeds \(\bar{v}\) of the velocity groups cover a wide range (253(5) to 65(1) m s−1; overall mean speed \(\langle \bar{v}\rangle =195(6)\,{\rm{m}}{{\rm{s}}}^{-1}\)) and their transverse velocities are approximately Gaussian distributed with a full width at half maximum (FWHM) ranging from 3.4 to 0.6 m s−1 (parentheses give the standard deviation over the data groups of Extended Data Table 3).
Figure 2c shows the top detector signal for three velocity groups (circles) for a typical line scan of the 2S–6P1/2 transition. Fast velocity groups, such as τ2 (orange circles), have a FWHM linewidth ΓF substantially larger than the natural linewidth Γ, owing to Doppler broadening from their large transverse velocity widths. Conversely, for slower velocity groups (green and purple circles), ΓF approaches Γ as the transverse velocity width decreases. Overall, ΓF ranges from 9.6(4) to 5.5(6) MHz (Extended Data Table 2).
Our line shape simulations (solid lines; here the QI model is shown; Methods) are in excellent agreement with the experimental data, reproducing the Doppler broadening with vcut-off as the only free parameter. The resonance frequency ν0 of each velocity group is determined by fitting simple Voigt or Voigt doublet line shapes (dotted line; Methods) to the data. This differs from the approach taken in our previous 2S–4P measurement2, in which a theoretically motivated, asymmetric line shape model was used to account for QI distortions. Here the QI distortions are much smaller, owing to the magic polarization angle and large detection solid angle, and distortions from the LFS dominate (Extended Data Fig. 1), for which no equivalent line shape model is known to us. Instead, we fit the same simple line shape model to the simulations and correct the experimental resonance frequency with our LFS and QI simulations.
To remove any residual first-order Doppler shift, we use the linear model \({\nu }_{0}={\nu }_{{\rm{e}}}+\kappa \bar{v}\) to extrapolate the resonance frequencies ν0 of each line scan to zero speed (Fig. 2d). This gives the Doppler-free transition frequency νe, the Doppler slope κ and the resulting effective frequency correction \(\Delta {\nu }_{{\rm{e}}}=-\kappa \langle \bar{v}\rangle \) (Methods). Figure 2e shows νe of all 598 line scans recorded with the same experimental parameters as in Fig. 2c,d. In total, the dataset presented here contains 3,155 line scans, acquired in three measurement runs (A, B and C) and grouped into 17 experimental parameter combinations (data groups; Extended Data Table 3).
Light force shift
Just as light waves diffract on a periodic structure, matter waves can diffract on the periodic structure formed by a standing wave of light, as first predicted for electrons by Kapitza and Dirac27. Here a similar diffraction can occur as the atoms, which may be understood as matter waves, cross the standing intensity wave formed by the counterpropagating spectroscopy laser beams used to suppress the first-order Doppler shift (Fig. 2b). This effect, along with scattering on the standing wave, leads to a distortion of the fluorescence line shape and a line shift (LFS). Although such shifts have been observed in other experiments using standing waves28,29,30, the behaviour and size of the shift and the necessary theoretical treatment are highly dependent on the exact experimental conditions. A travelling wave can be used to avoid the LFS31 but this approach has not yet demonstrated the level of Doppler shift suppression required here.
Diffraction of matter waves, as for light waves, requires some degree of spatial coherence, with diffraction becoming important when the atoms’ transverse coherence length lc,t along the standing wave is comparable with its periodicity of λ/2 = 205 nm. Treating the cryogenic nozzle as a thermal source of atoms, we find lc,t to be λdB,th = 0.8 nm (ref. 32) and therefore much smaller than λ/2, in which \({\lambda }_{{\rm{dB}},{\rm{th}}}=\sqrt{{h}^{2}{k}_{{\rm{B}}}{T}_{{\rm{N}}}/2{\rm{\pi }}{m}_{{\rm{H}}}}\) is the thermal de Broglie wavelength (mH, hydrogen mass; kB, Boltzmann constant). However, as is well known from the van Cittert–Zernike theorem33, the transverse coherence length is enhanced by propagation, as seen in matter wave interference of large molecules from a thermal source34. At a distance L from the nozzle, this results in
for atoms with a speed v and de Broglie wavelength of λdB = h/mHv. For v = 200 m s−1, the mean speed of atoms examined in the experiment, lc,t is 129 nm at the standing wave and hence comparable with its periodicity. We therefore need to model the atoms as partially coherent matter waves, that is, the atoms are partially delocalized over the standing wave, unlike the localized description35 valid for different experimental conditions28,29.
We use the Wigner function36 to find a quantum-mechanical description of the atomic beam, quantizing the motion along the standing wave while treating other directions classically. This results in a comparable value of lc,t as estimated above23. The interaction of the atoms with the standing wave can be described in the combined basis of internal energy levels and external momenta along the standing wave (Supplementary Methods). The absorption of a photon from the forward-travelling (+) or backward-travelling (−) beam excites atoms from the 2S to the 6P level and changes their momentum along the beams by ±ħKL (wavenumber KL = 2π/λ; ħ = h/2π). Likewise, stimulated emission into either beam returns atoms to the 2S level and changes their momentum by ∓ħKL. This corresponds to a change in velocity by the recoil velocity, vrec = ħKL/mH ≈ 0.97 m s−1, comparable with the typical transverse velocity vx of the atoms. Because the 6P level predominantly spontaneously decays to the 1S ground level, from which there is no re-excitation while the signal is recorded, the maximum relevant momentum change is here ±4ħk.
The small fraction γei/Γ of the spontaneous decays back to the initial 2S level leads to a random momentum change along the direction of the standing wave, as the direction of the emitted photon is randomly distributed37. The associated Doppler shift of such a decay (at most twice the recoil shift of Δνrec = 1,176.03 kHz) is below the natural linewidth Γ and therefore the atom can be re-excited to the 6P level and ultimately contribute to the signal. However, because γei/Γ ≪ 1, only at most one such back decay is relevant here.
We first simulate the interaction of the atoms with the standing wave for an atom initially in a momentum eigenstate, that is, as a fully delocalized matter wave, with transverse velocity vx ≈ v(δα + α0). Figure 3a shows the LFS νLFS, found with a Voigt line shape fit to the simulated line shape, for an atom with v = 200 m s−1 (note that νLFS is always symmetric in vx). The hypothetical situation without back decay to the 2S level (dotted black line) describes pure diffraction of the matter wave on the standing wave. At vx = vrec, the Bragg condition is met and the atoms coherently scatter photons between the two counterpropagating beams, resulting in a narrow resonance with shifts exceeding 200 kHz (Fig. 3b). Below this resonance, the shift is negative and approximately constant around zero vx, and above the resonance, the shift is positive and tends to zero as vx increases. Allowing back decay, on the other hand, allows scattering of the atoms on the standing wave. This primarily leads to a positive shift above the resonance that approximately scales with γei/Γ, as shown for the 2S–6P1/2 (γei/Γ = 3.9%; blue solid line) and 2S–6P3/2 (γei/Γ = 7.9%; dashed red line) transitions.
a, Simulation of LFS νLFS, resulting from interaction of atoms with the standing intensity wave (periodicity λ/2 = 205 nm) formed by counterpropagating spectroscopy laser beams. The simulation describes the atoms as delocalized over nodes and antinodes, crossing the standing wave with speed v = 200 m s−1 and at angle δα + α0 to orthogonal (transverse velocity of vx ≈ v(δα + α0); Fig. 2b). Results are shown for 2S–6P1/2 (solid blue line) and 2S–6P3/2 (dashed red line) transitions (at equal Rabi frequency), for which, respectively, 3.9% and 7.9% of the 6P level decays lead back to the 2S level (Fig. 2a). The hypothetical situation without back decay is also shown (dotted black line). At vx = vrec ≈ 0.97 m s−1 (the recoil velocity), a Bragg resonance occurs, leading to a large shift. The shift is symmetric in vx and negative (positive) for |vx| ≲ vrec (|vx| ≳ vrec). The simulated distribution (smoothed and scaled for visibility) of vx for an atomic beam offset angle α0 of 0 mrad (12 mrad) is indicated at the top (bottom) for velocity groups τ2, τ13 and τ16 (arbitrary vertical units). b, Detail of the Bragg resonance. c, Similar to Fig. 2c but for a line scan with α0 = 12 mrad, which splits the line into two Doppler components. The line is fitted with Voigt doublet line shape (Methods). d, Doppler-free transition frequency νe (blue circles) of the 2S–6P1/2 transition for data taken at P2S–6P = 30 μW and |α0| = 0, 7.5 and 12 mrad and corrected for all systematic effects except LFS. The simulation of LFS (orange circles) is in excellent agreement with the experimental data. Error bars show combined statistical and systematic uncertainty (experimental data) or uncertainty from input parameters (simulation).
The partially coherent atomic beam corresponds to an incoherent sum over atoms in various momentum eigenstates (Supplementary Methods). Because the transverse velocity spread of the beam is on the order of vrec (see shaded area at the top of Fig. 3a showing the velocity distribution of selected velocity groups and Extended Data Table 2), this prevents us from experimentally resolving the Bragg resonance and partly averages out the shift (to at most −0.96 kHz and −1.28 kHz for the 2S–6P1/2 and 2S–6P3/2 transitions, respectively, both occurring for velocity group τ13).
To test our model and simulation of the LFS, we measured the transition frequency of the 2S–6P1/2 transition at atomic beam offset angles of α0 = ±7.5 mrad and α0 = ±12 mrad, as well as the dataset with α0 = 0 mrad (Fig. 3c,d). In particular, we find the difference in Doppler-free transition frequency between the data taken at α0 = ± 12 mrad and α0 = 0 mrad (both for P2S–6P = 30 μW), having applied a correction of 0.17(28) kHz for all systematic effects except the LFS, to be
The uncertainty is dominated by the statistical uncertainty of 1.63 kHz of the ±12 mrad data. This difference is in excellent agreement with the LFS simulation, which predicts νe,LFS(±12 mrad) − νe,LFS(0 mrad) = 4.21(61) kHz. The accompanying distortion of the experimental line shape, clearly observable in the asymmetry of fit residuals (Extended Data Fig. 1), is distinctly different for the two values of α0 and in excellent agreement with the simulations. The measured LFS at α0 = ±7.5 mrad is also in agreement with the simulation but has a comparatively large statistical uncertainty of 3.04 kHz (Fig. 3d). This comparison is also a powerful test of the Doppler shift suppression scheme (equation (3) includes a Doppler shift correction of Δνe = −1.92(1.81) kHz) because of the large difference in α0 and of the data analysis because of the qualitatively different line shapes involved.
Quantum interference
We use the QI simulations developed and extensively tested in ref. 2 to estimate any remaining QI shift, which we find to be −0.25(36) kHz and 0.05(15) kHz for the 2S–6P1/2 and 2S–6P3/2 transitions, respectively (Methods). These values include a small subset of data taken orthogonal to the magic angle, at which QI distortions are larger, to test our simulations. Also, we make use of the fact that the distortions are of opposite sign (and different magnitude) for the two transitions22 and combine the 2S–6P1/2 and 2S–6P3/2 transition frequencies with a 1:2 ratio into the 2S–6P fine-structure centroid (Methods). This reduces the shift to only −0.05(2) kHz.
2S–6P transition frequencies
By averaging all available data (Fig. 4), we find the transition frequencies of the 2S–6P1/2 transition (ν1/2) and the 2S–6P3/2 transition (ν3/2) to be
The final one-standard-deviation uncertainties (σ) are the combined statistical (‘stat’) and systematic (‘sys’) uncertainties, with the former corresponding to the Doppler shift extrapolation uncertainty. All corrections and uncertainties are listed in Extended Data Table 4, with contributions not discussed in the main text detailed in Methods and Supplementary Methods. The results from the two detectors, averaged here, agree within their uncertainties. The average Doppler slopes \(\bar{\kappa }\) of both transitions are compatible with zero.
a, Measured transition frequencies νe of the 2S–6P1/2 transition for different data groups (that is, combinations of experimental parameters and measurement runs), as labelled below each data point and listed in Extended Data Table 3. Data were taken for various combinations of spectroscopy laser power (10, 20 and 30 μW), atomic beam offset angle α0 (α0 = 0 mrad unless stated otherwise) and laser polarization angle θL (magic angle θL = 56.5° by default; non-magic angle θL = 146.5° marked accordingly). The data were collected over several months in 3-week-long runs, run A (circles), run B (diamonds) and run C (squares). Error bars show combined statistical and systematic one-standard-deviation (σ) uncertainty and the purple-shaded area shows the weighted mean of all data and its 1σ uncertainty. Black markers show the results without experiment-specific corrections, including for the light force, QI and first-order and second-order Doppler shifts (Extended Data Table 4). b, Doppler slopes κ for the data groups of panel a, determined in situ using velocity-resolved detection (Fig. 2). The mean speed of the atomic beam is \(\langle \bar{v}\rangle \approx 195\,{\rm{m}}\,{{\rm{s}}}^{-1}\). \(\bar{\kappa }\) is the weighted mean Doppler slope. c,d, Same as panels a and b but for the 2S–6P3/2 transition. Spectroscopy laser powers have a 2:1 ratio for the 2S–6P1/2 and 2S–6P3/2 transitions to keep their Rabi frequencies identical. The Pearson correlation coefficient between νe and κ is r = −0.78 over all data groups.
By taking the difference of the two measured transition frequencies, we find the 6P fine-structure splitting between the \({6{\rm{P}}}_{1/2}^{F=1}\) and \({6{\rm{P}}}_{3/2}^{F=1}\) levels,
Extended Data Table 4 lists the corrections and uncertainties. The QED prediction ΔνFS,QED(6P) = 405,164.51(1) kHz (Methods), which at our level of accuracy does not depend on rp, is in excellent agreement (ΔνFS(6P) − ΔνFS,QED(6P) = 0.11(97) kHz). This comparison tests the (uncorrelated) Doppler shift extrapolation and corrections for the light force, QI and dc-Stark shifts, with the last two of opposite sign for the two transitions.
Finally, we combine the two transition frequencies into the 2S–6P fine-structure centroid,
with the hyperfine-structure (HFS) correction ΔνHFS(ν2S–6P) = −132,985.25(1) kHz (Methods). The corrections and uncertainties for ν2S–6P are listed in Table 1. The total applied correction, excluding the precisely known recoil shift and HFS corrections, corresponds to 3.6σ, with the largest individual correction (for the LFS) corresponding to 2.4σ. The relative uncertainty corresponds to a sixfold improvement over our previous 2S–4P measurement2.
R ∞ and r p from atomic hydrogen
Combining the 2S–6P fine-structure centroid ν2S–6P of equation (8) with a measurement of the 1S–2S transition frequency18, and using equation (1), we determine the proton rms charge radius as
The total uncertainty arises mostly from the uncertainty of ν2S–6P (‘exp’), with the QED uncertainty of equation (1) (‘QED’) approximately threefold lower (contributions from the 1S–2S transition frequency and other physical constants are negligible). Figure 1 shows equation (9) with other relevant determinations of rp. Equation (9) is the most precise value for rp from any measurement other than the muonic measurement9,15, being 2.5-fold and sixfold more precise than the next best determination from atomic hydrogen5 and our 2S–4P measurement2, respectively. Equation (9) is in excellent agreement with the muonic measurement15 (rp = 0.84060(39) fm) but disagrees with the CODATA 2014 proton radius16 by 5.5σ. Instead of rp, we may also determine the 1S Lamb shift as \({{\mathcal{L}}}_{\exp }(1{\rm{S}})\,=\,{\mathrm{8,172,744.13}(14)}_{{\rm{Q}}{\rm{E}}{\rm{D}}}{(3.56)}_{\exp }\,{\rm{k}}{\rm{H}}{\rm{z}}\,=\) \(\mathrm{8,172,744.1}(3.6)\,{\rm{kHz}}\) (Methods).
The Rydberg constant can be similarly extracted from the combination of the 1S–2S and 2S–6P transition frequencies, giving R∞ = 10,973,731.568152(14) m−1 (1.3 ppt; Pearson correlation coefficient r = 0.94 with equation (9)). However, because the theory predictions of both frequencies depend on R∞ and rp, we cannot make full use of the relative precision of 0.66 ppt of ν2S–6P.
Test of the SM prediction
Having confirmed the muonic value of rp, we now explicitly compare the SM prediction ν2S–6P,SM for the 2S–6P fine-structure centroid with the experimentally determined value ν2S–6P. Combining the 1S–2S transition frequency18 and the muonic value of rp (ref. 15) with equation (1) (Methods), we find
The prediction is in excellent agreement with our experimental result with a difference of ν2S–6P − ν2S–6P,SM = 0.00(53) kHz, corresponding to a test of the SM at a relative precision of 0.7 ppt. This precision is comparable with the test of the SM prediction of the electron magnetic moment, which is at present limited to 0.7 ppt by discrepant measurements of α (ref. 12). Likewise, we compare the 1S Lamb shift prediction \({{\mathcal{L}}}_{{\rm{QED}}}(1{\rm{S}})\,=\,{\mathrm{8,172,744.1}(1.3)}_{{\rm{QED}}}{(1.0)}_{{{\rm{r}}}_{{\rm{p}}}}\,{\rm{kHz}}\,=\,\mathrm{8,172,744.1}(1.7)\,{\rm{kHz}}\) with \({{\mathcal{L}}}_{\exp }(1{\rm{S}})\), which, to our knowledge, is the most precise test (0.5 ppm) of bound-state QED corrections so far.
Extended Data Table 1 lists the QED corrections and their uncertainties. Our experimental uncertainty is comparable with the muonic and hadronic vacuum polarization corrections and reaches the level of three-photon corrections (∝α5), matching the highest-order terms in the prediction of the electron magnetic moment12. Although the experimental uncertainty is 2.7-fold larger than the QED uncertainty (dominated by two-photon and radiative-recoil corrections), the latter is partly estimated by the expected size of uncalculated terms1,38 as opposed to experimental tests. In fact, a recent recalculation of the two-photon self-energy39 would shift \({{\mathcal{L}}}_{{\rm{QED}}}(1{\rm{S}})\) on the order of the experimental σ of \({{\mathcal{L}}}_{\exp }(1{\rm{S}})\).
Equivalently, we can compare the values of R∞ determined from ν2S–6P or the 1S–2S transition frequency, each in combination with the muonic value of rp and equation (1). For ν2S–6P, this gives
with the uncertainty limited by the precision of ν2S–6P. Using the 1S–2S transition frequency yields a compatible R∞ = 10,973,731.5681523(65) m−1 (r1S–2S = 0.40 correlation with equation (11)), limited by the uncertainty of \({{\mathcal{L}}}_{{\rm{QED}}}(1{\rm{S}})\). Equation (11) is compatible with and 50% more precise than the R∞ world average1, which has an expanded uncertainty accounting for scatter in the input data. Adding the 2S–6P measurement will probably reduce scatter in future adjustments as less precise measurements are removed. This situation is illustrated in Extended Data Fig. 2. Our measurement can also be used to improve new physics constraints on weakly interacting bosons with masses in the keV range40,41.
The techniques demonstrated here may be used with any 2S–nP transitions in atomic hydrogen and deuterium42, with a precision matching or exceeding that of SM predictions feasible. Together with complementary approaches43,44,45,46,47, we expect this to substantially advance bound-state QED tests.
Methods
Data acquisition
Here we give further relevant details on the experimental scheme and apparatus (both described in detail in ref. 23). The run time of the cryogenic atomic beam is limited to freezing cycles of approximately 2 h by the accumulation of frozen (molecular) hydrogen inside the nozzle, which is removed by heating the nozzle to room temperature. The vacuum (2 × 10−7 mbar, dominated by molecular hydrogen) inside the 2S–6P spectroscopy region is maintained by differential pumping with a cryopump to minimize pressure shifts50,51,52, with the temperature of the apparatus allowed to equilibrate on each measurement day before collecting data.
The power of the linearly polarized, 243-nm 1S–2S preparation laser53,54 is resonantly enhanced in an in-vacuum, standing-wave cavity to P1S–2S ≈ 1 W per direction (297 μm 1/e2 intensity waist radius). The observed 1S–2S transition linewidth is approximately 3 kHz (FWHM; atomic detuning, as opposed to laser detuning), limited by single-photon ionization of the 2S level55. Therefore, the \({2{\rm{S}}}_{1/2}^{F=1}\) levels are only populated by Doppler-sensitive two-photon excitation, leading to a population of approximately 7 × 10−7 in each sublevel relative to the population in the \({2{\rm{S}}}_{1/2}^{F=0}\) level. The detuning of the preparation laser is set several times per freezing cycle by observing the 1S–2S transition. An equal-slit-width optical chopper running at 160 Hz periodically blocks the preparation laser (which sets delay time τ = 0 μs) to enable the velocity-resolved detection. The channel electron multipliers are switched off with a fast high-voltage switch while the preparation laser is unblocked to prevent saturation from scattered 243-nm light.
By using a linearly polarized 2S–6P spectroscopy laser53,54 and the \({2{\rm{S}}}_{1/2}^{F=0}\) level as the initial level, only transitions to the \({6{\rm{P}}}_{1/2}^{F=1}\) hyperfine level (2S–6P1/2 transition) and the \({6{\rm{P}}}_{3/2}^{F=1}\) hyperfine level (2S–6P3/2 transition) are dipole-allowed, whereas the excitation of the \({6{\rm{P}}}_{1/2}^{F=0}\) and \({6{\rm{P}}}_{3/2}^{F=2}\) levels is prevented by angular momentum conservation (Fig. 2a). The line strengths (∝μ2, in which μ is the dipole moment) of the 2S–6P1/2 and 2S–6P3/2 transitions have a 1:2 ratio. We use spectroscopy laser powers P2S–6P with a ratio of 2:1 for the two transitions to keep their Rabi frequencies \({\varOmega }_{0}\propto \mu \sqrt{{P}_{2{\rm{S}}-6{\rm{P}}}}\) identical (the peak Rabi frequency is (2π × 126) krad s−1 at our highest spectroscopy laser powers of 30 and 15 μW, respectively). Most 6P decays (branching ratio Γe–1S/Γ = 88.2%) are Lyman decays to the 1S manifold, with the most energetic, direct Lyman-ε decay (Γdet/Γ = 80.5%) dominating, whereas the remaining γe–2S/Γ = 11.8% are Balmer decays to the 2S manifold, of which in turn a fraction γei/Γ leads back to the initial \({2{\rm{S}}}_{1/2}^{F=0}\) level (see Section 1.2 in the Supplementary Methods). The metastable 2S levels are treated as stable here, as their natural lifetime (122 ms) is much longer than the time the atoms spend in the atomic beam (see Section 2.6 in the Supplementary Methods for the 2S decay contribution to the signal background).
The channel electron multipliers count the fluorescence photons from the 6P decays, either by detecting the photoelectrons emitted by the photons from the detector cylinder walls or, to a lesser extent, directly detecting the photons. Because the photoelectron yield strongly increases with photon energy (for both colloidal graphite and oxidized aluminium, the surface materials of the Faraday cage and the detector cylinder, respectively), fluorescence from Lyman-ε decay (13.2 eV photon energy) constitutes approximately 97% of the signal detected by the channel electron multipliers. The counts are binned into the 16 velocity groups by their delay time τ, with the bins chosen to cover a wide range of mean speeds \(\overline{v}\) while exhibiting a sufficient signal-to-noise ratio (bin width 50–550 μs; Extended Data Table 2). For each line scan, we accumulate counts over 160 chopper cycles at each spectroscopy laser detuning. Intermittently, excess scatter and spiking was observed for the bottom detector and its signal was subsequently discarded (≈11% of line scans; Extended Data Table 3). We attribute this to the bottom detector being cooled down to close to, and possibly below, its lower operating temperature limit because of its vicinity to the cryopump.
At least once per measurement day, the nozzle and the collimating aperture are centred on the preparation laser. At the start of each freezing cycle, the atomic beam offset angle α0, which is controlled by a linear motor equipped with a position sensor, is aligned to zero with a 1 mrad alignment uncertainty. This is achieved by blocking the returning beam of the spectroscopy laser (using an in-vacuum shutter in front of the high-reflectivity mirror of the AFR25,26), determining the (now unsuppressed) Doppler slope κ for several angles and moving to the angle at which κ is zero, which is set as α0 = 0 mrad. To record line scans at a non-zero angle ±α0, we first move to either +α0 or −α0 (chosen randomly) and then to the opposite sign, recording typically 5–10 scans at each angle, and repeating this procedure several times per freezing cycle. The fibre–collimator distance of the AFR is optimized25,26 at least once per freezing cycle.
We use fixed sets of 30 symmetric (15 unique) detunings Δ of the spectroscopy laser frequency to sample the 2S–6P fluorescence line shape, with different sets used for α0 = 0, ±7.5 and ±12 mrad to account for the different line shapes (see Section 5.1.3 and Table 5.2 of ref. 23), all of which have ±50 MHz as the largest detuning. The detunings were chosen to minimize the statistical uncertainty of the Doppler-free transition frequency νe, whereas the number of detunings and the acquisition time (1 s) at each detuning were chosen to balance between sufficient line sampling, signal-to-noise ratio and number of line scans per freezing cycle. For each line scan, the order of the detunings is randomized to minimize the influence of drifts in the signal. At the beginning of each freezing cycle, the centre laser frequency (to which the detunings are added) was chosen randomly from a normal distribution. This distribution was centred on the 2S–6P transition frequency expected from the muonic measurement of the proton radius9, with a standard deviation of 12 kHz to cover the transition frequency expected from the CODATA 2014 value of the proton radius16. The laser frequencies are referenced to the caesium frequency standard using an optical frequency comb56,57 and a global navigation satellite system (GNSS)-referenced hydrogen maser (see Section 2.7 in the Supplementary Methods).
We switched between examining the 2S–6P1/2 and 2S–6P3/2 transitions several times during the measurement, except during measurement run C, for which only the 2S–6P1/2 transition was examined for several values of α0 (Extended Data Table 3).
Voigt and Voigt doublet line shapes
A Voigt line shape23,58 is the convolution of a Lorentzian line shape (with FWHM linewidth ΓL) and a Gaussian line shape (with FWHM linewidth ΓG). It has a combined FWHM linewidth \({\varGamma }_{{\rm{F}}}\approx 0.5346{\varGamma }_{{\rm{L}}}\,+\) \(\sqrt{0.2166{{\varGamma }_{{\rm{L}}}}^{2}+{{\varGamma }_{{\rm{G}}}}^{2}}\) (ref. 59) and amplitude A and is centred on the resonance frequency ν0. A constant offset y0 is added to account for the experimentally observed signal background, resulting in five free parameters.
The Voigt doublet line shape is here defined as the sum of two Voigt line shapes with generally different resonance frequencies ν1 and ν2 and amplitudes A1 and A2 but equal Lorentzian and Gaussian linewidths23 (and a constant offset y0), resulting in seven free parameters. Its resonance frequency is the centre of mass of the constituent line shapes, that is, ν0 = (A1ν1 + A2ν2)/(A1 + A2).
Data analysis
The resonance frequency ν0 of each velocity group and detector of each line scan is determined by least-square fitting Voigt line shapes to the signal (for data with α0 = 0 mrad, that is, data groups G1A–G12). For data with α0 ≠ 0 mrad (data groups G13 and G14), in which the line can split into two Doppler components, the Voigt doublet line shape is used instead. For the fits, we assume the uncertainty on the signal yj, that is, the number of fluorescence photons detected at detuning j, to be the photon-number shot (Poissonian) noise, given by \(\sqrt{{y}_{j}}\) (because yj ≫ 1, the Poisson distribution can be approximated with a normal distribution).
We use reduced chi-squared \({\chi }_{{\rm{red}}}^{2}={\chi }^{2}/k\) (with k degrees of freedom) as a measure of the goodness of fit. The average \({\chi }_{{\rm{red}}}^{2}\) is expected to be 1 if the fitted line shapes exactly match the experimental line shapes and if the signal fluctuations are fully described by shot noise. We first discuss the Voigt fits (k = 25), for which we find \({\chi }_{{\rm{red}}}^{2}\) to be substantially larger than 1 (up to \({\chi }_{{\rm{red}}}^{2}\approx 4.6\) at our highest spectroscopy laser power) for all but the slowest velocity groups. Only for those velocity groups, which have a comparatively lower signal and Doppler broadening, does \({\chi }_{{\rm{red}}}^{2}\) approach 1 (\({\chi }_{{\rm{red}}}^{2}\approx 1.1\) for τ16 for all data groups).
We identify two distinct contributions to the increased \({\chi }_{{\rm{red}}}^{2}\). First, there are small model deviations between the fitted and experimental line shapes, caused by non-Gaussian (and non-Lorentzian) broadening and saturation effects not included in the former and clearly observable in the fit residuals (up to approximately 2% deviation; see black circles at the top of Extended Data Fig. 1b). Our simulated line shapes (both LFS and QI simulations) include these effects and consequently show much better agreement with the experimental line shapes (see dashed and solid lines in Extended Data Fig. 1b). We use this to determine the effect of the deviations on \({\chi }_{{\rm{red}}}^{2}\) by repeating the line shape fits using simulated data instead of experimental data. In this Monte Carlo simulation, the appropriate line shape simulation is scaled and offset to match the experimental data of each line scan and then shot noise is added to the simulated signal. The simulated \({\chi }_{{\rm{red}}}^{2}\) values reproduce, and therefore the deviations explain, approximately 70% of the excess (that is, \({\chi }_{{\rm{red}}}^{2}-1\)) in the experimental \({\chi }_{{\rm{red}}}^{2}\) values.
Second, there are fluctuations in the signal that lead to excess (technical) noise above shot noise, which we attribute to fluctuations in the atomic flux (of metastable 2S atoms). In particular, we identify correlations between \({\chi }_{{\rm{red}}}^{2}\) of a line scan and fluctuations of the nozzle temperature (Pearson correlation coefficient r = 0.45) and the preparation laser power (r = 0.26) during the line scan, both of which directly affect the number of 2S atoms reaching the spectroscopy region. The correlation with the spectroscopy laser power is much smaller (r = 0.03). We find that the simple assumption of a velocity-group-independent, 1% rms fluctuation of the signal from detuning to detuning explains, on average, the remaining excess of the experimental \({\chi }_{{\rm{red}}}^{2}\) values.
The \({\chi }_{{\rm{red}}}^{2}\) behaviour is very similar for the Voigt doublet fits (k = 23) to the α0 = ±7.5 mrad data (data group G13). For the α0 = ±12 mrad data (data group G14), the two Doppler components start to separate, particularly for fast velocity groups (Fig. 3c). This leads to large model deviations (up to 14%; see bottom of Extended Data Fig. 1b) from saturation effects, as some atoms interact with both spectroscopy laser beams (|Δ| ≲ Γ2P), whereas others only interact with one beam (|Δ| ≳ Γ2P). Consequently, \({\chi }_{{\rm{red}}}^{2}\) can exceed 20, which is largely explained (90% of excess) by the deviations, as again determined from the Monte Carlo simulation.
Asymmetric (about the line centre) deviations between the fitted and experimental line shapes, arising as a result of LFS and QI, do not substantially influence \({\chi }_{{\rm{red}}}^{2}\) because of their small size. However, both symmetric and asymmetric deviations lead to a sampling bias, as discussed in Section 2.5 in the Supplementary Methods.
The uncertainty of the experimental resonance frequency ν0 is estimated from the line shape fits assuming only shot noise, that is, the technical noise is not taken into account at this point. The value of ν0 for each velocity group and detector of each line scan is corrected for the LFS and QI shift by subtracting, respectively, νLFS and νQI, which are determined from the corresponding simulated line shapes (see Section 1.1 in the Supplementary Methods). The mean speed \(\overline{v}\) and rms speed \({\overline{v}}_{{\rm{rms}}}\) of each velocity group are also determined from these simulated line shapes (see Section 1.1 in the Supplementary Methods; Extended Data Table 2 gives the average value of \(\overline{v}\) for each velocity group). The second-order Doppler shift ΔνSOD is calculated using \({\overline{v}}_{{\rm{rms}}}\) (see Section 2.1 in the Supplementary Methods) and likewise subtracted from ν0.
Next, using the simulated values of the speed \(\bar{v}\), the Doppler shift extrapolation is performed to find the Doppler-free transition frequency νe and the Doppler slope κ. The values νe and κ found for each line scan are inherently strongly correlated (Pearson correlation coefficient re ≈ −0.97) and their uncertainties are found by propagating the uncertainties of ν0. Although the averaging process outlined below reduces this correlation, the correlation remains substantial (r = −0.78) between the averaged values of νe and κ of different data groups. The LFS, QI shift and second-order Doppler shift all depend on the speed of the 2S atoms, either indirectly through the interaction time or directly. This results in non-zero Doppler slopes if not corrected for. In particular, the LFS, on average, would lead to κ ≈ −5 Hz (m s−1)−1 and it is only by accounting for it that the experimentally determined values of κ are, on average, compatible with zero.
The \({\chi }_{{\rm{red}}}^{2}\) of the Doppler shift extrapolation (k = 14) is close to 1 (\({\chi }_{{\rm{red}}}^{2}=1.04(2)\) on average; standard deviation over data groups in parentheses), showing that the data are well described by a linear model and technical noise is small compared with shot noise. This contrasts with the excess noise observed in the line shape fits, as discussed above. We attribute this to the different timescales involved: the different velocity groups are recorded for about 100 μs and within 2,560 μs of each other, whereas the signal at each detuning is accumulated for 1 s before moving to the next detuning. That is, the Doppler shift extrapolation is mainly susceptible to technical noise on timescales of 100 μs, whereas the signal at different detunings is susceptible to technical noise on timescales of about 1 s, which is, for example, the timescale expected for nozzle temperature fluctuations.
Next we find the weighted mean of νe from the two detectors for each line scan (except when no data from the bottom detector are available, in which case data from the top detector are used), taking into account experimentally determined correlations (r = 0.36(15) on average). We attribute these correlations, which tend to increase with spectroscopy laser power and therefore the signal, again to the detuning-to-detuning fluctuations in the atomic flux that are common mode to both detectors (and the velocity groups). We then form the weighted mean of the detector-averaged νe for each freezing cycle in a given data group. The reduced chi-squared \({\chi }_{{\rm{red}},{\rm{FC}}}^{2}\) of this average is typically greater than 1 (\({\bar{\chi }}_{{\rm{red}},{\rm{FC}}}^{2}=1.44(23)\) on average), which we attribute again to fluctuations in the atomic flux. We account for this excess scatter by scaling the uncertainty of νe by the corresponding \(\sqrt{{\chi }_{{\rm{red}},{\rm{FC}}}^{2}}\) if \({\chi }_{{\rm{red}},{\rm{FC}}}^{2} > 1\). This procedure shifts the determined transition frequencies (by changing the weighting of the data) by less than 20 Hz, much smaller than the associated uncertainties. A weighted mean of the detector-averaged, uncertainty-scaled νe is formed for each data group and the remaining corrections are applied (Table 1 and Extended Data Table 4). Finally, the relevant data groups are (weighted) averaged to find the transition frequencies of the 2S–6P1/2 and 2S–6P3/2 transitions. Throughout the analysis, the weights of the averages are based only on the (scaled) statistical frequency uncertainty (including for the Doppler slope κ), with the (correlated) uncertainty of the corrections not included in the weights. The statistical weights w2S–6P of the data groups for the determination of the 2S–6P fine-structure centroid ν2S–6P are given in Extended Data Table 3.
The data analysis was blinded by adding a randomly chosen offset frequency to the transition frequencies. The offset frequency was only removed after all of the main systematic effects had been studied. Further small corrections, identified after the offset frequency was removed, resulted in a negligible shift of the determined transition frequencies of at most 10 Hz. The results of the data analysis (performed by L.M.) were confirmed by a second, independently implemented analysis (performed by V.W.).
Modelling of atomic beam and fluorescence line shape
The fluorescence line shape of the 2S–6P transition is modelled by a Monte Carlo simulation of the atomic beam as a set of atomic trajectories, the trajectories’ interaction with the 1S–2S preparation and 2S–6P spectroscopy lasers and their contribution to the fluorescence signal. The procedure is described in detail in Section 1.1 in the Supplementary Methods. Two complementary models describe the interaction with the spectroscopy laser, the QI model (see below and Section 1.2 in the Supplementary Methods) and the LFS model (see main text and Section 1.3 in the Supplementary Methods).
Speed distribution of atomic beam
We use the signal of the velocity groups as a time-of-flight measurement of the speed distribution of the atomic beam. To this end, we compare the experimental values of the line amplitudes A of the velocity groups to the values of A of line shapes simulated using a given speed distribution (see Section 1.1 in the Supplementary Methods). We find that the probability distribution of the speed v of atoms leaving the nozzle towards the 2S–6P spectroscopy region is well described by a modified Maxwell–Boltzmann flux distribution for a wide range of experimental parameters (see ref. 23 for details). The flux distribution is given by
in which \({\mathcal{N}}\) is a normalization constant. The extra factor exp(−vcut-off/v) accounts for the depletion of slower atoms through collisions inside the nozzle, inside the beam and with the background gas60,61,62. A similar depletion has been observed in our 2S–4P measurement and other atomic hydrogen beams3,63.
Using the above comparison of experimental data with simulations, vcut-off is found to be 50 m s−1 on average. Extended Data Table 3 lists the average value and variation for each data group. A substantial part of the variation is because of the fact that vcut-off typically increases during a freezing cycle (see Fig. 6.1 of ref. 23). This is because the accumulation of frozen hydrogen decreases the diameter of the nozzle, causing an increase in the gas pressure and therefore collisions inside the nozzle. We take this variation into account when determining (the uncertainty of) the mean speeds of the velocity groups and the simulation corrections (see Section 1.1 in the Supplementary Methods and Extended Data Table 5).
QI shift
We simulate QI-distorted line shapes with a model combining optical Bloch equations with simulations of the spatial detection efficiency, averaged over a set of trajectories representing the atomic beam (see Sections 1.1 and 1.2 in the Supplementary Methods). The QI shift is here defined as the centre frequency of a line shape fit to the simulated line shapes (see Section 1.1 in the Supplementary Methods). The validity of this approach was demonstrated by the excellent agreement between the observed and simulated QI shifts in our previous measurement of the 2S–4P transition2, in which the shifts were more than sevenfold larger because of the smaller detection solid angle and larger linewidth22.
The simulated QI shifts, as a function of polarization angle θL, are found to be at most 7.1 kHz and −4.1 kHz for the 2S–6P1/2 and 2S–6P3/2 transitions at our highest spectroscopy laser power (used to give upper limits here and below), respectively. At the magic angle θL = 56.5°, the shifts reduce to at most −0.87(54) kHz and 0.45(27) kHz (including ±3° alignment uncertainty; again at the highest spectroscopy laser power). When we account for data taken at θL = 146.5° (see below) and at lower laser powers (with statistical weights as given in Extended Data Table 3), we obtain the overall simulated shifts (−0.25(36) kHz and 0.05(15) kHz) given in the main text. The cancellation inherent in the 2S–6P fine-structure centroid reduces the shift to at most −0.37 kHz at any polarization angle and to below 0.01 kHz at around θL = 56.5° (below the otherwise negligible ac-Stark shift; see Section 1.2 in the Supplementary Methods). Including all data results in the shift of −0.05(2) kHz given in the main text.
The magic angle θL = 56.5° used in the measurement was determined with simulations before the measurement began, whereas more refined simulations of the spatial detection efficiency (completed after the measurement and used for all simulation results given here) result in a magic angle of approximately 52°. Moreover, the magic angle also slightly shifts with laser power (by up to 2° for the powers used here; see Section 1.2 in the Supplementary Methods). However, the QI shifts are still strongly suppressed at the magic angle used, despite it being slightly different from the optimal value.
To test our QI model and simulations, a limited amount of data were taken at θL = 146.5° (Fig. 4 and Extended Data Table 3), that is, orthogonal to the magic polarization angle, which we compare with the data taken at θL = 56.5°. For the 2S–6P1/2 transition, the difference in Doppler-free transition frequency of νe(θL = 146.5°) − νe(θL = 56.5°) = −0.01(1.69) kHz is in excellent agreement with 0 after correcting for a differential QI shift of 3.44(92) kHz and a differential Doppler shift of −1.42(1.42) kHz. We may also compare the experimental and simulated line shape distortions at θL = 146.5°, in which the QI shift dominates over the LFS, by comparing the asymmetry of the experimental and simulated fit residuals (Extended Data Fig. 1). We find excellent agreement, especially at detunings larger than the linewidth, for which the line shape distortions from QI are largest.
For the 2S–6P3/2 transition, we find a moderate (2.3 standard deviations) tension with a difference of νe(θL = 146.5°) − νe(θL = 56.5°) = 4.08(1.77) kHz in the transition frequency, having corrected for a differential QI shift of −1.78(47) kHz. The removed differential Doppler shift is likewise significantly non-zero (−3.66(1.68) kHz). This correlation is consistent with, but not conclusive evidence for, the non-zero difference being caused by random errors affecting the Doppler shift extrapolation. This conclusion is also supported by the fact that there is no tension in the velocity-group-averaged resonance frequency, that is, assuming zero Doppler shift (see Fig. 6.9 of ref. 23). Furthermore, we find the line shape distortions at θL = 146.5° to be compatible with our QI simulations but incompatible with a QI shift of opposite sign and twice the magnitude as implied by the measured difference (Extended Data Fig. 1). We therefore conclude that the tension probably results from random errors in the determination of the resonance frequencies.
2S–6P fine-structure centroid, 6P fine-structure splitting and HFS corrections
The 2S–6P fine-structure centroid ν2S–6P is the transition frequency from the 2S HFS centroid to the 6P fine-structure centroid. It is determined from the two measured transition frequencies ν1/2 and ν3/2 for the transitions from the \({2{\rm{S}}}_{1/2}^{F=0}\) level to the \({6{\rm{P}}}_{1/2}^{F=1}\) level (2S–6P1/2 transition) and the \({6{\rm{P}}}_{3/2}^{F=1}\) level (2S–6P3/2 transition), respectively, by first correcting ν1/2 and ν3/2 for the 2S and 6P HFS and then averaging the corrected ν1/2 and ν3/2 weighted by their fine-structure multiplicity ratio of 1:2 (equivalent to the ratio of the line strengths ∝μ2 of the 2S–6P1/2 and 2S–6P3/2 transitions, in which μ is the dipole moment) to find the 6P fine-structure centroid. This results in equation (7), with the HFS corrections included in ΔνHFS(ν2S–6P), as detailed below.
The hyperfine interaction splits the fine-structure levels 2S1/2, 6P1/2 and 6P3/2 into doublets64,65. The relevant HFS levels \({2{\rm{S}}}_{1/2}^{F=0}\), \({6{\rm{P}}}_{1/2}^{F=1}\) and \({6{\rm{P}}}_{3/2}^{F=1}\) are shifted from the fine-structure levels by the HFS energies \(\Delta {\nu }_{{\rm{HFS}}}({2{\rm{S}}}_{1/2}^{F=0})\), \(\Delta {\nu }_{{\rm{HFS}}}({6{\rm{P}}}_{1/2}^{F=1})\) and \(\Delta {\nu }_{{\rm{HFS}}}({6{\rm{P}}}_{3/2}^{F=1})\), respectively (see Fig. 6.11 of ref. 23 for the relevant level scheme). The value of \(\Delta {\nu }_{{\rm{HFS}}}({2{\rm{S}}}_{1/2}^{F=0})\) and its uncertainty are obtained from a measurement66 of the 2S HFS splitting ΔνHFS(2S1/2) as
\(\Delta {\nu }_{{\rm{HFS}}}({6{\rm{P}}}_{1/2}^{F=1})\) and \(\Delta {\nu }_{{\rm{HFS}}}({6{\rm{P}}}_{3/2}^{F=1})\) can be calculated as detailed in refs. 64,65. They include small corrections from off-diagonal elements in the HFS Hamiltonian, which mix HFS levels with the same value of F but different values of J. Because only the F = 1 level of each HFS doublet is shifted by this effect, the centres of gravity of the HFS doublets are shifted by \(\Delta {\nu }_{{\rm{HFS}}}^{{\rm{o}}.{\rm{d}}.}({6{\rm{P}}}_{1/2})\) and \(\Delta {\nu }_{{\rm{HFS}}}^{{\rm{o}}.{\rm{d}}.}({6{\rm{P}}}_{3/2})\). Using the values for the 6P HFS splittings ΔνHFS(6P1/2), ΔνHFS(6P3/2) and the values for \(\Delta {\nu }_{{\rm{HFS}}}^{{\rm{o}}.{\rm{d}}.}({6{\rm{P}}}_{1/2})\), \(\Delta {\nu }_{{\rm{HFS}}}^{{\rm{o}}.{\rm{d}}.}({6{\rm{P}}}_{3/2})\) given in Table 1 of ref. 64 (see also equations (29) and (30) in ref. 65 and comments therein), we find
\(\Delta {\nu }_{{\rm{HFS}}}({6{\rm{P}}}_{1/2}^{F=1})\) and \(\Delta {\nu }_{{\rm{HFS}}}({6{\rm{P}}}_{3/2}^{F=1})\) are assumed to be fully correlated.
The resulting HFS correction of the 2S–6P fine-structure centroid ν2S–6P is
in which we rounded to the nearest 10 Hz, as done for all corrections.
SM predictions of transition frequencies and bound-state QED test
We find the SM prediction ν2S–6P,SM of the 2S–6P fine-structure centroid using
in which EnlJ are the fine-structure level energies from equation (1) and ν1S–2S is the measured frequency of the 1S–2S hyperfine centroid18. This parametrization removes the explicit dependence of EnlJ on R∞. Using the muonic value of rp (ref. 15), we find the value of ν2S–6P,SM given in equation (10).
Extended Data Table 1 lists the contributions to ν2S–6P,SM. Along with the Dirac eigenvalue (\(c{R}_{\infty }{f}_{{nJ}}^{\mathrm{Dirac}}\) of equation (1); equation (30) in ref. 1), we list the individual QED corrections (\(c{R}_{{\infty }}{f}_{{nlJ}}^{\mathrm{QED}}\) and \(c{R}_{\infty }{\delta }_{l0}({C}_{\mathrm{NS}}/{n}^{3})\,{r}_{{\rm{p}}}^{2}\) of equation (1); equations (32)–(64) in ref. 1), with the sum of the corrections corresponding to the Lamb shift \({\mathcal{L}}\) (\({\mathcal{L}}(nlJ)\) for a single fine-structure level; \({\mathcal{L}}(2{\rm{S}}-6{\rm{P}})=(1/3){\mathcal{L}}(6,1,1/2)+(2/3){\mathcal{L}}(6,1,3/2)-{\mathcal{L}}(2,0,1/2)\) for the 2S–6P fine-structure centroid). Extended Data Table 1 also gives the QED-only uncertainty, which we define as the uncertainty excluding contributions from rp, α, mp/me and ν1S–2S. All listed QED corrections scale as cR∞C/n3 in leading order for S-states, in which cR∞C is the corresponding leading-order QED correction to the 1S level. This includes the (leading order, ∝α2 × α2, with the first α2 factor absorbed in R∞ = α2mec/2h in equation (1)) nuclear size correction highlighted in equation (1), for which \(C={C}_{{\rm{NS}}}{r}_{{\rm{p}}}^{2}\). Therefore, the corrections between different S-states are correlated, which is taken into account in the uncertainty of ν2S–6P,SM. Non-S-states have generally much smaller corrections (for example, the fractional corrections are 1.3 × 10−6 of the 2S binding energy but 4.4 × 10−9 of the 6P binding energy) and, in particular, their nuclear size corrections are at present negligible (at most 7 × 10−16 of the 6P binding energy). There is no correlation between the different QED corrections and we find the total QED-only uncertainty of 179 Hz by adding the uncertainties of the corrections in quadrature. Overall, E(2S–6P) and E(1S–2S) are highly correlated (r = 0.995 if only considering QED-only uncertainty).
The other dominant source of uncertainty for ν2S–6P,SM is the muonic value of rp, which contributes 138 Hz to the uncertainty of ν2S–6P,SM. Although the uncertainty of rp itself partly originates from QED corrections15, the QED predictions for the energy levels of hydrogen and muonic hydrogen are uncorrelated at the present level of accuracy1 and we therefore treat the QED-only uncertainty and the uncertainty from rp as uncorrelated. The uncertainty contributions from α (1.7 Hz), mp/me (0.1 mHz) and ν1S–2S (3 Hz) are negligible. In total, this results in an uncertainty of 226 Hz on ν2S–6P,SM.
It is instructive to find the sensitivity of a prediction νSM, derived in the same way as ν2S–6P,SM, to changes in C. Making use of the 1/n3 scaling of the QED corrections (and using the approximation EnlJ ≈ chR∞(−1/n2 + δl0C/n3), that is, ignoring all non-leading-order corrections to \({f}_{{nJ}}^{{\rm{Dirac}}}\) and all QED corrections except C), we find
in which n′, l′ → n, l is the transition to be predicted (for example, 2S–6P transition for ν2S–6P,SM) and \({\tilde{n}}^{{\prime} },\,{\tilde{l}}^{{\prime} }\to \tilde{n},\,\tilde{l}\) is the measured transition used as input (for example, 1S–2S transition for ν2S–6P,SM). The sensitivity of ν2S–6P,SM is ∂(ν2S–6P,SM/cR∞)/∂C = 0.134. By contrast, using the 1S–3S transition instead of the 2S–6P transition leads to a 1.8-fold lower sensitivity (∂(νSM/cR∞)/∂C = 0.074) because the relative contribution of C to the 1S–3S transition frequency is approximately twice as large as for the 2S–6P transition frequency. Combined with its 1.5-fold smaller uncertainty, ν2S–6P therefore tests C with 2.7-fold higher precision than the 1S–3S measurement5. Because the nuclear size correction \(c{R}_{\infty }{C}_{\mathrm{NS}}\,{r}_{{\rm{p}}}^{2}/{n}^{3}\) scales as 1/n3 like the other bound-state QED corrections cR∞C/n3, comparing rp found from atomic hydrogen and from muonic hydrogen is a test of bound-state QED, as any missing or miscalculated terms in atomic hydrogen of the form cR∞C/n3 would lead to a discrepancy. Furthermore, the precision with which rp can be extracted from a given combination of measurements is therefore a direct measure of the precision of the implied QED test. Note that, because Extended Data Table 1 lists the corrections of ν2S–6P,SM, which are approximately −cR∞C/23, the sensitivity relative to the listed corrections is −8 × 0.134 = −1.072. For example, artificially removing the hadronic vacuum polarization shifts ν2S–6P,SM by −1.072 × 425.1 Hz = −0.46 kHz, or approximately one experimental σ of ν2S–6P.
We may also extract the 1S Lamb shift \({\mathcal{L}}(1{\rm{S}})\) by writing equation (1) as
in which \({\mathcal{L}}(1{\rm{S}})\equiv {\mathcal{L}}(1,0,1/2)=c{R}_{\infty }({f}_{1,0,1/2}^{\mathrm{QED}}+{C}_{\mathrm{NS}}{r}_{{\rm{p}}}^{2})\) and \(\delta {\mathcal{L}}(nlJ)={\mathcal{L}}(nlJ)\,-\) \({\delta }_{0l}{\mathcal{L}}(1{\rm{S}})/{n}^{3}\) is the state-specific Lamb shift. By combining equation (19) for the 2S–6P and 1S–2S transitions and using ν2S–6P and ν1S–2S as inputs, we find \({{\mathcal{L}}}_{\exp }(1{\rm{S}})={\mathrm{8,172,744.13}(14)}_{{\rm{QED}}}{(3.56)}_{\exp }\,{\rm{kHz}}\,=\) \(\mathrm{8,172,744.1}(3.6)\,{\rm{kHz}}\). Terms proportional to state-specific Lamb shifts contribute 224 MHz to \({{\mathcal{L}}}_{\exp }(1{\rm{S}})\). The uncertainty is dominated by the uncertainty of ν2S–6P (‘exp’), with the QED uncertainty of the state-specific Lamb shifts (‘QED’) being much smaller and contributions from ν1S–2S (22 Hz) and physical constants (10 Hz from α; mp/me, rp, R∞ less than 0.1 Hz) negligible. As expected from the discussion above, \({{\mathcal{L}}}_{\exp }(1{\rm{S}})\) is 2.7-fold more precise than the next best determination using the 1S–3S measurement5.
The corresponding QED prediction of the 1S Lamb shift is found by combining equation (1) with the muonic value of rp, giving \({{\mathcal{L}}}_{{\rm{QED}}}(1{\rm{S}})\,=\)\({\mathrm{8,172,744.1}(1.3)}_{{\rm{QED}}}{(1.0)}_{{{\rm{r}}}_{{\rm{p}}}}\,{\rm{kHz}}=\mathrm{8,172,744.1}(1.7)\,{\rm{kHz}}\). Its uncertainty is dominated by both QED uncertainty and the uncertainty of rp, whereas other sources are negligible (3 Hz from α; mp/me, R∞ less than 0.01 Hz). \({{\mathcal{L}}}_{\exp }(1{\rm{S}})\) and \({{\mathcal{L}}}_{{\rm{QED}}}(1{\rm{S}})\) are uncorrelated, as their respective QED uncertainties are uncorrelated1. They are in excellent agreement and test the 1S Lamb shift and thereby bound-state QED corrections to 0.5 ppm. Complementary tests of bound-state QED in strong electromagnetic fields with highly charged ions at present achieve a relative precision of 1 × 10−4 but can be more sensitive to terms of high order in (Zα) (Z is the nuclear charge number; omitted elsewhere here because Z = 1 for atomic hydrogen)47,67,68.
Finally, the QED prediction ΔνFS,QED(6P) for the 6P fine-structure splitting between the \({6{\rm{P}}}_{1/2}^{F=1}\) and \({6{\rm{P}}}_{3/2}^{F=1}\) levels is (using equations (14) and (15))
The nuclear size corrections in ΔνFS,QED(6P) are of order α2 × α4 and α2 × α5 and amount to 70 mHz, as the leading-order correction term \(c{R}_{{\infty }}{C}_{\mathrm{NS}}{r}_{{\rm{p}}}^{2}\propto {\alpha }^{2}\times {\alpha }^{2}\) of equation (1) and corrections of order α2 × α3 only apply to S-states1.
dc-Stark shift
Static stray electric fields in the 2S–6P spectroscopy region can lead to a dc-Stark shift of the observed transition frequency. Here the dc-Stark shift Δνdc is well described as quadratic in strength E = |E| of the electric field E in all relevant experimental regimes, that is,
in which βdc is the applicable quadratic dc-Stark shift coefficient. We distinguish two experimentally relevant field strength regimes: the stray-field regime (E < 1 V m−1), which covers the range of stray electric fields present in the experiment, and the bias-field regime (E = 10–45 V m−1), which covers the range of applied bias fields used to determine the stray electric fields. Notably, although equation (21) is found to approximately hold in either regime, the coefficient βdc may differ, as is the case for the 2S–6P3/2 transition (Section 1.4 in the Supplementary Methods). The quadratic behaviour arises because the energy levels contributing to the net shift are well separated in energy from the perturbed level in either of our regimes. However, the shift of the involved levels between the regimes can lead to substantially different energy separations and thereby different values of βdc.
The Faraday cage surrounding the spectroscopy region (Fig. 2b) shields it from external electric fields, including those used to draw the photoelectrons to the channel electron multipliers (whose input surfaces are held at 270 V; see Section 4.6.2 of ref. 23). The colloidal graphite coating on all surfaces of the Faraday cage suppresses stray electric fields from the surfaces themselves (from charged oxide layers, contact potentials from dissimilar conductors or local changes in the work function) by, ideally, forming a uniform conductive layer with a uniform work function (see Section 4.6.1 of ref. 23). However, effects such as imperfect shielding of external fields, imperfect graphite coating or temperature gradients leading to thermoelectric voltages or gradients in the work function can prevent the complete suppression of stray fields.
To address this, we measure the stray electric field by applying voltages to the six electrodes forming the Faraday cage (meshes at the top and bottom and four equal-sized segments of the cylinder wall) and using the atoms themselves as field sensors (see Section 4.6.7 of ref. 23), similar to approaches in refs. 5,69. Equal and opposite bias voltages are applied to opposing electrodes to create a bias electric field \({\bf{E}}={E}_{i}\hat{{\bf{i}}}\) with strength |Ei| = 10–45 V m−1 along the given direction (i = x, y, z, as defined in Fig. 2b). By measuring the shifted 2S–6P transition frequency νe(Ei) from a fit to the fluorescence line shape of several line scans with opposite-polarity values of Ei and using the quadratic dependence νe(Ei) = βdc,i(Ei − ΔEi)2 + νe(Ei = 0 V m−1), we determine the stray electric field component ΔEi along the given direction. This measurement also yields experimental values of βdc,i at the bias field strengths for each transition. Extended Data Fig. 3 shows examples of such stray field measurements for both the 2S–6P1/2 and 2S–6P3/2 transitions, along with simulation results (see Section 1.4 in the Supplementary Methods). On average, each measurement includes eight line scans with non-zero bias field and there are 98 and 21 measurements for the 2S–6P1/2 and 2S–6P3/2 transitions, respectively. The stray electric field components determined during the three measurement runs are shown in Extended Data Fig. 4, in which we only include measurements using the 2S–6P1/2 transition because using the 2S–6P3/2 transition gives compatible, but substantially larger uncertainty, values of ΔEi. Overall, the stray electric field has a strength less than 1 V m−1 and predominantly points along the axis of the detector cylinder (the y-direction).
Using these stray field measurements, we estimate the dc-Stark shift correction and its uncertainty for the transition frequencies. The determination of the relevant quadratic dc-Stark shift coefficients in the stray-field regime by the use of experimentally verified simulations is described in Section 1.4 in the Supplementary Methods. The weighted mean and standard deviation of each stray electric field component ΔEi are determined over each of the three measurement runs, using the 2S–6P1/2 stray field measurements, as shown in Extended Data Fig. 4. We treat each measurement run separately to account for differences in the detector assembly (the meshes in the detector cylinder were replaced between run A and run B; see Section 4.6.1 of ref. 23) and the month-long breaks in between the runs. Furthermore, we opt to use the standard deviation of the stray field components to estimate the dc-Stark shift uncertainty because we believe at least part of the variation of the stray fields to be physical in origin (as opposed to unaccounted excess measurement scatter) but are not confident that the resulting variations in the dc-Stark shift will average out. With this, we find dc-Stark shifts Δνdc of 0.20(21) kHz and −0.02(6) kHz for the 2S–6P1/2 and 2S–6P3/2 transitions, respectively, in which the uncertainty is dominated by the stray electric fields in the first case and by coefficients in the second case, and shifts for both transitions mainly determined by the dominant stray electric field component along the y-direction. ν1/2 and ν3/2 have been corrected for the dc-Stark shift by subtracting the corresponding value. The Pearson correlation coefficient of the correction between the 2S–6P1/2 and 2S–6P3/2 transitions is found to be r = −0.30 by propagation of uncertainty.
Data availability
The experimental data and simulation results that support the findings of this study are available from Zenodo70.
Code availability
All code used for the data analysis and simulations of this study is available from the corresponding author.
References
Mohr, P. J., Newell, D. B., Taylor, B. N. & Tiesinga, E. CODATA recommended values of the fundamental physical constants: 2022. Rev. Mod. Phys. 97, 025002 (2025).
Beyer, A. et al. The Rydberg constant and proton size from atomic hydrogen. Science 358, 79–85 (2017).
Fleurbaey, H. et al. New measurement of the 1S–3S transition frequency of hydrogen: contribution to the proton charge radius puzzle. Phys. Rev. Lett. 120, 183001 (2018).
Bezginov, N. et al. A measurement of the atomic hydrogen Lamb shift and the proton charge radius. Science 365, 1007–1012 (2019).
Grinin, A. et al. Two-photon frequency comb spectroscopy of atomic hydrogen. Science 370, 1061–1065 (2020).
Brandt, A. D. et al. Measurement of the 2S1/2–8D5/2 transition in hydrogen. Phys. Rev. Lett. 128, 023001 (2022).
Scheidegger, S. & Merkt, F. Precision-spectroscopic determination of the binding energy of a two-body quantum system: the hydrogen atom and the proton-size puzzle. Phys. Rev. Lett. 132, 113001 (2024).
Pohl, R. et al. The size of the proton. Nature 466, 213–216 (2010).
Antognini, A. et al. Proton structure from the measurement of 2S-2P transition frequencies of muonic hydrogen. Science 339, 417–420 (2013).
Parker, R., Yu, C., Weicheng, Z., Brian, E. & Müller, H. Measurement of the fine-structure constant as a test of the Standard Model. Science 360, 191–195 (2018).
Morel, L., Yao, Z., Cladé, P. & Guellati-Khélifa, S. Determination of the fine-structure constant with an accuracy of 81 parts per trillion. Nature 588, 61–65 (2020).
Fan, X., Myers, T. G., Sukra, B. A. D. & Gabrielse, G. Measurement of the electron magnetic moment. Phys. Rev. Lett. 130, 071801 (2023).
Heiße, F. et al. High-precision measurement of the proton’s atomic mass. Phys. Rev. Lett. 119, 033001 (2017).
Patra, S. et al. Proton-electron mass ratio from laser spectroscopy of HD+ at the part-per-trillion level. Science 369, 1238–1241 (2020).
Pachucki, K. et al. Comprehensive theory of the Lamb shift in light muonic atoms. Rev. Mod. Phys. 96, 015001 (2024).
Mohr, P. J., Newell, D. B. & Taylor, B. N. CODATA recommended values of the fundamental physical constants: 2014. Rev. Mod. Phys. 88, 035009 (2016).
Antognini, A., Hagelstein, F. & Pascalutsa, V. The proton structure in and out of muonic hydrogen. Annu. Rev. Nucl. Part. Sci. 72, 389–418 (2022).
Parthey, C. G. et al. Improved measurement of the hydrogen 1S–2S transition frequency. Phys. Rev. Lett. 107, 203001 (2011).
de Beauvoir, B. et al. Metrology of the hydrogen and deuterium atoms: determination of the Rydberg constant and Lamb shifts. Eur. Phys. J. D 12, 61–93 (2000).
Beyer, A. et al. Precision spectroscopy of 2S–nP transitions in atomic hydrogen for a new determination of the Rydberg constant and the proton charge radius. Phys. Scr. 2015, 014030 (2015).
Horbatsch, M. & Hessels, E. A. Shifts from a distant neighboring resonance. Phys. Rev. A 82, 052519 (2010).
Udem, T. et al. Quantum interference line shifts of broad dipole-allowed transitions. Ann. Phys. 531, 1900044 (2019).
Maisenbacher, L. Precision Spectroscopy of the 2S-nP Transitions in Atomic Hydrogen. PhD thesis, Ludwig-Maximilians-Universität München (2020). Revised version, https://arxiv.org/abs/2512.20543 (2025).
Beyer, A. et al. Active fiber-based retroreflector providing phase-retracing anti-parallel laser beams for precision spectroscopy. Opt. Express 24, 17470–17485 (2016).
Wirthl, V. et al. Improved active fiber-based retroreflector with intensity stabilization and a polarization monitor for the near UV. Opt. Express 29, 7024–7048 (2021).
Wirthl, V. et al. Improved active fiber-based retroreflector with intensity stabilization and a polarization monitor for the near UV: erratum. Opt. Express 30, 7340–7341 (2022).
Kapitza, P. L. & Dirac, P. A. M. The reflection of electrons from standing light waves. Math. Proc. Camb. Philos. Soc. 29, 297–300 (1933).
Salomon, C., Dalibard, J., Aspect, A., Metcalf, H. & Cohen-Tannoudji, C. Channeling atoms in a laser standing wave. Phys. Rev. Lett. 59, 1659–1662 (1987).
Minardi, F. et al. Frequency shift in saturation spectroscopy induced by mechanical effects of light. Phys. Rev. A 60, 4164–4167 (1999).
Zheng, X., Sun, Y. R., Chen, J.-J., Wen, J.-L. & Hu, S.-M. Light-force-induced shift in laser spectroscopy of atomic helium. Phys. Rev. A 99, 032506 (2019).
Wen, J.-L. et al. Doppler-free spectroscopy of an atomic beam probed in traveling-wave fields. Phys. Rev. A 107, 042811 (2023).
Bloch, I., Hänsch, T. W. & Essilinger, T. Measurement of the spatial coherence of a trapped Bose gas at the phase transition. Nature 403, 166–170 (2000).
Born, M. et al. Principles of Optics: Electromagnetic Theory of Propagation, Interference and Diffraction of Light 7th edn (Cambridge Univ. Press, 1999).
Arndt, M. et al. Wave–particle duality of C60 molecules. Nature 401, 680–682 (1999).
Metcalf, H. J. & van der Straten, P. Laser Cooling and Trapping (Springer, 1999).
Hillery, M., O’Connell, R., Scully, M. & Wigner, E. Distribution functions in physics: fundamentals. Phys. Rep. 106, 121–167 (1984).
Mølmer, K., Castin, Y. & Dalibard, J. Monte Carlo wave-function method in quantum optics. J. Opt. Soc. Am. B 10, 524–538 (1993).
Karshenboim, S. G., Ozawa, A., Shelyuto, V. A., Szafron, R. & Ivanov, V. G. The Lamb shift of the 1s state in hydrogen: two-loop and three-loop contributions. Phys. Lett. B 795, 432–437 (2019).
Yerokhin, V. A., Harman, Z. & Keitel, C. H. Two-loop electron self-energy for low nuclear charges. Phys. Rev. Lett. 133, 251803 (2024).
Delaunay, C. et al. Self-consistent extraction of spectroscopic bounds on light new physics. Phys. Rev. Lett. 130, 121801 (2023).
Potvliege, R. M., Nicolson, A., Jones, M. P. A. & Spannowsky, M. Deuterium spectroscopy for enhanced bounds on physics beyond the standard model. Phys. Rev. A 108, 052825 (2023).
Wirthl, V. Precision Spectroscopy of the 2S-6P Transition in Atomic Deuterium. PhD thesis, Ludwig-Maximilians-Universität München (2022).
Yzombard, P., Thomas, S., Julien, L., Biraben, F. & Nez, F. 1S–3S cw spectroscopy of hydrogen/deuterium atom. Eur. Phys. J. D 77, 23 (2023).
Bullis, R. G., Tavis, W. L., Weiss, M. R. & Yost, D. C. Narrow resonances in Rydberg hydrogen spectroscopy. Phys. Rev. A 110, 052807 (2024).
Moreno, J. et al. Toward XUV frequency comb spectroscopy of the 1S–2S transition in He+. Eur. Phys. J. D 77, 67 (2023).
Alighanbari, S., Giri, G. S., Constantin, F. L., Korobov, V. I. & Schiller, S. Precise test of quantum electrodynamics and determination of fundamental constants with HD+ ions. Nature 581, 152–158 (2020).
Morgner, J. et al. Stringent test of QED with hydrogen-like tin. Nature 622, 53–57 (2023).
Gao, H. & Vanderhaeghen, M. The proton charge radius. Rev. Mod. Phys. 94, 015002 (2022).
Djukanovic, D. et al. Precision calculation of the electromagnetic radii of the proton and neutron from lattice QCD. Phys. Rev. Lett. 132, 211901 (2024).
Jentschura, U. D. & Adhikari, C. M. Long-range interactions for hydrogen: 6P–1S and 6P–2S systems. Atoms 5, 48 (2017).
Jentschura, U. D., Adhikari, C. M., Dawes, R., Matveev, A. & Kolachevsky, N. Pressure shifts in high-precision hydrogen spectroscopy. I. Long-range atom–atom and atom–molecule interactions. J. Phys. B At. Mol. Opt. Phys. 52, 075005 (2019).
Matveev, A., Kolachevsky, N., Adhikari, C. M. & Jentschura, U. D. Pressure shifts in high-precision hydrogen spectroscopy: II. Impact approximation and Monte-Carlo simulations. J. Phys. B At. Mol. Opt. Phys. 52, 075006 (2019).
Alnis, J., Matveev, A., Kolachevsky, N., Udem, T. & Hänsch, T. W. Subhertz linewidth diode lasers by stabilization to vibrationally and thermally compensated ultralow-expansion glass Fabry-Pérot cavities. Phys. Rev. A 77, 053809 (2008).
Kolachevsky, N. et al. Low phase noise diode laser oscillator for 1S–2S spectroscopy in atomic hydrogen. Opt. Lett. 36, 4299–4301 (2011).
Haas, M. et al. Two-photon excitation dynamics in bound two-body Coulomb systems including ac Stark shift and ionization. Phys. Rev. A 73, 052501 (2006).
Holzwarth, R. et al. Optical frequency synthesizer for precision spectroscopy. Phys. Rev. Lett. 85, 2264–2267 (2000).
Giunta, M. et al. Real-time phase tracking for wide-band optical frequency measurements at the 20th decimal place. Nat. Photon. 14, 44–49 (2020).
Loudon, R. The Quantum Theory of Light 3rd edn (Oxford Univ. Press, 2000).
Olivero, J. J. & Longbothum, R. L. Empirical fits to the Voigt line width: a brief review. J. Quant. Spectrosc. Radiat. Transfer 17, 233–236 (1977).
Olander, D. R., Jones, R. H. & Siekhaus, W. J. Molecular beam sources fabricated from multichannel arrays. IV. Speed distribution in the centerline beam. J. Appl. Phys. 41, 4388–4391 (1970).
Estermann, I., Simpson, O. C. & Stern, O. The free fall of atoms and the measurement of the velocity distribution in a molecular beam of cesium atoms. Phys. Rev. 71, 238–249 (1947).
Manista, E. J. & Center, L. R. Distortion of Atomic-Beam Velocity Distributions Due to Classical Hard-Sphere Gas Scattering (National Aeronautics and Space Administration, 1965).
Cooper, S. F., Brandt, A. D., Rasor, C., Burkley, Z. & Yost, D. C. Cryogenic atomic hydrogen beam apparatus with velocity characterization. Rev. Sci. Instrum. 91, 013201 (2020).
Kramida, A. A critical compilation of experimental data on spectral lines and energy levels of hydrogen, deuterium, and tritium. At. Data Nucl. Data Tables 96, 586–644 (2010).
Horbatsch, M. & Hessels, E. A. Tabulation of the bound-state energies of atomic hydrogen. Phys. Rev. A 93, 022513 (2016).
Kolachevsky, N. et al. Measurement of the 2S hyperfine interval in atomic hydrogen. Phys. Rev. Lett. 102, 213002 (2009).
Sturm, S. et al. G-factor measurement of hydrogenlike 28Si13+ as a challenge to QED calculations. Phys. Rev. A 87, 030501 (2013).
Morgner, J. et al. Testing interelectronic interaction in lithium-like tin. Science 388, 945–949 (2025).
Scheidegger, S., Agner, J. A., Schmutz, H. & Merkt, F. Metrology of Rydberg states of the hydrogen atom. Phys. Rev. A 108, 042803 (2023).
Maisenbacher, L. Dataset for the hydrogen 2S-6P transition measurement at MPQ. Zenodo https://doi.org/10.5281/zenodo.15874385 (2026).
Acknowledgements
We thank S. G. Karshenboim for helpful discussions and W. Simon, K. Linner and H. Brückner for technical support. Computations were performed on the HPC systems at the Max Planck Computing and Data Facility. This work was supported by European Research Council (ERC) grant H-SPECTR (grant agreement ID 101141942). L.M. acknowledges a Feodor Lynen Fellowship from the Alexander von Humboldt Foundation. V.W. acknowledges support from the International Max Planck Research School for Advanced Photon Science (IMPRS-APS). A.M. and A.G. acknowledge support from the German Research Foundation (DFG) (project IDs 390524307 and EXC-2111-390814868, respectively). R.P. acknowledges support from the PRISMA+ Cluster of Excellence (EXC 2118/1) funded by the DFG in the German Excellence Strategy (project ID 39083149). T.W.H. acknowledges support from the Carl Friedrich von Siemens Foundation and the Max Planck Foundation.
Funding
Open access funding provided by Max Planck Society.
Author information
Authors and Affiliations
Contributions
L.M. and V.W. set up, maintained and performed the experiment. L.M. wrote the software for and set up the data acquisition system. L.M. conducted the data analysis, which V.W. independently reproduced. A.M. and L.M. performed modelling and simulations. T.U., L.M. and V.W. evaluated the QED predictions and tests. L.M. prepared the manuscript. All authors (L.M., V.W., A.M., A.G., R.P., T.W.H. and T.U.) contributed to the discussion and analysis of the systematic uncertainties and edited the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature thanks Saïda Guellati-Khelifa and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data figures and tables
Extended Data Fig. 1 Test of line shape models.
a, Detuning asymmetry of fit residuals \({\varepsilon }_{\Delta }^{-}\) at spectroscopy laser detuning Δ for four different experimental settings, with experimental data (black circles), LFS simulation (dashed blue line) and QI simulation (solid orange line) shown. The sum of \({\varepsilon }_{\Delta }^{-}\) for LFS and QI simulations is also shown (dotted green line). The normalized fit residuals εΔ are given by (yΔ − yfit,Δ)/yfit,Δ, in which yΔ is the experimental or simulated signal and yfit,Δ the corresponding value from the Voigt or Voigt doublet line shape fit to the signal, and their detuning asymmetry and symmetry are defined as \({\varepsilon }_{\Delta }^{-}=({\varepsilon }_{\Delta }-{\varepsilon }_{-\Delta })/2\) and \({\varepsilon }_{\Delta }^{+}=({\varepsilon }_{\Delta }+{\varepsilon }_{-\Delta })/2\). Note that the curves have been offset (dashed grey horizontal lines) to enhance visibility. Top, for data taken at zero atomic beam offset angle α0 and at magic polarization angle θL = 56.5°, in which the QI effect is strongly suppressed, the experimentally observed asymmetry \({\varepsilon }_{\Delta }^{-}\) is well explained by the LFS simulation. All relevant data from measurement run B are used. See Fig. 2c for an example line shape. Second from top, for α0 = ±12 mrad (and magic polarization angle; see Fig. 3c for an example line shape), the LFS simulation predicts a different asymmetry compared with zero α0, which again matches the experiment well. This difference in asymmetry is directly observed in the transition frequency as shown in Fig. 3d. Third from top, for a non-magic polarization angle of θL = 146.5°, the QI effect leads to a strong asymmetry at large detunings Δ ≳ 5 MHz ≈ Γ, which is observed in the experimental data (shown for the 2S–6P1/2 transition; α0 = 0 mrad). At smaller detunings, both the QI effect and LFS contribute to the asymmetry and the sum of their asymmetries describes the experimental data well. Bottom, for the non-magic polarization angle for the 2S–6P3/2 transition, the asymmetry from the QI effect has the opposite sign and is about twofold smaller compared with the 2S–6P1/2 transition. b, Detuning symmetry of fit residuals \({\varepsilon }_{\Delta }^{+}\) for the first two cases of panel a. The deviations are caused by non-Gaussian broadening and saturation effects, which are not included in the fitted line shape models but are included in the simulations. Only results for the top detector are shown, as the detector signals are highly correlated.
Extended Data Fig. 2 Rydberg constant R∞ from atomic hydrogen.
The R∞ values are determined by combining each atomic hydrogen measurement2,3,5,6,7,18 with the proton radius rp from muonic hydrogen spectroscopy8,9,15 and equation (1). The large, correlated QED uncertainty of equation (1) for S-states (along with the common rp input) leads to a large correlation between some R∞ values, reducing their sensitivity as QED tests (r1S–2S is the Pearson correlation coefficient with R∞ from 1S–2S measurement18; r1S–2S < 0.1 if not shown). In particular, r1S–2S = 0.91 for the 2020 1S–3S measurement because of the dominant 1S level QED uncertainty5, whereas r1S–2S = 0.40 for the 2S–6P measurement because of the (eightfold) lower QED uncertainty of the 2S level. For reference, we show the value from the CODATA 2022 global adjustment1, which includes an uncertainty expansion factor to account for the scatter of the data. The addition of the high-precision, relatively uncorrelated 2S–6P measurement is likely to greatly reduce the effective scatter of future global adjustments, as less precise measurements are removed from the adjustment. Error bars show one-standard-deviation uncertainties.
Extended Data Fig. 3 In situ determination of stray electric fields.
a, The line of the 2S–6P1/2 transition (circles) shifts as a bias electric field Ex (here along the x-direction; Fig. 2b) is applied. Velocity group τ9 is shown, as its mean speed is closest to the mean speed \(\langle \bar{v}\rangle \) of all velocity groups. Solid lines show the simulated line shape for each value of Ex (scaled to match experimental data). A Voigt line shape fit (not shown) is used to find the resonance frequency. b, The line shape of the 2S–6P3/2 transition (circles) splits into two distinct components as Ex is increased and excitations of 6S and 6D levels become allowed. The simulated line shapes (solid lines) reproduce this splitting well and describe the relative amplitudes of the resulting components reasonably well. A simulation of a single atomic trajectory without Doppler broadening (dashed green line; scaled for visibility; Ex = 43 V m−1) reveals the more complex underlying substructure. A Voigt doublet line shape fit (not shown) is used to find the centre-of-mass resonance frequency of the line shape. c, A single stray field measurement using the 2S–6P1/2 transition. The Doppler-free transition frequency νe(Ex) (blue circles; error bars show statistical uncertainty only), found by the usual Doppler shift extrapolation, varies quadratically with Ex. A quadratic fit (solid blue line) gives the stray electric field component ΔEx (Extended Data Fig. 4) and quadratic dc-Stark shift coefficient βdc. The simulation (orange diamonds) and its fit (dashed orange line) agree with the experiment. The field-induced shift of the transition frequencies coincides with the energy shift of the excited level of the 2S–6P1/2 transition. d, Similar to panel c but using the 2S–6P3/2 transition, again showing good agreement between experiment and simulation. The shift of the centre-of-mass transition frequency is much smaller than both the shift of either the resolved line shape components of panel b or the energy shift of the excited level of the 2S–6P3/2 transition. Only results for the top detector are shown, as the detector signals are highly correlated. See Methods and Supplementary Methods for details.
Extended Data Fig. 4 Stray electric fields during measurement runs.
The stray electric field components ΔEx (blue circles), ΔEy (orange squares) and ΔEz (green diamonds) in the 2S–6P spectroscopy region, as determined by spectroscopy of the 2S–6P1/2 transition with applied bias electric fields (Extended Data Fig. 3 and Methods). See Fig. 2b for the coordinate system definition. Error bars show statistical uncertainties of the detector-averaged data. Shaded areas show the standard deviation for each component and measurement run, with weighted means and standard deviations (in parentheses) shown in the legends. The stray electric field predominantly points along the y-direction, which is the axis of the detector cylinder.
Supplementary information
Supplementary Information
Supplementary Methods Sections 1 and 2 and Supplementary References.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Maisenbacher, L., Wirthl, V., Matveev, A. et al. Sub-part-per-trillion test of the Standard Model with atomic hydrogen. Nature (2026). https://doi.org/10.1038/s41586-026-10124-3
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41586-026-10124-3






