Abstract
Neutrinos are the most abundant fundamental matter particles in the Universe and play a crucial part in particle physics and cosmology. Neutrino oscillation, discovered about 25 years ago, shows that the three known species mix with each other. Anomalous results from reactor and radioactive-source experiments1 suggest a possible fourth neutrino state, the sterile neutrino, which does not interact through the weak force. The Karlsruhe Tritium Neutrino (KATRIN) experiment2, primarily designed to measure the neutrino mass using tritium β-decay, also searches for sterile neutrinos suggested by these anomalies. A sterile-neutrino signal would appear as a distortion in the β-decay energy spectrum, characterized by a discontinuity in curvature (kink) related to the sterile-neutrino mass. This signature, which depends only on the shape of the spectrum rather than its absolute normalization, offers a robust, complementary approach to reactor experiments. Here we report the analysis of the energy spectrum of 36 million tritium β-decay electrons recorded in 259 measurement days within the last 40 eV below the endpoint. The results exclude a substantial part of the parameter space suggested by the gallium anomaly and challenge the Neutrino-4 claim. Together with other neutrino-disappearance experiments, KATRIN probes sterile-to-active mass splittings from a fraction of an eV2 to several hundred eV2, excluding light sterile neutrinos with mixing angles above a few per cent.
Similar content being viewed by others
Main
The three known neutrino flavours, electron (νe), muon (νμ) and tau (ντ), are neutral elementary particles (leptons) that interact only through the weak force, making them challenging to detect and crucial for understanding both particle physics and cosmology3. Neutrinos produced in a specific flavour state can transform into other flavour states. This phenomenon, known as neutrino flavour oscillation, links flavour states (νe, νμ, ντ), which determine their weak interactions, with mass states (ν1, ν2, ν3), which are crucial for understanding the dynamics of the Universe. A hypothetical fourth neutrino state (ν4), with a well-defined mass m4, could exist as a natural extension of the Standard Model of particle physics4. This mass state would predominantly lack a marked flavour component, with small contributions from e, μ and τ flavours. This minimal flavour association is why it is often referred to as a ‘sterile neutrino’ (νs). Active neutrinos can oscillate into sterile neutrinos, and mixing between these states acts as the fundamental mechanism for indirectly detecting sterile neutrinos. In the context of a 3 + 1 neutrino model1, the extended PMNS matrix U is a 4 × 4 unitary matrix that describes the mixing between flavour and mass eigenstates. The element |Ue4|2, also labelled sin2(θee), sets the oscillation amplitude3. For direct comparison with disappearance oscillation experiments, this is often recast as sin2(2θee) = 4sin2(θee)(1 − sin2(θee)). A sterile-neutrino signature typically manifests as deviations in the outgoing lepton flux or energy spectrum.
Interest in sterile neutrinos has been driven by a series of experimental anomalies reported over the past three decades. Among these, the gallium anomaly5 (GA) and the reactor antineutrino anomaly1 (RAA) are the most relevant for the work presented here. The GA refers to a deficit in electron neutrino flux observed during radiochemical experiments, which involve MeV-scale neutrinos and very short baselines of a few metres. Initially reported by the Gallium Experiment (GALLEX)6 and the Soviet–American Gallium Experiment (SAGE) experiments7,8, it was recently confirmed by the Baksan Experiment on Sterile Transitions (BEST) experiment9,10. By contrast, the RAA describes discrepancies between predicted and measured fluxes at longer baselines of about 10–100 m, with neutrino energies around 4 MeV. Several experiments, including DANSS (Detector of the reactor AntiNeutrino based on Solid Scintillator)11, PROSPECT (Precision Reactor Oscillation and Spectrum Experiment)12 and STEREO (Search for Sterile Reactor Neutrino Oscillations)13 have reported null results in direct tests of sterile neutrino oscillations by searching for spectral distortions. By contrast, the Neutrino-4 experiment14 has claimed a positive signal consistent with both the RAA and GA, reporting an oscillation pattern in the neutrino energy spectrum compatible with sterile neutrinos, with parameters m4 ≈ 2.70 ± 0.22 eV and sin2(2θee) = 0.36 ± 0.12. However, this result remains unconfirmed, and no scientific consensus has been reached4. In light of these conflicting findings, leading interpretations of the RAA increasingly point to biases in flux predictions or underestimated systematic uncertainties1,15.
Other notable anomalies include observations from the Liquid Scintillator Neutrino Detector (LSND) experiment16, which ran during the 1990s and detected an excess of electron antineutrino events in a muon antineutrino beam. This finding was interpreted as potential evidence for sterile-neutrino involvement. Around the same time, the KARMEN experiment17, which tested the same oscillation channel, observed no such excess, placing it in tension with LSND. Later, the MiniBooNE experiment18,19,20, operating at approximately 10 times the energy and baseline of LSND, observed an unexpected surplus of electron neutrino and antineutrino events in a muon-flavour neutrino beam. These results further highlight the unresolved questions surrounding sterile neutrinos21.
Despite variations in neutrino flavour, energy and baseline, these anomalies collectively hint at the possibility of non-standard neutrino oscillations involving sterile neutrinos associated with a mass range of 0.1 eV to several tens of eV (ref. 4). This is inferred from the relation \(\Delta {m}_{4}^{2}\approx \frac{1.27\times E({\rm{MeV}})}{L({\rm{m}})}\,{{\rm{eV}}}^{2}\), linking the oscillation length to the energy-to-baseline ratio. However, their statistical significance, ranging from 2 to 4 standard deviations, offers only weak evidence.
The existence of sterile neutrinos remains controversial, primarily because of the challenges in fully understanding systematic uncertainties and backgrounds of each experiment. This complexity is reflected in a diverse experimental landscape. For instance, measurements sensitive to the same mixing channel as Karlsruhe Tritium Neutrino (KATRIN) (|Ue4|2), such as Double Chooz22 and Daya Bay23, have reported null results. Moreover, experiments probing complementary channels, such as NOvA24, MINOS/MINOS+ (Main Injector Neutrino Oscillation Search)25 and Super-Kamiokande/T2K26, which primarily constrain ∣Uμ4∣2, have found no evidence for sterile neutrino mixing. Similarly, IceCube/DeepCore27, sensitive to both |Uμ4|2 and |Uτ4|2, has also reported null results. Collectively, these findings reinforce the robustness of the three-flavour neutrino framework.
Moreover, although the involvement of sterile neutrinos cannot be ruled out, global fits have quantified significant tensions among the various experimental results28. For instance, the gallium anomaly points to larger mixing angles than those inferred from reactor data, and Neutrino-4 reports a signal not corroborated by other reactor experiments. These inconsistencies suggest that, if sterile neutrinos are responsible, scenarios beyond the minimal 3 + 1 active-sterile mixing framework may be required29.
Taken together, these experimental results underscore the importance of continued precision measurements, such as those performed by KATRIN, to further probe the sterile neutrino hypothesis and clarify the origin of these anomalies.
Sterile-neutrino search with KATRIN
The KATRIN experiment2,30 aims primarily to determine the absolute neutrino mass scale by analysing the β-decay of molecular tritium.
Towards this goal, KATRIN performs a precision measurement of the electron energy spectrum down to 40 eV below the endpoint at 18.57 keV. To date, KATRIN has set an upper limit on the effective neutrino mass at 0.45 eV (90% confidence level (CL)) and aims to achieve a sensitivity of less than 0.3 eV by the end of data collection in 2025 (ref. 31). KATRIN also enables a highly sensitive search for a potential fourth neutrino-mass state32,33.
In the case of three active neutrinos, the differential spectrum \({R}_{{\rm{\beta }}}(E,{E}_{0},{m}_{{\rm{\nu }}}^{2})\) of the super-allowed tritium β-decay can be calculated with very high precision for a given value of electron kinetic energy E, the endpoint energy of the spectrum E0 and the effective neutrino mass \({m}_{{\rm{\nu }}}=\sqrt{{\sum }_{i=1}^{3}|{U}_{{\rm{e}}i}{|}^{2}{m}_{i}^{2}}\). Its shape is determined by energy and momentum conservation, the Fermi function with Z′ = 2 for the helium nucleus and calculable corrections related to rotational, vibrational and electronic excitations of the T2 and 3HeT+ (refs. 34,35,36).
During β-decay, a hypothetical neutrino mass eigenstate with mass m4 may be emitted, with a probability determined by the mixing between electron neutrino and sterile neutrino states. To incorporate a sterile neutrino, the original model is extended to include an additional contribution associated with a fourth neutrino-mass state, \({m}_{4}^{2}\), and active-to-sterile mixing, sin2(θee). The resulting differential decay spectrum is modelled as
Observationally, a sterile neutrino would lead to a twofold imprint on the electron spectrum: a distinct kink signature and a broader global distortion. The kink feature allows determination of the fourth neutrino mass m4, as it occurs at an energy equal to the endpoint energy minus m4. The amplitude of the global distortion is proportional to the mixing strength between sterile and active neutrinos, parametrized by sin2(θee). These phenomena are shown in Fig. 1 (right), highlighting the distinctive signature of a sterile neutrino on a β-decay spectrum.
The rear wall and electron gun, the windowless gaseous tritium source, the transport and pumping section, the pre- and main spectrometers, and the segmented focal-plane detector in which the electrons are recorded. The figure on the top right shows the expected spectral signature of a fourth neutrino state with mass m4 = 10 eV in the tritium β-decay spectrum. The kink feature appears at E0 − m4, whereas the overall distortion is governed by the mixing amplitude sin2(θee). An exaggerated mixing amplitude is shown for clarity.
KATRIN exploits its sub-eV sensitivity to the neutrino mass31,37 to search for sterile neutrinos at the eV scale, focusing on masses and mixing angles suggested by the RAA and GA. Here, the same apparatus and dataset as used in ref. 31 are used.
The search conducted by KATRIN complements oscillation-based methods that directly detect neutrinos. Instead of direct detection, KATRIN measures the effects of sterile neutrinos on the electron energy spectrum. The experimental signature is well defined, supported by precise spectral measurements and validated through meticulous calibration. Furthermore, the search benefits from a high signal-to-background ratio across most of the sterile-neutrino mass range investigated.
Setup and dataset
The KATRIN setup30, which spans over 70 m, consists of three main modules: a high-luminosity windowless gaseous tritium source (WGTS) of up to 100 GBq, a high-resolution magnetic adiabatic collimation and electrostatic (MAC-E) high-pass filter and an electron detector (Fig. 1). The WGTS provides a stable, high-purity tritium source used at 30 K and 80 K by continuously injecting gas at the centre of its 10-m-long beam tube. Tritium is then removed downstream using differential and cryogenic pumping, reducing pressure by 12 orders of magnitude to minimize background in the spectrometer. β-Decay electrons are guided through the beamline by strong magnetic fields of the order of several tesla. The MAC-E filter38,39, comprising the main and pre-spectrometer, precisely analyses electron energies. The magnetic field in the main spectrometer gradually decreases to Bana ≲ 6.3 × 10−4, ensuring adiabatic collimation of electron momenta. The spectrometers transmit electrons with kinetic energy above the retarding energy qU. With a filter width smaller than 2.8 eV, KATRIN can precisely identify a potential kink-like structure associated with a fourth neutrino state consistent with the sterile-neutrino interpretation of the RAA and GA. Electrons passing through the filter are counted by the focal-plane detector (FPD), a silicon PIN diode segmented into 148 pixels for spatial resolution40. KATRIN acquires the integral beta spectrum by recording count rates at various set points of retarding energy, qUi, with qUi spanning E0 − 40 eV ≤ qUi ≤ E0 + 135 eV. The integrated spectrum rate Rmodel is given by
where the differential spectrum Rβ from equation (2) is convolved with the response function f, which primarily accounts for scattering in the WGTS and the spectrometer transmission. A background rate Rbg is also added to the tritium β-decay signal. This background consists of three components31: a dominant energy-independent background base \({R}_{{\rm{bg}}}^{{\rm{base}}}\), an unconfirmed but possible retarding energy-dependent background \({R}_{{\rm{bg}}}^{qU}(q{U}_{i})\) and a small contribution from electrons trapped in the Penning trap between the pre- and main spectrometers41, denoted \({R}_{{\rm{bg}}}^{{\rm{Penning}}}(q{U}_{i})\). The tritium signal normalization, AS, is treated as a free-fit parameter. The endpoint energy, E0, is left free, as it reflects the tritium Q-value and work function differences between the source, rear wall and spectrometer, introducing a sub-electronvolt uncertainty that prevents it from being fixed in the fit.
Data from the first33 and second32 measurement campaigns of KATRIN have already set stringent constraints on sterile-neutrino parameters, based on the analysis of 6 million electrons within the last 40 eV below the endpoint. An improved analysis, using data from the first five measurement campaigns conducted between March 2019 and June 2021, is presented in this study. Each scan included approximately 40 set points of the retarding energy qUi in the range of [E0 − 300 eV, E0 + 135 eV] and lasted 2.1–3.3 h. Measurements above the endpoint enable accurate background determination. A total of 1,757 out of 1,895 scans were selected after data-quality cuts, comprising 36 million signal and background electrons within the same region of interest. This selection excludes scans in which monitoring systems indicated instabilities in key experimental parameters, such as electromagnetic fields or source conditions. This dataset allows a search for sterile neutrinos with a mass m4 of up to approximately 33 eV.
During the first campaign (KNM1, for KATRIN Neutrino-mass Measurement 1), the tritium-gas column density was set below its design value at ρd = 1.08(1) × 1021 m−2, where ρ represents the average gas density and d = 10 m is the length of the tritium source cavity. This value was subsequently increased to ρd = 4.20(4) × 1021 m−2 during the second campaign (KNM2). In the third campaign (KNM3), the background level was halved by moving the central analysing plane closer to the FPD detector, a configuration denoted as shifted analysing plane (SAP) that reduced volume-dependent backgrounds42. KNM3 was divided into its SAP setting (KNM3-SAP) and the symmetrical nominal analysing plane configuration (KNM3-NAP) for validation purposes. After KNM3, the calibration of the setup was enhanced by implementing a method that used 83mKr conversion electrons to monitor electric potential variations within the tritium source. This approach involved co-circulating a tiny quantity of 83mKr with tritium at a column density of ρd = 3.8 × 1021 m−2, which required raising the source temperature from 30 K to 79 K. In KNM4, measures were taken to address the scan-time-dependent background related to a Penning trap formed between the two spectrometers. This was achieved by lowering the pre-spectrometer potential, thereby eliminating the trap. Moreover, the scanning-time distribution for the scan steps was optimized to enhance neutrino-mass sensitivity, transitioning from the nominal KNM4-NOM to a more efficient configuration KNM4-OPT. Between KNM1 and KNM4, tritium accumulated on the gold surface of the rear wall, generating unwanted β-decay electrons. Before KNM5, a successful ozone cleaning was performed on the rear wall43, effectively reducing the tritium activity by three orders of magnitude. Out of the 148 available pixels, 117 were used in KNM1 and KNM2, and 126 in KNM3-SAP, KNM3-NAP, KNM4-NOM, KNM4-OPT and KNM5. The remaining pixels were excluded because they either showed increased noise or were partially shadowed by structural components of the beamline.
Testing the sterile-neutrino hypothesis
KATRIN measures the effective neutrino mass by analysing the shape of the tritium β-decay spectrum near the kinematic endpoint. In the standard neutrino mass analysis, four key parameters are fitted: the squared active-neutrino mass (\({m}_{{\rm{\nu }}}^{2}\)), the endpoint energy (E0), the signal amplitude (As) and the background rate (Rbg). For the sterile-neutrino analysis, two additional parameters are introduced: the squared mass of the fourth neutrino state (\({m}_{4}^{2}\)) and the mixing amplitude (sin2(θee)). These are included in the model described in equation (3). Depending on the scenario being tested, the squared active-neutrino mass can either be left as a free parameter in the fit or set to zero. For the main results presented here, we assume a hierarchical scenario in which m1,2,3 ≪ m4, which allows mν to be set to zero. This is justified by neutrino oscillation data, which constrain the smallest possible effective neutrino mass to about 9 meV for the normal hierarchy (m1 < m2 < m3) and 50 meV for the inverted hierarchy (m3 < m1 < m2), well below the sensitivity of the current KATRIN measurements.
A systematic 50 × 50 logarithmic grid search spans \({m}_{4}^{2}\) from 0.1 eV2 to 1,600 eV2 and sin2(θee) from 10−3 to 0.5, examining parameter combinations and their impact on the spectral shape. At each grid point, the χ2 function is minimized with respect to three free parameters (E0, AS and \({R}_{{\rm{bg}}}^{{\rm{base}}}\)). To quantify the results, a confidence interval at 95% CL was derived based on Wilks’ theorem44 for two degrees of freedom, using \(\Delta {\chi }^{2}={\chi }^{2}\,-\,{\chi }_{\min }^{2}=5.99\), where \({\chi }_{\min }^{2}\) represents the minimum chi-square value across the grid. Systematic uncertainties were incorporated using the pull-term method, constrained by experimental data and calibration measurements.
As shown in Fig. 2, the analysis of simulated data produces an open contour in the parameter space when no sterile-neutrino signal is present (left), and a closed contour when such a signal is detected, indicating a statistically significant sterile neutrino presence (right). The resulting constraints on sterile-neutrino parameters are derived entirely from the shape of the experimental spectrum and validated through simulations.
To enhance the robustness of the analysis, the best fit \({\chi }_{\min }^{2}\) is restricted to \({m}_{4}^{2} < \mathrm{1,000}\,{{\rm{eV}}}^{2}\), ensuring that the sterile-neutrino branch, which appears at energies less than E0 − m4, spans multiple data points in the observed β-spectrum.
A blinding scheme (Methods) was applied to ensure an unbiased analysis.
Before analysing the actual datasets, Asimov datasets simulated under the respective operational conditions were thoroughly examined and cross-checked by two independent analysis teams.
Following these studies, the analysis of the actual data then proceeded, starting with independent assessments of the five campaigns (KNM1–KNM5) before unblinding and combining the datasets. During unblinding, the campaign-wise analysis showed an issue in KNM4, in which a distinct structure in the fit residuals was detected (Methods). A re-evaluation identified a problem with data combination, leading to the division of KNM4 into two sub-periods, KNM4-NOM and KNM4-OPT, for analysis. Moreover, some systematic effects were revisited and refined. This issue, discovered in the sterile-neutrino analysis, also affected the neutrino mass analysis31.
KATRIN in the light of neutrino-oscillation results
No significant sterile-neutrino signal was found in the KATRIN search. Consequently, we present the new 95% CL exclusion contour derived from the first five measurement campaigns (KNM1–KNM5), shown in Fig. 3 (black). This limit is compared with constraints from other key experiments probing electron neutrino and antineutrino disappearance. As short-baseline neutrino oscillation experiments measure different observables than β-decay experiments, appropriate variable transformations are required for comparison. Although KATRIN directly probes sin2(θee), oscillation experiments typically report the effective mixing angle, defined as sin2(2θee) = 4sin2(θee)(1 − sin2(θee)). The relevant mass splitting is approximated by \(\Delta {m}_{41}^{2}\approx {m}_{4}^{2}-{m}_{{\rm{\nu }}}^{2}\), valid to within approximately 2 × 10−4 eV2 (ref. 45). Figure 3 is obtained under the assumption \({m}_{{\rm{\nu }}}^{2}=0\), so that the sterile-neutrino mass parameter corresponds directly to the squared mass splitting relative to the active state. Alternative treatments, including cases in which \({m}_{{\rm{\nu }}}^{2}\) is left free, are discussed in the Methods.
For comparison, the exclusion contour from the first two KATRIN campaigns (KNM1 and KNM2) is also shown in light blue32. The improvements in KNM1–KNM5 reflect a sixfold increase in statistics and substantial improvements in the control of systematics.
The findings of KATRIN significantly constrain the parameter space associated with the RAA1. Although a region with small mixing angles remains viable, a large portion of the RAA parameter space is excluded. Furthermore, this KATRIN result challenges most of the parameter space favoured by the GA, recently reinforced by the BEST experiment9,10. In particular, the combined best-fit point from the BEST, GALLEX and SAGE experiments, at \(\Delta {m}_{41}^{2}\approx 1.25\,{{\rm{eV}}}^{2}\) and sin2(2θee) ≈ 0.34, is excluded with 96.56% CL. Therefore, these results challenge the light sterile-neutrino hypothesis.
KATRIN results complement reactor-neutrino oscillation experiments by probing large \(\Delta {m}_{41}^{2}\) values, reaching down to a few eV2, whereas reactor experiments are more sensitive to lower \(\Delta {m}_{41}^{2}\) values. The sensitivities of KATRIN and reactor-based experiments coincide at approximately \(\Delta {m}_{41}^{2}\approx 3\,{{\rm{eV}}}^{2}\) for a mixing angle of sin2(2θee) ≈ 0.1. Among reactor experiments, PROSPECT (Fig. 3, yellow) currently provides the most stringent exclusion limits, benefiting from effective background suppression through pulse shape discrimination (PSD)12. STEREO13, shown in light orange, is second in sensitivity, using a Gd-doped scintillator for systematic and background control. DANSS, represented in dark orange, achieves high sensitivity because of its large neutrino-candidate statistics, collecting approximately 4 million events over 3 years at various baselines11.
The first five measurement campaigns of KATRIN have fully excluded the 95% CL contour reported by Neutrino-4, which claimed oscillation evidence at \(\Delta {m}_{41}^{2}=7.3\,{{\rm{eV}}}^{2}\) (ref. 14), corresponding to m4 ≈ 2.70 ± 0.22 eV for m1 ≪ m4. This claim has been a topic of debate, as the contours of Neutrino-4 reach only the sensitivity limits of PROSPECT and STEREO, in which their statistical power decreases. By contrast, the sterile neutrino parameter space favoured by Neutrino-4 is now fully excluded by KATRIN, which rejects the Neutrino-4 best-fit point at 99.99% CL.
Conclusions and outlook
Using data from the first five measurement campaigns (KNM1–KNM5), conducted between 2019 and 2021 and comprising 36 million electrons near the tritium β-decay endpoint, KATRIN has carried out a new search for light sterile neutrinos. KATRIN places stringent constraints on much of the parameter space suggested by the RAA and GA, except for cases with very small mixing angles. KATRIN also disfavours the parameter region indicated by the Neutrino-4 claim14. The findings of KATRIN complement reactor-based oscillation experiments, such as PROSPECT12 and STEREO13, extending the sterile neutrino parameter space probed to larger mass-splitting scales.
With data collection continuing through 2025, the sensitivity of KATRIN is expected to increase substantially (Methods), allowing for improved searches for light sterile neutrinos. The dataset will then exceed 220 million electrons in the region of interest, more than six times the current statistics. In 2026, KATRIN will be upgraded with the TRISTAN detector46, enabling differential measurements of the full tritium β-decay spectrum. TRISTAN will complement the current KATRIN setup, which focuses on the endpoint region, by probing the entire spectrum. This high-statistics approach will allow the search for keV-scale sterile neutrinos, potential dark matter candidates, with mixing angles as low as one part per million (ref. 47).
Methods
Experimental setup
The KATRIN experiment measures the electron energy spectrum of tritium β-decay near the kinetic endpoint E0 ≈ 18.6 keV. The 70-m-long setup comprises a high-activity gaseous tritium source, a high-resolution spectrometer using the MAC-E-filter principle and a silicon p-i-n diode detector2,30.
Molecular tritium gas with high isotopic purity (up to 99%; ref. 48) is continuously injected into the middle of the WGTS, in which it streams freely to both sides. At the ends of the 10-m-long beam tube, more than 99% of the tritium gas is pumped out using differential pumping systems. With a throughput of up to 40 g day−1, an activity close to 100 GBq can be achieved49,50. The β-electrons are guided adiabatically by a 2.5 T magnetic field through the WGTS51. Before entering the spectrometer, they traverse two chicanes, in which the residual tritium flow is reduced by more than 12 orders of magnitude through differential52 and cryogenic pumping53. The β-electrons flowing in the upstream direction of the beamline reach the gold-plated rear wall, where they are absorbed. Besides separating the WGTS from the rear part of the KATRIN experiment that houses calibration tools, the rear wall also controls the source potential by a tunable voltage of \({\mathcal{O}}\) (100 mV).
Electrons guided towards the detector are subjected to energy filtering by two spectrometers. A smaller pre-spectrometer first rejects low-energy β-particles. The precise energy selection with \({\mathcal{O}}\) (1 eV) resolution is then performed by the 23-m-long and 10-m-wide main spectrometer. In both spectrometers, only electrons with a kinetic energy above the threshold energy qU are transmitted (high-pass filter). The electron momenta \({\bf{p}}={{\bf{p}}}_{\perp }+{{\bf{p}}}_{\parallel }\) are collimated adiabatically such that the transverse momentum is reduced to a minimum towards the axial filter direction by gradually decreasing the magnetic field towards the so-called analysing plane. In the main spectrometer, the magnetic field is reduced by four orders of magnitude to Bana ≲ 6.3 × 10−4 T. Behind the main spectrometer exit, the magnetic field is increased to its maximum value of Bmax = 4.2 T, resulting in a maximum acceptance angle of \({\theta }_{\max }=\arcsin (\sqrt{{B}_{{\rm{S}}}/{B}_{\max }})\approx 5{1}^{^\circ }\), where θ is the initial pitch angle of the electron. The filtered electrons are counted by the FPD, which is a silicon p-i-n diode segmented into 148 pixels (ref. 40). This detector is regularly calibrated with a 241Am source to ensure stable performance. Its efficiency is about 95%, with only small variations between pixels that remain constant over time. Effects that do not depend on the retarding potential are accounted for by the free normalization combined fit across groups or patches of pixels54.
Background electrons are indistinguishable from tritium β-decay electrons and thus contribute to the overall count rate at the detector. There are different sources and mechanisms that generate the background events. The majority originates from the spectrometer section of the experiment. Secondary electrons are created by cosmic muons and ambient gamma radiation on the inner spectrometer surface55,56 but are mitigated by magnetic shielding and a wire electrode system30. The decay of 219Rn and 220Rn inside the main spectrometer volume is reduced by cryogenic copper baffles that are installed in the pumping ducts of the main spectrometer57. Another source of background is electrons from radioactive decays produced in the low-magnetic-field part of the spectrometer58. These primary electrons can be trapped magnetically, ionizing residual-gas molecules and producing secondary electrons that are correlated in time, leading to a background component with a non-Poisson distribution. The dominating part of the background stems from the ionization of neutral atoms in highly excited states, which enter the main spectrometer volume in sputtering processes (decay of residual 210Pb) at the inner surface of the spectrometer. The low-energy electrons emitted in this process are accelerated to signal-electron energies and guided to the detector. The magnitude of this background component scales with the flux tube volume in the re-acceleration part of the main spectrometer. A re-configuration of the electromagnetic fields in the main spectrometer, called the SAP setting, reduces the background by a factor of 2 by shifting the plane of minimal magnetic field from the nominal position in the centre of the spectrometer towards the detector while compressing the flux tube at the same time42,59. After successful tests, this configuration was set as the new standard (see the next section). There is also the possibility of a Penning trap being formed between the pre- and main spectrometer. The stored particles can increase the background rate. To counter this effect, a conductive wire is inserted between scan steps into the beam tube to remove the stored particles41. Because the duration of the scan steps differs, this creates a scan-time-dependent background. Reducing the time between particle removal events and lowering the pre-spectrometer potential enabled a full mitigation of the scan-step-duration-dependent background starting with the KNM5 measurement period. Moreover, the transmission and detection probabilities of background electrons may slightly depend on the retarding-potential setting, potentially causing a retarding-potential-dependent background. This effect is addressed in the analysis through an additional slope component, constrained by dedicated background measurements31.
KNM12345 dataset
The data collected by KATRIN is organized into distinct measurement periods, referred to as KATRIN Neutrino Measurement (KNM) campaigns. The integral β-spectrum is measured through a sequence of defined retarding-energy set points, which we refer to as a scan. Each dataset comprises several hundred β-spectrum scans, with individual scan durations ranging from 125 min to 195 min. The measurement time distribution (MTD), shown in Extended Data Fig. 1 (bottom), determines the time allocated for each qUi, optimized for maximizing sensitivity to a neutrino-mass signal, where the index i corresponds to different retarding energy settings. The energy interval typically spans E0 − 300 eV ≤ qUi ≤ E0 + 135 eV, following an increasing, decreasing or random sequence. The analysis presented in this work uses set points ranging from E0 − 40 eV to E0 + 135 eV and is based on the first five measurement campaigns (KNM1–KNM5), summarized in Extended Data Table 1. Extended Data Fig. 2 shows the electron energy spectra for each of the five individual campaigns.
The first measurement campaign of the KATRIN experiment, KNM1, started in April 2019 with a relatively low source-gas density of ρd = (1.08 ± 0.01) × 1021 m−2 compared with the design value of ρd = 5 × 1021 m−2. Here, ρ denotes the average density of the source gas, and d = 10 m is the length of the tritium source. The source was operated at a temperature of 30 K. The measurement in KNM1 lasted for 35 days, recording a total of about 2 million electrons. The sterile-neutrino analysis of this dataset was published in ref. 33.
After a maintenance break, the next campaign, KNM2, started in October of the same year and lasted for 45 days, only 10 days longer than KNM1. However, more than double the number of electrons was measured because of the increased source-gas density. The density was raised to ρd = (4.20 ± 0.04) × 1021 m−2, which corresponds to 84% of the design value. Although the background rate was reduced from 0.29 count per second (cps) in KNM1 to 0.22 cps, it was still above the anticipated design value of 0.01 cps (ref. 2). The sterile-neutrino analysis of this dataset, in combination with KNM1, was published in ref. 32.
To further reduce the background rate, a new electromagnetic-field configuration42 was tested in the next campaign, starting in June 2020. KNM3 was split into two parts to validate the new shifted-analysing-plane (SAP) setting in contrast to the nominal (NAP) setting. Before the start of the measurement, the source temperature was increased to 79 K. This allowed the co-circulation of gaseous krypton 83mKr with the tritium gas to perform simultaneous calibration measurements and β-scans60,61. In the first part of KNM3, KNM3-SAP, the β-spectrum was measured in the SAP setting for 14 days, and the background was subsequently reduced to 0.12 cps. Although the SAP setting reduced the background almost by a factor of 2, the increased inhomogeneities in the magnetic and electric fields require the segmentation of the detector analysis into 14 patches with 14 individual models59. In KNM3-NAP, it was demonstrated that switching between both settings works, and the background rate increased to 0.22 cps, as expected. The second part also lasted 14 days, and about 2.5 million electrons were measured in all of KNM3.
Before launching a new measurement campaign, a target source-gas density of ρd = 3.8 × 1021 m−2 was chosen to keep the same experimental conditions for future measurements. With the start of KNM4 in September 2020, one of the challenges was to reduce the background caused by charged particles accumulated in the Penning trap between the main and pre-spectrometer. The accumulation was time dependent, as the trap was emptied after each qUi. To reduce this background, the run durations were shortened to empty the trap more often. Finally, the Penning-trap effect was eliminated by lowering the pre-spectrometer voltage. Furthermore, the MTD was optimized during the campaign to increase the neutrino-mass sensitivity. This sequence defines the split into the first part, KNM4-NOM, and the second part, with an optimized MTD, KNM4-OPT. In all of KNM4, more than 10 million electrons were recorded in 79 measurement days.
The final dataset used in this analysis is KNM5, which started in April 2021. It contains more than 16 million electrons recorded in 72 measurement days. Before the start of the measurement, the rear wall was cleaned of its residual tritium with ozone. This step was necessary to address the accumulation of tritium on the rear wall, which produced an additional spectrum on top of the tritium β-spectrum, introducing new systematic uncertainties31.
Model description
General model
The KATRIN experiment measures the neutrino mass by analysing the distortions near the endpoint of the tritium β-decay spectrum. The model used for this analysis is described in the main text and follows the formalism outlined in equations (2) and (3). It includes the theoretical β-spectrum, the experimental response function and parameters such as the active-neutrino mass squared \({m}_{{\rm{\nu }}}^{2}\), sterile-neutrino mass squared \({m}_{4}^{2}\) and the mixing parameter sin2(θee). The simulated imprint of a fourth mass eigenstate, with a mass of 10 eV and mixing of 0.05, is shown in Extended Data Fig. 1. To address the computational challenge posed by 1,609 data points requiring \({\mathcal{O}}(1{0}^{3})\) evaluations of \({R}_{{\rm{model}}}({A}_{{\rm{S}}},{E}_{0},{m}_{{\rm{\nu }}}^{2},{m}_{4}^{2},{\theta }_{{\rm{ee}}},q{U}_{i})\) per minimization step, an optimized direct-model calculation62 and a neural-network-based fast model prediction63 were used, as detailed in sections ‘The KaFit analysis framework’ and ‘The Netrium analysis framework’, respectively.
The KaFit analysis framework
KaFit (KATRIN Fitter) is a C++-based fitting tool developed for the KATRIN data analysis. It is part of KASPER—the KATRIN Analysis and Simulations Package64. KaFit uses the MIGRAD algorithm from the MINUIT numerical minimization library65 to fit KATRIN data. The fit minimizes the likelihood function −2log(L) in equation (4) with respect to parameters of interest such as the squared active-neutrino mass \({m}_{{\rm{\nu }}}^{2}\), \({m}_{4}^{2}\), sin2(θee) and nuisance parameters Θ (ref. 62).
Model evaluation is done by the Source Spectrum Calculation (SSC) module, which returns the predicted electron counts for a given set of input parameters. Within SSC, the integrated β-decay spectrum is calculated by numerically integrating the differential spectrum after convolving it with the experimental response function, as in equation (3). Using the measurement times and the experimental parameters, the SSC module computes the prediction for the number of counts for each retarding energy qUi.
KaFit computes the negative logarithm of likelihood using the experimental and model electron counts. Twice the negative log-likelihood is referred to as χ2 in equation (4). KaFit minimizes the χ2 and returns the best-fit parameters and the corresponding \({\chi }_{\min }^{2}\). A typical minimization requires \({\mathcal{O}}(1{0}^{4})\) evaluations of the χ2 function. For computational efficiency, KaFit and SSC make use of multiprocessor computing. The recalculation of all components of the spectrum model at each minimization step is avoided through caching, thereby reducing the overall minimization time by a factor of approximately \({\mathcal{O}}(1{0}^{3})\). Moreover, caching the response function f(E, qUi) (equation (3)) and reusing it across systematic studies further reduces redundant calculations, enhancing the overall efficiency. KaFit has been optimized to enhance the computational speed for sterile-neutrino analysis while maintaining sufficient precision for physics searches within the current statistical and systematic budget.
The Netrium analysis framework
In the second approach to perform the analysis, a software framework called Netrium63 is used. In this case, the KATRIN physics model is approximated in a fast and highly accurate way using a neural network. It can predict the model spectrum \({R}_{{\rm{model}}}({A}_{{\rm{S}}},{E}_{0},{m}_{{\rm{\nu }}}^{2},{m}_{4}^{2},{\theta }_{{\rm{ee}}},q{U}_{i})\) (equation (3)) for all main input parameters and improves the computational speed by about three orders of magnitude with respect to the numerical-model calculation. The neural network features an input and an output layer with two fully connected hidden layers in between. During the training process, \({\mathcal{O}}(1{0}^{6})\) sample spectra, calculated using the analytical model, are used to optimize the weights of the neural network.
Comparison
Cross-validation between the two different analysis approaches to calculate the model is a key component of the blinding strategy of KATRIN (see the next section) and serves as quality assurance for the analysis. In previous sterile-neutrino searches with the KATRIN experiment32,33, the two described analysis approaches were benchmarked with the analysis software Simulation and Analysis with MATLAB for KATRIN (Samak)66, not used for the current analysis. It is based on the covariance matrix approach and designed to perform high-level analyses of tritium β-spectra measured by the KATRIN experiment. For this comparison, the results of the combined sterile analysis, based on data from the first five measurement campaigns, are shown in Extended Data Fig. 3. The active-neutrino mass is set to \({m}_{{\rm{\nu }}}^{2}=0\,{{\rm{eV}}}^{2}\). An excellent agreement is achieved for the exclusion contour and the best-fit parameter between the analyses conducted with KaFit and Netrium.
Blinding strategy and unblinding process
KATRIN applies a rigorous blinding scheme to minimize human bias and ensure the integrity of the analysis process. This approach also enforces robust software validation to avoid potential programming errors. The benchmark for validation is established by fixing the active squared neutrino mass to \({m}_{{\rm{\nu }}}^{2}=0\,{{\rm{eV}}}^{2}\) in all steps of the analysis.
The process began with the generation of twin Asimov datasets for each measurement campaign, in which all input values are set to their expected means67. These datasets were independently evaluated by two teams using the KaFit and Netrium software frameworks. Cross-validations confirmed an agreement within a few per cent of sin2(θee) between the results from the two teams. Only after this validation step was the combined analysis of all twin Asimov datasets performed. Once the validation using the Asimov data was completed, the real data were analysed on a campaign-by-campaign basis. Again, results from both teams were compared to ensure consistency. Only when the campaign-wise results demonstrated agreement did the analysis proceed to the final step, in which the complete, unblinded dataset was evaluated.
During the sterile-neutrino analysis of the KNM4 campaign, a closed contour with 99.98% CL was unexpectedly identified. This result was anomalous, given that all other campaigns, contributing the bulk of the statistics, consistently yielded open contours. The anomaly prompted a temporary suspension of the unblinding procedure and initiated a detailed technical investigation. The investigation uncovered a technical error in the combination of sub-campaigns within KNM4. Specifically, data from periods with different MTDs and effective endpoints cannot be directly stacked together. To address this, the dataset was divided into two distinct periods: KNM4-NOM and KNM4-OPT, reflecting the respective sub-campaign configurations. Apart from correcting the data-combination process, the analysis framework was improved with updated inputs and enhanced validation checks to prevent similar issues in the future. The identification of this error during the sterile-neutrino search also led to a re-evaluation of the neutrino-mass analysis, which was based on the same dataset. Detailed descriptions of all modifications and their impact can be found in ref. 31.
Results for individual campaigns
There are seven individual datasets (with KNM3 and KNM4 each split into two), corresponding to the first five measurement campaigns (KNM1–KNM5), which are described in detail in the section ‘KNM12345 dataset’. Each of the datasets can be used separately to search for a sterile neutrino. For campaigns measured in the NAP setting, the counts of all active pixels are summed up to one spectrum. By contrast, datasets in the SAP setting feature a patch-wise data structure (14 patches), in which, for each patch, nine pixels with similar electromagnetic fields are grouped together. Therefore, each dataset in the SAP setting contains 14 different spectra.
A sterile-neutrino search using the grid-scan method is applied to the seven individual datasets, whose exclusion contours at 95% CL are shown in Extended Data Fig. 4. The differences in the accumulated signal statistics and their variations lead to different excluded regions in the sterile parameter space. The overall shape of the exclusion contours is very similar for all of the campaigns. The KNM4-NOM campaign enables the exclusion of a more extensive region of the sterile-neutrino parameter space associated with high sterile-neutrino masses; for low sterile-neutrino masses, the KNM5 campaign dominates.
Results for combined campaigns
Across the seven campaigns, a total of 68,237 scan steps were recorded, collecting approximately 36 million counts within the analysis window. As described in the section ‘KNM12345 dataset’, data were grouped primarily into seven sets according to the experimental conditions and FPD pixel configurations. Within each dataset, electrons collected at a particular retarding potential were summed over all the scans. This is possible given the ppm-level (10−6) high-voltage reproducibility of the main spectrometer68. For each of the NAP datasets (KNM1, KNM2 and KNM3-NAP), the counts from all the pixels were summed up to obtain datasets that feature counts per retarding potential. For the SAP datasets (KNM3-SAP, KNM4-NOM, KNM-OPT and KNM5), the detector pixels were grouped into 14 ring-like detector patches of nine pixels each, to account for variations in the electric potential and magnetic fields across the analysing plane42. Pixel grouping was determined based on transmission similarity. For each of the SAP datasets, counts per patch per retarding potential were obtained by summing over all scans and grouping pixels. These so-called merged datasets, containing 1,609 data points, were used in the sterile-neutrino analysis.
The combined data were analysed using the following definition of χ2:
where θ is the vector of fit parameters, I denotes the total number of data points or retarding energy setpoints qUi, Nobs,i is the observed count at the ith setpoint, Ntheo(qUi, θ) is the theoretically predicted count at the ith setpoint and \(f({N}_{{\rm{obs}},{\rm{i}}}| {N}_{{\rm{theo}}}(q{U}_{i},{\rm{\theta }}))\) represents the likelihood function, modelled using a Poisson distribution. The θ parameter values were inferred by minimizing χ2(θ).
For the combined KNM1–KNM5 analysis, further merging of the datasets is not possible because of the significant differences in the experimental conditions in which they were measured. The combined χ2 function incorporates contributions from both Gaussian and Poissonian likelihoods. Negative log-likelihoods of Gaussian models (\({\chi }_{{\rm{G}}}^{2}\)) were used for the NAP campaigns as the mean value of the counts summed over all pixels was large, allowing the counts to be modelled by a Gaussian distribution31. Negative log-likelihoods of Poissonian models (\({\chi }_{{\rm{P}}}^{2}\)) were applied to the SAP campaigns, as the mean value of the counts summed over pixels in each patch was not large enough. The combined χ2 function is expressed as
In this formulation, Θ is the vector of the physics parameters and the nuisance parameters of all the campaigns. Sigk and Siglp correspond to AS in equation (3) and Bgi and Bgjp represent the energy-independent background rate \({R}_{{\rm{bg}}}^{{\rm{base}}}\). Correlations among nuisance parameters ξ were captured by the covariance matrix Σ, with the mean values \(\widehat{\xi }\) determined from systematic measurements. In equation (5), \({{\mathcal{I}}}_{{\rm{G}}}=1,2,3\text{-NAP}\) represents the KNM-NAP campaigns modelled with Gaussian likelihoods, whereas \({{\mathcal{I}}}_{{\rm{P}}}=\text{3-SAP, 4-NOM, 4-OPT, 5}\) corresponds to the KNM-SAP campaigns modelled with Poissonian likelihoods. The exclusion bounds in Extended Data Fig. 5 compare results from the first two campaigns (KNM1–KNM2) to all five campaigns (KNM1–KNM5). The analysis highlights how increased statistics significantly improve exclusion limits across all \({m}_{4}^{2}\) values.
Systematic uncertainty
An accurate and detailed accounting of the systematic uncertainties is essential for obtaining robust estimates of \({m}_{4}^{2}\) and sin2(θee) from the tritium β-spectrum. Several systematic effects arise within the KATRIN beamline, influencing the shape of the measured spectrum. The systematic effects and inputs considered for this analysis are the same as in the neutrino-mass analysis31 and are treated as nuisance parameters. The central values \(\widehat{\xi }\) and covariance Σ of the systematics parameters were determined from calibration measurements and simulations. To include these effects in the likelihood function (equation (4)), they are modelled as a multivariate normal distribution \({\mathcal{N}}(\widehat{\xi },\varSigma )\). Subsequently, the likelihood function is multiplied by the normal distribution of each systematic effect. It decreases the likelihood function if the parameters deviate from their mean values \(\widehat{\xi }\).
To assess the impact of individual systematic uncertainties, a separate scan of the sterile parameter space is performed for each systematic effect, considering only the statistical uncertainty and the specific systematic uncertainty in each case. This is done on the simulated Asimov twin data with \({m}_{{\rm{\nu }}}^{2}=0\,{{\rm{eV}}}^{2}\). The impact of individual systematic uncertainties on the sensitivity contour is quite small. Therefore, to evaluate the influence of systematic uncertainties more quantitatively, a raster scan is performed. In this approach, for each fixed value of \({m}_{4}^{2}\) in the parameter grid, the 1σ sensitivity on the mixing sin2(θee) is calculated independently. By performing a raster scan, the number of degrees of freedom is effectively reduced to one at each grid point. The contribution of each systematic effect to the total uncertainty is determined using \({\sigma }_{{\rm{syst}}}=\sqrt{{\sigma }_{{\rm{stat}}+{\rm{syst}}}^{2}-{\sigma }_{{\rm{stat}}}^{2}}\). The raster scan results for the combined KNM1–KNM5 dataset are presented in Extended Data Fig. 6 for three values of \({m}_{4}^{2}\) as well as for all values of \({m}_{4}^{2}\) in Extended Data Fig. 7. We can see that the statistical uncertainty dominates over all systematic effects. Owing to increasing statistics at lower retarding energies, the ratio of statistical to systematic uncertainties depends on the value of \({m}_{4}^{2}\). Source-related uncertainties dominate the overall systematic uncertainty. The largest impact stems from the uncertainty on the density of the molecular tritium gas in the source column. It is followed by the uncertainty of the energy-loss function, which describes the probability that electrons will lose a certain amount of energy when scattering with the molecules in the source. Depending on the value of \({m}_{4}^{2}\), variations in the source potential and the scan-time-dependent background also significantly affect the overall systematic uncertainty. The uncertainty from the scan-step-duration-dependent background is the first contribution of a non-source-related systematic effect.
Final-state distribution systematics
One source of uncertainty affecting the shape of the β-electron spectrum in KATRIN arises from the final-state distribution (FSD) of molecular tritium. During the decay process, part of the available energy can excite the tritium molecule into rotational, vibrational (ro-vibrational) and electronic states, thereby reducing the energy transferred to the emitted electron. Consequently, the energy distribution of these states directly influences the differential β-decay spectrum used in the analysis.
In previous KATRIN campaigns, the systematic contribution of the FSD uncertainty was estimated by comparing two ab initio calculations—one from ref. 34 and an earlier version from ref. 69—leading to a rather conservative estimation. For the analysis presented here, the FSD-related uncertainty was re-evaluated following the procedure derived for the neutrino-mass analysis36, which allows the estimation of the impact of FSD on the total uncertainty by considering the details of the calculation of the final-states distribution, such as the validity of theoretical approximations and corrections, the uncertainty of experimental inputs and physical constants, and the convergence of the calculation itself.
The same FSDs used in the neutrino-mass analysis, generated by perturbing the nominal set of input parameters entering the calculation, were used to create the twin Asimov datasets of the null hypothesis (no sterile neutrinos). Each dataset was then fitted with Netrium, trained on a single nominal FSD, with different values of m4 and sin2(θee). By comparing the resulting sensitivity contours at the 95% CL (\(\Delta {\chi }_{{\rm{crit}}}^{2}=5.99\)) with that from the nominal case, the impact of FSD uncertainties can be quantified.
This analysis was performed using the KNM5 campaign alone, due to differences between the models used to generate FSDs in different campaigns. Nonetheless, the result of this analysis, presented in Extended Data Fig. 8, demonstrates that the various FSD effects have a negligible impact on the sensitivity contours, justifying the use of the nominal FSD for fitting with sufficient accuracy.
Applicability of Wilks’ theorem
The contours and their associated confidence regions reported in earlier sections were based on the critical threshold values \(\Delta {\chi }_{{\rm{crit}}}^{2}\) corresponding to a desired CL. The sterile-neutrino search uses the Δχ2 test statistic:
where H0 is the assumed truth for the sterile-neutrino mass and mixing angle \([{m}_{4}^{2},{\sin }^{2}({\theta }_{{\rm{ee}}})]\), and H1 represents the global best-fit hypothesis based on the observations. According to Wilks’ theorem44, as H0 represents a special case of H1, Δχ2 will follow a chi-squared distribution with two degrees of freedom in the large sample limit. Wilks’ theorem thus provides the critical threshold value \(\Delta {\chi }_{{\rm{crit}}}^{2}\) corresponding to a desired CL, which allows us to quickly determine whether H0 is compatible with H1. This simplifies the sterile-neutrino analysis by providing a critical threshold \(\Delta {\chi }_{{\rm{crit}}}^{2}=5.99\), which corresponds to a 5% probability of obtaining such a deviation purely because of random fluctuations under the null hypothesis. Thus, Δχ2 > 5.99 indicates a sterile-neutrino signal at 95% CL.
If Wilks’ theorem were not applicable, the cumulative distribution function (CDF) of the Δχ2 test statistic would need to be computed for multiple combinations of the sterile-neutrino mass and mixing angle \([{m}_{4}^{2},{\sin }^{2}({\theta }_{{\rm{ee}}})]\) using Monte Carlo simulations, which would require several thousand times more computational effort. To ensure the applicability of Wilks’ theorem to the analysis, the CDF of Δχ2 is numerically validated by performing calculations using Monte Carlo simulations for two sets of parameters. In the first case, \([{m}_{4}^{2}=0,{\sin }^{2}({\theta }_{{\rm{ee}}})=0]\) corresponds to the null hypothesis, whereas in the second case \([{m}_{4}^{2}=55.66\,{{\rm{e}}{\rm{V}}}^{2},{\sin }^{2}({\theta }_{{\rm{e}}{\rm{e}}})=0.013]\) represents the best fit for the combined KNM1–KNM5 dataset. The CDFs and \(\Delta {\chi }_{{\rm{crit}}}^{2}\) values obtained through Monte Carlo simulations were compared with those predicted by Wilks’ theorem.
For each of the sterile-neutrino mass and mixing-angle pairs, \({\mathcal{O}}(1{0}^{3})\) tritium β-decay spectra were generated by adding Poissonian fluctuations to the counts calculated using the model described in the section ‘General model’. For each spectrum, the grid-scan method described in the main text is used to find the best-fit point and to compute Δχ2 using equation (6). For the case with \({m}_{4}^{2}=0\,{{\rm{eV}}}^{2}\) and sin2(θee) = 0, the numerically computed or empirical CDF closely follows the analytical CDF of the chi-squared distribution, as shown in Extended Data Fig. 9. Similarly, in the scenario with \({m}_{4}^{2}=55.66\,{{\rm{eV}}}^{2}\) and sin2(θee) = 0.013, the empirical CDF also aligns closely with the analytical CDF. In both cases, the observed threshold values at the 95% CL are consistent with the theoretical expectation of Δχ2 = 5.99. Specifically, for \({m}_{4}^{2}=0\,{{\rm{eV}}}^{2}\) and sin2(θee) = 0, the critical Δχ2 at 95% CL is 6.07 ± 0.17, and for \({m}_{4}^{2}=55.66\,{{\rm{eV}}}^{2}\) and sin2(θee) = 0.013, it is 6.07 ± 0.20. The uncertainties on the threshold values are calculated using the bootstrapping method. The combination of the CDF and the threshold values confirms the applicability of Wilks’ theorem to the sterile-neutrino analysis described in the main text.
Results compared with expected sensitivity
Figure 3 shows that the sensitivity contour, derived from simulations, intersects with the exclusion contour, obtained from the KNM1–KNM5 experimental data. For \(\Delta {m}_{41}^{2} < 30\,{{\rm{e}}{\rm{V}}}^{2}\), the exclusion contour extends beyond the sensitivity contour, whereas at higher \(\Delta {m}_{41}^{2}\) values, the exclusion contour oscillates around the sensitivity contour.
From the methodology outlined in the previous section and the description of \({\mathcal{O}}(1{0}^{3})\) fluctuated Asimov datasets generated under the null hypothesis in the context of Wilks’ theorem, grid scans were performed to compute 95% CL contours. These scans distinguish between open contours (where Δχ2 < 5.99) and closed contours in which Δχ2 ≥ 5.99 and the best-fit value would be distinguishable from the null hypothesis. The statistical band, encompassing the sensitivity contour obtained from Asimov simulations along with its 1σ and 2σ fluctuations, was subsequently reconstructed. As shown in Extended Data Fig. 10, the KNM1–KNM5 exclusion contour, derived from experimental data, falls within the 95% confidence region of the fluctuated simulated open contours.
This test confirms that the observed intersection between the sensitivity and exclusion contours is consistent with expected statistical fluctuations in the experimental data.
Analysis with a free neutrino mass
In the main text, a hierarchical scenario is assumed with m4 ≫ mν. Under this assumption, for the sterile-neutrino analysis, \({m}_{{\rm{\nu }}}^{2}\) is set to 0 eV2. However, for a small sterile-neutrino mass and large mixing, the active and sterile branches can become degenerate. A strong negative correlation between the active and sterile-neutrino mass for small \({m}_{4}^{2}\le 30\,{{\rm{eV}}}^{2}\) was reported in Sec. VI C of ref. 32. Hence, for small sterile-neutrino masses, it is interesting to perform the analysis considering the active-neutrino mass as another free parameter.
In Extended Data Fig. 11a, the 95% CL exclusion contours for the combined KNM1–KNM5 dataset are presented with statistical and systematic uncertainties incorporated. Two cases are analysed: the hierarchical scenario with the active-neutrino mass fixed to 0 eV2 (scenario I, solid blue) and the scenario treating the squared active-neutrino mass \({m}_{{\rm{\nu }}}^{2}\) as an unconstrained nuisance parameter (scenario II, dashed orange). The overlaid colour map indicates the best-fit values of \({m}_{{\rm{\nu }}}^{2}\) across the parameter space in scenario II. For sterile-neutrino masses below 40 eV2, partial degeneracy between the active- and sterile-neutrino branches weakens the exclusion bounds in scenario II relative to scenario I, with negative best-fit \({m}_{{\rm{\nu }}}^{2}\) values highlighting the negative correlation. For larger sterile-neutrino masses, the exclusion contours of scenario II converge with those of scenario I as the correlation between the active- and sterile-neutrino masses decreases.
Apart from scenarios I and II, two additional scenarios were considered with specific restrictions on the squared active-neutrino mass. In scenario III, \({m}_{{\rm{\nu }}}^{2}\) is treated as a free parameter but penalized by a ±1 eV2 pull term centred around 0 eV2. In scenario IV, \({m}_{{\rm{\nu }}}^{2}\) is constrained by \(0\le {m}_{{\rm{\nu }}}^{2} < {m}_{4}^{2}\), known as the technical constraint, first proposed by ref. 45. In Extended Data Fig. 11b, the 95% CL exclusion contours for the combined KNM1–KNM5 dataset are presented, incorporating only the statistical uncertainties.
The technical constraint in scenario IV (yellow dashed line) is of interest as its contour follows that of scenario I for lower squared sterile-neutrino masses (<40 eV2) and that of scenario II for higher masses. Similar to Extended Data Fig. 11a, for only statistical uncertainties, the best-fit squared active-neutrino mass in scenario II is negative for sterile masses below 40 eV2. As the squared active-neutrino mass is restricted to positive values in scenario IV, the resulting best-fit value is 0 eV2, causing the contour of scenario IV to align with that of scenario I for low sterile-neutrino masses.
KATRIN final sensitivity forecast
A projected final sensitivity for the KATRIN experiment at 95% CL is estimated using a net measurement time of 1,000 days, following the same prescriptions as in ref. 32. The background rate is expected to be 130 mcps for 117 active pixels, and the systematic uncertainties are based on the design configuration2. The primary update from the design configuration is the background rate, reflecting the current value. Moreover, the statistical variation is calculated using \({\mathcal{O}}(1{0}^{3})\) randomized tritium β-decay spectra with counts fluctuating according to a Poisson distribution. Sensitivity contours are calculated for each random dataset, and the 1σ and 2σ allowed regions that define the statistical fluctuations of the projected dataset are identified. A comparison with the exclusion contour from KATRIN for the first five measurement campaigns, see Extended Data Fig. 12, shows that for sterile masses \(\Delta {m}_{41}^{2} < 2\,{{\rm{eV}}}^{2}\) the final sensitivity contour is surpassed but not beyond the 1σ statistical uncertainty band. For \(\Delta {m}_{41}^{2} > 2\,{{\rm{eV}}}^{2}\), the sensitivity projection highlights the potential of KATRIN to probe the unexplored sterile-neutrino parameter space and to reach sensitivities of sin2(2θee) < 0.01. Given the observed difference between our current exclusion and the median sensitivity, future data may also yield less stringent limits due to statistical fluctuations. This is accounted for in our sensitivity projections, as shown by the 1σ and 2σ bands in Extended Data Fig. 12, which illustrate the expected range of statistical variation around the median.
The Taishan Antineutrino Observatory (TAO)70, a satellite experiment of the Jiangmen Underground Neutrino Observatory (JUNO)71, is primarily designed to precisely measure reactor antineutrino spectra but also sensitive to light sterile neutrinos, particularly at low values of \(\Delta {m}_{41}^{2}\). The Precision Reactor Oscillation and Spectrum Experiment II (PROSPECT-II)72, an upgraded version of the original PROSPECT detector73, aims to enhance sensitivity to eV-scale sterile-neutrino oscillations. Together with the KATRIN experiment, these projects enable the exploration of sterile-neutrino mixing across a wide range of mass-splitting scales, allowing the probing of sin2(2θee) ≲ 0.03 for \(\Delta {m}_{41}^{2}\lesssim \mathrm{1,000}\,{{\rm{eV}}}^{2}\), as shown in Extended Data Fig. 12.
Data availability
The data and analysis inputs, as well as the results, are available at Zenodo74 (https://doi.org/10.5281/zenodo.17205925).
References
Mention, G. et al. The reactor antineutrino anomaly. Phys. Rev. D 83, 073006 (2011).
Angrik, J. et al. KATRIN Design Report 2004. FZKA Scientific Report No. 7090 (Forschungszentrum Karlsruhe, 2005).
Particle Data Group. Review of particle physics. Prog. Theor. Exp. Phys. 2023, 083C01 (2023).
Acero, M. A. et al. White paper on light sterile neutrino searches and related phenomenology. J. Phys. G Nucl. Part. Phys. 51, 120501 (2024).
Elliott, S., Gavrin, V. & Haxton, W. The gallium anomaly. Prog. Particle Nucl. Phys. 134, 104082 (2024).
GALLEX Collaboration. Final results of the 51Cr neutrino source experiments in GALLEX. Phys. Lett. B 420, 114–126 (1998).
Abdurashitov, J. N. et al. Measurement of the response of a gallium metal solar neutrino experiment to neutrinos from a 51Cr source. Phys. Rev. C 59, 2246–2263 (1999).
Abdurashitov, J. N. et al. Measurement of the response of a Ga solar neutrino experiment to neutrinos from a 37Ar source. Phys. Rev. C 73, 045805 (2006).
Barinov, V. V. et al. Results from the Baksan Experiment on Sterile Transitions (BEST). Phys. Rev. Lett. 128, 232501 (2022).
Barinov, V. V. et al. Search for electron-neutrino transitions to sterile states in the BEST experiment. Phys. Rev. C 105, 065502 (2022).
Machikhiliyan, I. The DANSS neutrino spectrometer: the results of reactor antineutrino studies. Phys. Part. Nucl. 53, 546–551 (2022).
Andriamirado, M. et al. Final search for short-baseline neutrino oscillations with the PROSPECT-I detector at HFIR. Phys. Rev. Lett. 134, 151802 (2025).
The STEREO Collaboration. STEREO neutrino spectrum of 235U fission rejects sterile neutrino hypothesis. Nature 613, 257–261 (2023).
Serebrov, A., Fomin, A. & Samoilov, R. The experiment Neutrino-4 on the search for sterile neutrino at SM-3 reactor. Moscow Univ. Phys. Bull. 77, 401–406 (2022).
Giunti, C., Li, Y. F., Ternes, C. A. & Xin, Z. Reactor antineutrino anomaly in light of recent flux model refinements. Phys. Lett. B 829, 137054 (2022).
Aguilar, A. et al. Evidence for neutrino oscillations from the observation of \({\bar{\nu }}_{e}\) appearance in a \({\bar{\nu }}_{\mu }\) beam. Phys. Rev. D 64, 112007 (2001).
Armbruster, B. et al. Upper limits for neutrino oscillations muon-anti-neutrino \({\bar{\nu }}_{\mu }\) → \({\bar{\nu }}_{e}\) electron-anti-neutrino from muon decay at rest. Phys. Rev. D 65, 112001 (2002).
Aguilar-Arevalo, A. A. et al. A search for electron neutrino appearance at the Δm2 ~ 1 eV2 scale. Phys. Rev. Lett. 98, 231801 (2007).
Aguilar-Arevalo, A. A. et al. Unexplained excess of electron-like events from a 1-GeV neutrino beam. Phys. Rev. Lett. 102, 101802 (2009).
Aguilar-Arevalo, A. A. et al. Updated MiniBooNE neutrino oscillation results with increased data and new background studies. Phys. Rev. D 103, 052002 (2021).
Hollenberg, S., Micu, O. & Pas, H. Neutrino-antineutrino oscillations as a possible solution for the LSND and MiniBooNE anomalies? Phys. Rev. D 80, 053010 (2009).
Abrahão, T. et al. Search for signatures of sterile neutrinos with Double Chooz. Eur. Phys. J. C 81, 775 (2021).
An, F. P. et al. Improved search for a light sterile neutrino with the full configuration of the Daya Bay experiment. Phys. Rev. Lett. 117, 151802 (2016).
Acero, M. A. et al. Search for active-sterile antineutrino mixing using neutral-current interactions with the NOvA experiment. Phys. Rev. Lett. 127, 201801 (2021).
Adamson, P. et al. Search for sterile neutrinos in MINOS and MINOS+ using a two-detector fit. Phys. Rev. Lett. 122, 091803 (2019).
Abe, K. et al. Search for light sterile neutrinos with the T2K far detector Super-Kamiokande at a baseline of 295 km. Phys. Rev. D 99, 071103 (2019).
Abbasi, R. et al. Search for a light sterile neutrino with 7.5 years of IceCube DeepCore data. Phys. Rev. D 110, 072007 (2024).
Giunti, C., Li, Y. F., Ternes, C. A., Tyagi, O. & Xin, Z. Gallium anomaly: critical view from the global picture of νe and \({\overline{\nu }}_{e}\) disappearance. J. High Energy Phys. 10, 164 (2022).
Giunti, C. & Ternes, C. A. Confronting solutions of the gallium anomaly with reactor rate data. Phys. Lett. B 849, 138436 (2024).
The KATRIN Collaboration. The design, construction, and commissioning of the KATRIN experiment. J. Instrum. 16, T08015 (2021).
The KATRIN Collaboration. Direct neutrino-mass measurement based on 259 days of KATRIN data. Science 388, 180–185 (2025).
Aker, M. et al. Improved eV-scale sterile-neutrino constraints from the second KATRIN measurement campaign. Phys. Rev. D 105, 072004 (2022).
Aker, M. et al. Bound on 3 + 1 active-sterile neutrino mixing from the first four-week science run of KATRIN. Phys. Rev. Lett. 126, 091803 (2021).
Saenz, A., Jonsell, S. & Froelich, P. Improved molecular final-state distribution of het+ for the β-decay process of T2. Phys. Rev. Lett. 84, 242–245 (2000).
Doss, N., Tennyson, J., Saenz, A. & Jonsell, S. Molecular effects in investigations of tritium molecule β decay endpoint experiments. Phys. Rev. C 73, 025502 (2006).
Schneidewind, S., Schürmann, J., Lokhov, A., Weinheimer, C. & Saenz, A. Improved treatment of the T2 molecular final-states uncertainties for the KATRIN neutrino-mass measurement. Eur. Phys. J. C 84, 494 (2024).
The KATRIN Collaboration. Direct neutrino-mass measurement with sub-electronvolt sensitivity. Nat. Phys. 18, 160–166 (2022).
Lobashev, V. M. & Spivak, P. E. A method for measuring the electron antineutrino rest mass. Nucl. Instrum. Meth. A 240, 305–310 (1985).
Picard, A. et al. A solenoid retarding spectrometer with high resolution and transmission for kev electrons. Nucl. Instrum. Meth. B 63, 345–358 (1992).
Amsbaugh, J. F. et al. Focal-plane detector system for the KATRIN experiment. Nucl. Instrum. Meth. A 778, 40–60 (2015).
Aker, M. et al. Suppression of Penning discharges between the KATRIN spectrometers. Eur. Phys. J. C 80, 821 (2020).
Lokhov, A. et al. Background reduction at the KATRIN experiment by the shifted analysing plane configuration. Eur. Phys. J. C 82, 258 (2022).
Aker, M. et al. In situ tritium decontamination of the KATRIN rear wall using an ultraviolet/ozone treatment. Fusion Sci. Technol. 80, 303–310 (2024).
Wilks, S. S. The large-sample distribution of the likelihood ratio for testing composite hypotheses. The Ann. Math. Stat. 9, 60–62 (1938).
Giunti, C., Li, Y. F. & Zhang, Y. Y. KATRIN bound on 3 + 1 active-sterile neutrino mixing and the reactor antineutrino anomaly. J. High Energy Phys. 05, 61 (2020).
Siegmann, D. et al. Development of a silicon drift detector array to search for keV-scale sterile neutrinos with the KATRIN experiment. J. Phys. G 51, 085202 (2024).
Mertens, S. et al. Sensitivity of next-generation tritium beta-decay experiments for keV-scale sterile neutrinos. J. Cosmol. Astropart. Phys. 02, 020 (2015).
Aker, M. et al. Quantitative long-term monitoring of the circulating gases in the KATRIN experiment using Raman spectroscopy. Sensors 20, 4827 (2020).
Priester, F., Sturm, M. & Bornschein, B. Commissioning and detailed results of KATRIN inner loop tritium processing system at Tritium Laboratory Karlsruhe. Vacuum 116, 42–47 (2015).
Hillesheimer, D. et al. Four years of tritium operation of the KATRIN experiment at TLK. Fusion Sci. Technol. 80, 465–471 (2024).
Arenz, M. et al. The KATRIN superconducting magnets: overview and first performance results. J. Instrum. 13, T08005 (2018).
Marsteller, A. et al. Neutral tritium gas reduction in the KATRIN differential pumping sections. Vacuum 184, 109979 (2021).
Röttele, C. et al. Characterization of the KATRIN cryogenic pumping section. Vacuum 208, 111699 (2023).
Aker, M. et al. Analysis methods for the first KATRIN neutrino-mass measurement. Phys. Rev. D 104, 012005 (2021).
Altenmüller, K. et al. Muon-induced background in the KATRIN main spectrometer. Astropart. Phys. 108, 40–49 (2019).
Altenmüller, K. et al. Gamma-induced background in the KATRIN main spectrometer. Eur. Phys. J. C 79, 807 (2019).
Görhardt, S. et al. Impact of a cryogenic baffle system on the suppression of radon-induced background in the KATRIN Pre-Spectrometer. J. Instrum. 13, T10004 (2018).
Fränkle, F. et al. KATRIN background due to surface radioimpurities. Astropart. Phys. 138, 102686 (2022).
Aker, M. et al. Measurement of the electric potential and the magnetic field in the shifted analysing plane of the KATRIN experiment. Eur. Phys. J. C 84, 1258 (2024).
Marsteller, A. et al. Operation modes of the KATRIN experiment Tritium Loop System using 83mKr. J. Instrum. 17, P12010 (2022).
KATRIN Collaboration. Measurement of the inhomogeneity of the KATRIN tritium source electric potential by high-resolution spectroscopy of conversion electrons from 83mKr. Eur. Phys. J. C 85, 757 (2025).
Kleesiek, M. et al. β-Decay spectrum, response function and statistical model for neutrino mass measurements with the KATRIN experiment. Eur. Phys. J. C 79, 204 (2019).
Karl, C., Eller, P. & Mertens, S. Fast and precise model calculation for KATRIN using a neural network. Eur. Phys. J. C 82, 439 (2022).
Behrens, J. Design and Commissioning of a Mono-energetic Photoelectron Source and Active Background Reduction by Magnetic Pulse at the KATRIN Experiment. PhD thesis, Westfälische Wilhelms-Universität Münster (2016).
James, F. & Roos, M. Minuit: a system for function minimization and analysis of the parameter errors and correlations. Comput. Phys. Commun. 10, 343–367 (1975).
Schlüter, L. Neutrino-Mass Analysis with sub-eV Sensitivity and Search for Light Sterile Neutrinos with the KATRIN Experiment. PhD thesis, Technische Universität München https://mediatum.ub.tum.de/1661391 (2022).
Cowan, G., Cranmer, K., Gross, E. & Vitells, O. Asymptotic formulae for likelihood-based tests of new physics. Eur. Phys. J. C 71, 1554 (2011).
Rodenbeck, C. et al. Wideband precision stabilization of the -18.6kV retarding voltage for the KATRIN spectrometer. J. Instrum. 17, P06003 (2022).
Fackler, O., Jeziorski, B., Kołos, W., Monkhorst, H. J. & Szalewicz, K. Accurate theoretical β-decay energy spectrum of the tritium molecule and its neutrino mass dependence. Phys. Rev. Lett. 55, 1388–1391 (1985).
Abusleme, A. et al. TAO conceptual design report: a precision measurement of the reactor antineutrino spectrum with sub-percent energy resolution. Preprint at https://arxiv.org/abs/2005.08745 (2020).
Abusleme, A. et al. JUNO physics and detector. Prog. Part. Nucl. Phys. 123, 103927 (2022).
Andriamirado, M. et al. PROSPECT-II physics opportunities. J. Phys. G Nucl. Part. Phys. 49, 070501 (2022).
Ashenfelter, J. et al. The PROSPECT reactor antineutrino experiment. Nucl. Instrum. Meth. A 922, 287–309 (2019).
Köhler, C. et al. Sterile-neutrino search based on 259 days of KATRIN data. Zenodo. https://doi.org/10.5281/zenodo.17205925 (2025).
Acknowledgements
We acknowledge the support of Helmholtz Association (HGF), the Ministry for Education and Research BMBF (05A23PMA, 05A23PX2, 05A23VK2 and 05A23WO6), the doctoral school KSETA at KIT, Helmholtz Initiative and Networking Fund (grant agreement no. W2/W3-118), the Max Planck Research Group (MaxPlanck@TUM), Deutsche Forschungsgemeinschaft DFG (GRK 2149 and SFB-1258), the Excellence Strategy of Germany (EXC 2094-390783311), and the Graduate School Scholarship Programme of Deutscher Akademischer Austauschdienst (DAAD) in Germany; the Ministry of Education, Youth and Sport (CANAM-LM2015056) in the Czech Republic; the Istituto Nazionale di Fisica Nucleare (INFN) in Italy; the National Science, Research and Innovation Fund through the Program Management Unit for Human Resources and Institutional Development, Research and Innovation (grant no. B39G670017) in Thailand; and the Department of Energy through awards DE-FG02-97ER41020, DE-FG02-94ER40818, DE-SC0004036, DE-FG02-97ER41033, DE-FG02-97ER41041, DE-SC0011091 and DE-SC0019304 and the Federal Prime Agreement DE-AC02-05CH11231 in the United States. This project has received funding from the European Research Council (ERC) under the European Union Horizon 2020 research and innovation programme (grant agreement no. 852845). We thank the computing support at the Institute for Astroparticle Physics at Karlsruhe Institute of Technology, Max Planck Computing and Data Facility (MPCDF), and the National Energy Research Scientific Computing Center (NERSC) at the Lawrence Berkeley National Laboratory.
Funding
Open access funding provided by Max Planck Society.
Author information
Authors and Affiliations
Consortia
Contributions
All authors contributed to this publication through various roles, including the design and construction of the experiment, performing theoretical calculations, software development, calibration of subsystems, operation, data acquisition and data analysis. The scientific results were discussed and approved by the full KATRIN Collaboration. The specific analysis presented in this Article, as well as the preparation of the manuscript, was carried out by a subgroup of authors (G. Gagliardi, C. Köhler, T. Lasserre, S. Mohanty and X. Stribl) and subsequently underwent an internal collaboration-wide review. All authors reviewed and approved the final version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature thanks Gianluca Cavoto, Steve Elliott, Patrick Huber, Alexander Leder and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data figures and tables
Extended Data Fig. 1 KATRIN data vs. model, simulated kink, and measurement-time distribution.
a) KATRIN data (KNM5, patch 0) with statistical uncertainties (error bars scaled by a factor of 10 for better visibility) and the best-fit model in the 3ν+1 framework. The model includes contributions from both the active and sterile β-decay branches. b) Ratio of simulated spectra for the 3ν+1 and 3ν frameworks, highlighting the kink-like signature of a sterile neutrino, most prominent near retarding energies qU ≈ E0 − m4. c) Measurement-time distribution across retarding energies, showing the time fraction spent at each retarding energy during the integral spectrum acquisition.
Extended Data Fig. 2 Measured spectra and global fit results across all campaigns.
The KNM3-SAP, KNM4-NOM, KNM4-OPT, and KNM5 datasets are divided into 14 detector patches. The squared mass of the sterile-neutrino state (\({m}_{4}^{2}\)) and its mixing parameter (\({\sin }^{2}({\theta }_{ee})\)) are treated as common fit parameters across all campaigns.
Extended Data Fig. 3 Comparison of exclusion curves from two independent analyses.
Comparison of 95% C.L. exclusion curves from the analysis of the first five KATRIN campaigns using two independent analysis procedures with \({m}_{{\rm{\nu }}}^{2}=0\,{{\rm{eV}}}^{2}\). Excellent agreement is observed regarding the excluded parameter space and the best-fit position.
Extended Data Fig. 4 Sterile Neutrino Exclusion contours at 95% C.L. for individual KNM1–KNM5 campaigns.
The datasets include approximately 2.0 million electrons for KNM1, 4.3 million for KNM2, 2.5 million for KNM3, 10.2 million for KNM4, and 16.7 million for KNM5.
Extended Data Fig. 5 Comparison of exclusion contours from KNM1–5 vs. earlier KNM1–2 result.
Comparison of exclusion contours from the KNM1-5 analysis, based on 36 million electrons in the analysis interval, to the previous KNM1-2 result, which utilized 6 million electrons in the same analysis interval.
Extended Data Fig. 6 Contributions to the uncertainty on \({{\bf{\sin }}}^{{\bf{2}}}{\boldsymbol{(}}{{\boldsymbol{\theta }}}_{{\bf{ee}}}{\boldsymbol{)}}\) from statistics and systematics at 68% C.L.
The figure comprises two panels: Top) Comparison of the combined systematic and statistical uncertainties for three representative values of \({m}_{4}^{2}\): 2 eV2 (blue), 101 eV2 (yellow), and 401 eV2 (green). This illustrates that statistical uncertainty consistently dominates across all \({m}_{4}^{2}\) values. Bottom) Individual systematic effects are displayed, each represented by three coloured bars corresponding to the same \({m}_{4}^{2}\) values as in the top panel. The x-axis quantifies the uncertainty in \({\sin }^{2}({\theta }_{{\rm{ee}}})\) for each systematic effect, demonstrating their relative contributions to the total uncertainty.
Extended Data Fig. 7 Breakdown of uncertainties.
Uncertainty breakdown for the combined dataset with \({m}_{{\rm{\nu }}}^{2}=0\,{{\rm{eV}}}^{2}\). The plot focuses on the region \({m}_{4}^{2} > 1\,{{\rm{eV}}}^{2}\). The 68.3% C.L. sensitivities on \({\sin }^{2}({\theta }_{ee})\) for individual systematic effects and the statistical-only contour are displayed. Notably, all systematic effects are minor compared to the statistical uncertainty. The statistical uncertainty predominates over the combined systematic uncertainties across all \({m}_{4}^{2}\) values.
Extended Data Fig. 8 Impact of FSD systematics on the exclusion contour.
a) 95% C.L. exclusion contours for datasets calculated under different FSD hypotheses. b) The absolute value of the normalized difference at each \({m}_{4}^{2}\) grid point between the 95% C.L. exclusion contours of the nominal FSD and the dedicated FSD datasets used in this analysis. The black solid line represents an Asimov dataset that includes all systematic uncertainties with nominal FSD, serving as a reference. Each contour illustrates the impact of a specific FSD systematic: dotted lines correspond to parameter uncertainties, dashed lines to theoretical approximations, and alternating dot-dashed lines to computational approximations.
Extended Data Fig. 9 Δχ2 distributions from Monte Carlo compared to Wilks’ theorem.
Cumulative Distribution Function (CDF) of Δχ2 for two sets of sterile-neutrino parameters compared to the analytical χ2 distribution with two degrees of freedom (orange line), as expected according to Wilks’ theorem. The null hypothesis simulation (NH), \([{m}_{4}^{2}=0,{\sin }^{2}({\theta }_{{\rm{ee}}})=0]\), is shown in blue, while the best-fit simulation (BF), \([{m}_{4}^{2}=55.66\,{{\rm{e}}{\rm{V}}}^{2},{\sin }^{2}({\theta }_{{\rm{e}}{\rm{e}}})=0.013]\), is shown in green. The corresponding empirical CDFs are calculated using Monte Carlo simulations.
Extended Data Fig. 10 KNM1–5 exclusion vs. sensitivity band, showing consistency with expected fluctuations.
Comparison of the KNM1-5 exclusion contour (data) with the 95% C.L. statistical band from simulated sensitivity contours, showing consistency with the statistical fluctuations.
Extended Data Fig. 11 Effect of different treatments of mν on the exclusion contour.
a) 95% C.L. exclusion contour of KNM1-5 including statistical and systematic uncertainties with active-neutrino mass fixed to 0 eV2 and left unconstrained in the analysis. The colour map describes the best-fit active-neutrino mass values. b) 95% exclusion contour of KNM1-5 including statistical uncertainties and different scenarios for the treatment of the active-neutrino mass.
Extended Data Fig. 12 Projected KATRIN sensitivity with the full 1000-day dataset.
The dashed blue line shows the 95% C.L. sensitivity contour, while the dark and light blue bands indicate the 1- and 2-σ statistical uncertainty. The black line represents the KNM1-5 exclusion result (this work). The figure highlights significant potential improvements in the sterile-neutrino search with the final dataset expected after 2025.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
The KATRIN Collaboration. Sterile-neutrino search based on 259 days of KATRIN data. Nature 648, 70–75 (2025). https://doi.org/10.1038/s41586-025-09739-9
Received:
Accepted:
Published:
Version of record:
Issue date:
DOI: https://doi.org/10.1038/s41586-025-09739-9
This article is cited by
-
Still no sign of hypothetical sterile-neutrino particle
Nature (2025)





