Introduction

The unexpected discovery of very high particle radiation in near-Earth space1 opened up a new field known as space physics. The regions of high energy radiation were named Van Allen Radiation Belts, as an homage to their discoverer, James Van Allen. The theory explaining how these particles are accelerated to ultra-relativistic energies is still a subject of ongoing debate.

Up until the mid-1990s, it was commonly accepted that electrons are accelerated to relativistic energies through inward radial diffusion2,3,4,5. The random radial displacements of particles lead to net diffusive transport from the populated outer regions to the inner region, which is depleted by loss. Diffusive transport of the particles into the inner region, where the Earth’s magnetic field is stronger, results in an acceleration of particles by means of betatron/Fermi acceleration.

The dynamics of the outer radiation belt have been revisited and received a lot of attention over the last two decades. It was suggested that electrons outside of the plasmasphere could be accelerated locally by taking energy from plasma waves in the Very Low Frequency (VLF) range with kHz-scale frequencies6,7. This acceleration mechanism, potentially responsible for driving electrons to relativistic energies in the outer belt, also demonstrated the capacity for swift acceleration in the so-called slot region between the outer and the inner belts8,9,10.

The support for the existence of the local acceleration mechanism stemmed from the observations of peaks in the radial profiles of Phase Space Density (PSD)11,12, which is a density in six-dimensional momentum-configuration space. The peaks indicate that particles accelerated locally as radial diffusion tends to smooth PSD and produce monotonic profiles. However, since the spacecraft is usually not exactly in the equatorial plane, it’s difficult to observe all electrons as some will mirror closer to the equator and will not be seen by the spacecraft. Moreover, magnetic field models have uncertainties, and the results based on the profiles of PSD that used empirical models of the magnetic field models were widely debated13.

To understand the dynamics of the radiation belts on 30 August 2012, NASA launched the Radiation Belts Storm Probes, which were renamed after the successful launch of the Van Allen Probes. The Van Allen Probes spacecraft14 was equipped with suites of instruments covering a very broad range of energies up to previously not well-studied and observed ultra-relativistic energies. An observational study of electrons at these energies15 discovered the ultra-relativistic population16 in the radiation belts that showed morphological structures and behavior very different from the bulk of the radiation belts. Van Allen Probes’ measurements confirmed the presence of peaks in PSD at ultra-relativistic energies, indicating the potential presence of local acceleration in the radiation belts17. However, it remained unclear how much local acceleration and radial diffusion contribute to the acceleration of the radiation belt. The modeling efforts confirming this hypothesis were done at a particular location in space18,19 and ignored the radial diffusion. The modeling of the storm in 201218 attempted to reproduce a real storm but used only one fixed value of the background density and one value of the distance from the Earth. The density was obtained from a statistical model20 and then divided by a constant factor of 3.5. This value was then assumed constant for the entire 16-hour-long simulation, which is unrealistic. That study concluded that local acceleration was the dominant process responsible for electron acceleration during the 9 October 2012 storm but made an assumption of constant density and ignored the radial diffusion.

The predominance of local acceleration has been questioned in several follow-up studies. It remained unclear if the assumption of constant density is realistic and if the measured density represented a local density at the spacecraft location at a particular time or a global density in the region of the radiation belts. Since the modeling was done at only one radial distance of 5 Earth radii, it remained questionable if the observations would be reproduced at other radial distances. Several studies21,22,23,24 argued that a two-step process may be responsible for the acceleration of electrons to ultra-relativistic energies, and the radial diffusion needs to act together with the local acceleration to produce such high energies.

In this study, we present global simulations of the Earth’s radiation belts for the April 2017 storm. The 3D modeling demonstrates that the local acceleration to such high energies is not a universal process but occurs only when the background plasma density is extremely low. Our results demonstrate that cold particles, which are over a million times less energetic than multi-MeV (>2 MeV) and ultra-relativistic (>~4 MeV) electrons, fully control the acceleration of the radiation belt electrons to such high energies. The acceleration of particles to ultra-relativistic energies in the presence of low plasma density is the most vivid example of how collective phenomena provide the influence of low-energy particles on the high-energy tail in collisionless plasmas.

Results and Discussion

Simulations of the 20 April 2017 storm

We chose the 20 April 2017 storm for which electron measurements of Van Allen Probes, GOES, and Arase are available. This is a relatively moderate storm that shows one of the strongest accelerations to the ultra-relativistic energies during the Van Allen Probes era.

To simulate this storm, we solve the modified 3D Fokker-Planck equation accounting for radial transport, local acceleration, loss to the atmosphere, and mixed diffusion of phase space density (here denoted as f):

$$\frac{{\partial}{{{\boldsymbol{f}}}}}{{\partial}{{{\boldsymbol{t}}}}} = {{{{\boldsymbol{L}}}}}^{* {{{\bf{2}}}}}{\frac{{\partial}}{{\partial}{{{{\boldsymbol{L}}}}}^{* }}\bigg|}_{{{{\boldsymbol{V}}}},{{{\boldsymbol{K}}}}}\left({{{{\boldsymbol{D}}}}}_{{{{{\boldsymbol{L}}}}}^{* }{{{{\boldsymbol{L}}}}}^{* }}{{{{\boldsymbol{L}}}}}^{* -{{{\bf{2}}}}}{\frac{{\partial}{{{\boldsymbol{f}}}}}{{\partial}{{{{\boldsymbol{L}}}}}^{* }}\bigg|}_{{{{\boldsymbol{V}}}},{{{\boldsymbol{K}}}}}\right)+{\frac{{{{\bf{1}}}}}{{{{\boldsymbol{G}}}}\,}\frac{{\partial}}{{\partial}{{{\boldsymbol{V}}}}}\bigg|}_{{{{\boldsymbol{K}}}},{{{{\boldsymbol{L}}}}}^{* }}{{{\boldsymbol{G}}}}\,\left({{{{\boldsymbol{D}}}}}_{{{{\boldsymbol{VV}}}}}{\frac{{\partial}{{{\boldsymbol{f}}}}}{{\partial}{{{\boldsymbol{V}}}}}\bigg|}_{{{{\boldsymbol{K}}}},{{{{\boldsymbol{L}}}}}^{* }} + {{{{\boldsymbol{D}}}}}_{{{{\boldsymbol{VK}}}}}{\frac{{\partial}{{{\boldsymbol{f}}}}}{{\partial}{{{\boldsymbol{K}}}}}\bigg|}_{{{{\boldsymbol{V}}}},{{{{\boldsymbol{L}}}}}^{* }}\right) \\ + \frac{{{{\bf{1}}}}}{{{{\boldsymbol{G}}}}}{\frac{{\partial}}{{\partial}{{{\boldsymbol{K}}}}}\bigg|}_{{{{\boldsymbol{V}}}},{{{{\boldsymbol{L}}}}}^{* }}{{{\boldsymbol{G}}}}\left({{{{\boldsymbol{D}}}}}_{{{{\boldsymbol{KK}}}}}{\frac{{\partial}{{{\boldsymbol{f}}}}}{{\partial}{{{\boldsymbol{K}}}}}\bigg|}_{{{{\boldsymbol{V}}}},{{{{\boldsymbol{L}}}}}^{* }}+{{{{\boldsymbol{D}}}}}_{{{{\boldsymbol{KV}}}}}{\frac{{\partial}{{{\boldsymbol{f}}}}}{{\partial}{{{\boldsymbol{V}}}}}\bigg|}_{{{{\boldsymbol{K}}}},{{{{\boldsymbol{L}}}}}^{* }}\right)-\frac{{{{\boldsymbol{f}}}}}{{{{\boldsymbol{\tau }}}}}$$
(1)

where L*, is a radial distance in magnetic coordinates inferred from the magnetic flux through the drifting around the Earth orbit of particles and measured in Earth radii, V and K are modified adiabatic invariants25 describing the momentum of particles and the direction of the velocity in the equatorial plane respectively. DL*L*, DVV, DKV, and DKK are the radial, V, mixed, and K diffusion coefficients, and G is the Jacobian of the transformation from adiabatic invariants to variables used in (1). The lifetime parameter τ accounts for losses of particles at small pitch angles due to collisions with atmospheric neutrals, and for loss at large radial distances outside the magnetopause.

The bounce-average diffusion coefficients due to the interactions with different waves are computed using the quasi-linear theory26,27. To account for the variable density, we calculate diffusion coefficients for various values of density (see Supplementary Fig. 1). We have inferred the evolution of cold plasma density from the EMFISIS instrument on Van Allen Probes28. The evolution of the ratio of the observed density to the value of the statistical model20 is shown in Fig. 1. The method of inferring density from data is discussed in the methods section and illustrated on Supplementary Fig. 4. An alternative method using data assimilation is also described in the methods section and the results are illustrated in the Supplementary Fig. 5. First of all, it should be noticed that the density is highly dynamic and changes over the course of the storm. The values of density as shown in Fig. 1 are often significantly below the average values by a factor varying from 2 to 5 as compared to the statistical data models at L* > 5. Such persistent and low values clearly show that the decrease in density during this storm is not local but global and persists for several days.

Fig. 1: Cold plasma density observed by Van Allen Probes and comparison to a statistical density model.
figure 1

a Cold plasma density observed by RBSP-A spacecraft and values inferred from a statistical model20. b Same as Panel a but for RBSP-B spacecraft. c Derived smoothed ratios of the statistical model and observed density is used for modeling outside of the plasmasphere. The observed values of density for this storm are clearly below the statistical values.

Previous studies29,30 have shown that the decrease in density can lead to an increased diffusion. As discussed in Supplementary Note 1 and illustrated in the Supplementary Fig. 1. the decrease in density results in a shift of resonances to higher energy and in the intensification of the scattering rates. With increasing energy, the scattering of relativistic electrons becomes negligible above ~2 -MeV (see Supplementary Fig. 1). So, the decrease in density produces an increase in local acceleration for multi-MeV and ultra-relativistic electrons at all values of pitch angles without increasing loss, which remains negligible even in the case of low density. While the energy diffusion is relatively slow, the lack of scattering near the loss cone ensures that particles are not lost to the atmosphere and provides gradual hardening of the spectrum. The shift in resonances to higher energies, and increased efficiency of scattering allows for the acceleration of electrons to ultra-relativistic energies when the measured density is used. The loss to the atmosphere is controlled by the scattering rates near the edge of the loss cone, which remain very slow even when the density is low, and hence the decrease in density does not significantly change the loss rates.

Figure 2 shows a comparison of the simulations with statistical plasma density values20, and plasma density inferred from observations. Two more energies of 2.6 and 7.7 MeV are presented in the Supplementary Fig. 2. While both simulations can approximately reproduce the dynamics of the belts, at MeV energies, only simulations with observed density evolution produce acceleration to ultra-relativistic energies. The evolution contour plot at 4.2 MeV clearly shows that in the case of constant density, there are practically no ultra-relativistic electrons, while for the simulation when the measured density is used, the modeled evolution reproduces the observed enhancement.

Fig. 2: Observations and simulation results of relativistic and ultra-relativistic electrons during the April 2017 geomagnetic storm.
figure 2

a RBSP-A&B MagEIS observations of electron flux at 0.9 MeV and 54° equatorial pitch-angle. b Simulated electron flux at 0.9 MeV and 54° equatorial pitch-angle using diffusion coefficients calculated for the empirical density model. c Simulated electron flux at 0.9 MeV and 54° equatorial pitch-angle using diffusion coefficients calculated using density measured by RBSP-A&B. d Kp index derived from ground observations and showing the level of geomagnetic activity throughout the storm. eh, Same as (ad) but for 4.2 MeV electron flux. Observations of 4.2 MeV electrons are provided by the REPT instrument.

Supplementary Note 2 also presents the simulations with density obtained from data-assimilative global simulations of plasma density, which show very similar results to the results shown in the main manuscript (see Supplementary Fig. 6).

As most of the previous 3D studies focused on the approximately 1 MeV populations, it is clear why the effect of density was not noticed in the past. The local acceleration to ultra-relativistic energies occurs only when density is low while acceleration up to 1 MeV can occur by means of a combination of radial diffusion and local acceleration at both times when density values are close to statistical values. The effects of density are not pronounced at MeV energies.

While inside the peak of PSD (L* ~ 5.5), particles at ultra-relativistic energies will be diffusing inwards and further accelerated by the radial diffusion, it’s clear that the transport by 1-2 RE will not be the dominant mechanism of acceleration from MeV up to 8 MeV and can only further increase the energy of electrons by up to 1–2 MeV (see Supplementary Fig. 7). It should be also noted that outside of the peak, radial diffusion will produce a similar effect and will work as a deceleration mechanism10,31.

Similarly, to Figs. 23 shows line plots at various energies. The model results in the heart of the radiation belts clearly show that variable density decreases fluxes at 0.9 MeV and dramatically increases at multi-MeV and ultra-relativistic energies. The simulation with a statistical model of density underestimates observations at 2.6 MeV by a factor of ten and at 7.7 MeV by approximately a factor of thousand, while the simulation with observed density accurately matches the observed enhancements at a wide range of energies.

Fig. 3: Observations and simulation results at L* = 4.5 and 54° pitch-angle for different energies during the April 2017 geomagnetic storm.
figure 3

The observations of 0.9 MeV electrons are provided by the MagEIS instrument, while the REPT instrument measures 2.6 MeV and 7.7 MeV electrons. a 0.9 MeV. c 2.6 MeV. e 7.7 MeV. b, d, f Kp index derived from ground observations and showing the level of geomagnetic activity throughout the storm.

In general, simulations with observed density variations can reproduce observations with high accuracy. Some points are missing at the highest energy channel of 7.7 MeV as REPT can measure only high counts of these particles and low counts fall below the threshold levels. Small discrepancies during the main phase of the storm could be due to adiabatic changes that cannot be accurately accounted for by the used empirical field model, which is known to be inaccurate during the main phase of the storm or potentially missing EMIC-induced loss32,33 that can produce loss of electrons at these energies.

In this study, we use the time-dependent evolution of plasma density and show that plasma density controls the acceleration to ultra-relativistic electrons. This study demonstrates that a depleted plasma density is a necessary condition for such acceleration. Our simulations with realistic density controlling local acceleration reproduce the dynamics at multi-MeV and ultra-relativistic energies globally at all radial distances, showing that such energies are reached by means of local acceleration. Our simulation provides excellent agreement with observations and, unlike previous attempts to understand acceleration to such high energies, includes both radial diffusion, which is important at MeV energy, and local acceleration which starts dominating the acceleration as the energy increases. This study is also consistent with the statistical study19, which presented the correlation between the density and acceleration to high energy, and the observational study34.

Previous attempts to quantify the role of local acceleration for ultra-relativistic electrons have not been done in 3D or have not compared the dynamics with realistic and statistical background density evolution. Radial diffusion is clearly essential for accelerating particles to MeV energies. The evidence for that is provided by the radial diffusion simulations, which can approximately reproduce the dynamics of relativistic electron belts5,10,35,36. The presence of the MeV is essential for the acceleration to multi-MeV and above as these particles are the seed population for the local acceleration to multi-MeV. Radial diffusion also contributes to the redistribution of fluxes and can transport particles to lower L-shells from the center of acceleration. However, the acceleration associated with the transport by one or even several Earth radii can provide only up to 2 MeV (see Supplementary Note 3 and Supplementary Fig. 7), which is very small as compared to the observed acceleration of approximately MeV to 7.7 MeV. So, the majority of this acceleration is produced locally by taking energy from the VLF waves.

The non-linear scattering has been shown by a number of theoretical studies to play a most important role in the evolution of the radiation belts37,38. However, quasi-linear simulations have shown to be able to provide a surprising accuracy at all radial locations for MeV16 and now multi-MeV and ultra-relativistic energies in this study. That is also consistent with the recent results39.

The results presented here provide an example of extreme efficiency of the local acceleration if the conditions of background plasma facilitate such acceleration. Superstorms and mega-storms may significantly deplete the background plasma density even further, while waves may be even stronger. Such an event may result in acceleration to even higher energies than considered in this study. Very similar acceleration to ultra-relativistic energies is likely to occur in the magnetospheres of giant planets, exoplanets, and various astrophysical magnetized objects.

Methods

Illustration of the effects of density on the pitch angle and energy scattering

The change in density descents is an increase in resonant energy. The increase in resonant energy is clearly seen in Supplementary Fig 1. The background density is assumed to have a radial and Magnetic Local Time (MLT) profile according to a statistical model20 but divided by a variable factor according to observations. The appropriate pre-calculated diffusion coefficient is then at a particular time according to the observed density. It should be noted that the wave statistics used for calculating the diffusion coefficients do not change with different cold density levels. The development of such wave models will be the subject of future research but will be a challenging task. Current wave models already suffer from a lack of observations during storm conditions. Observations as a function of latitude for storms exhibiting density depletions will be even scarcer, requiring the integration of data from all past missions and, potentially, new measurements.

Numerical approach for the solution of the Fokker-Planck equation

The VERB code40,41 has been validated against years of Van Allen Probe observations16,42,43, a 40-day interval of CRRES41, individual storms44,45, and superstorms46. VERB solves the Fokker-Planck equation (Eq. 1) in 3 dimensions, describing the evolution of electron phase space density in terms of radial distance from Earth, and the parallel and perpendicular momentum of particles.

The mixed diffusion terms are accounted for using the fully implicit method39. To compute radial transport, we tested various radial diffusion rates47,48,49 (see Supplementary Fig. 8). We have tested the sensitivity of the code to different parameterizations and found that the radial diffusion empirical coefficients that include a dependence on the first adiabatic invariant49 give the best results at ultra-relativistic energies. Figure S8 illustrates that the main conclusions of the study do not depend on the assumed model of radial diffusion coeffects and are true for all the considered models. The models that show no dependence on energy, show a bit less realistic results at the low inner boundary of fluxes at multi-MeV, and realistic radiation belts at MeV. That indicates that a realistic parameterization of radial diffusion should have weaker diffusion rates at multi-MeV and ultra-relativistic energies49. Additional evidence for that comes from the observations of remnant belts or “storage ring”15 that persist for up to a month in the belts.

The Fokker-Planck equation cannot be solved without providing proper initial and boundary conditions. The initial condition is built from one trajectory of the RBSP-A satellite, starting around 20 April, 7:00 p.m. UTC, and ending around 20 April, 11:30 p.m. UTC. To describe the initial condition on the whole simulation grid, we extrapolate data in pitch-angle assuming a sinusoidal distribution, and extrapolate the data beyond the satellite’s apogee by propagating the flux measured at the apogee to higher L* assuming that it stays constant.

The boundary conditions have to be provided for V, K, and L*. The lower V boundary is taken from the initial condition and is assumed to stay constant throughout the storm. The upper V boundary, describing electrons above 10 MeV, is set to constant zero, describing the absence of particles at such high energies. Both boundary conditions in K are set up to be zero derivatives, describing a symmetric pitch-angle distribution around 90° and allowing a filled loss cone at 0°.

At the lower L* boundary at L* = 1, we set up a constant value of zero, while the upper L* boundary at L* = 6.6 is built from observations of two Geostationary Operational Environmental Satellites (GOES-13 and -15). The particle observations are converted to phase space density as a function of V and K using the T04s magnetic field model. This allows us to use the GOES observations as a boundary at L* = 6.6, even though they are not flying at exactly L* = 6.6 throughout the storm. This method assumes that particles are transported adiabatically between the satellites’ location and L* = 6.6. Furthermore, we convert from integral to differential flux, assuming that the flux has an exponential dependence on energy. We set the flux at energies above 2 MeV to zero, assuming no incoming particles from the plasma sheet at such high energies.

Another parameter that has to be specified in the simulations is the location of the magnetopause, which can cause strong loss during geomagnetic storms10,45. The observations at 0.9 MeV indicate magnetopause shadowing during the early phase of the storm (see Supplementary Fig. 3). In this study, we assume that the L* of the magnetopause is taken from statistical models50, and then we subtract an empirical factor of 2.5 to account for the fact that the model predicts the radial distance and L* will be lower. The magnetopause is assumed to be compressed up to 22 April, 08:00 a.m. UTC, after that the magnetopause is assumed to be at L* = 10 corresponding to no magnetopause shadowing. It is also possible that EMIC waves may contribute to the loss of electrons at higher L-shells; however, since the focus of this study is acceleration, the exact details of the loss mechanism during the main phase of the storm are not a focus of this investigation.

The diffusion coefficients in V and K are pre-calculated using the Full Diffusion Code26 for different density levels. We use the chorus wave model developed from Van Allen Probes observations51 (see Supplementary Note 4). In the past, diffusion coefficients were calculated assuming the statistical Sheeley model to determine the cold plasma density. In this work, we also calculate the diffusion coefficients for all L* and MLT for plasma conditions, by assuming the Sheeley model divided by a constant factor (2,3,4,5). Based on the density measured by the satellites or predicted by the physics-based model, we decide which diffusion coefficient to use at each time point, L* and MLT. The MLT-dependent diffusion coefficients are averaged in MLT to give the final input into the VERB model.

Method of density calculation

We use the level 4 data product provided by the EMFISIS team, containing the cold plasma density derived by identifying the upper hybrid frequency or through the spacecraft potential28. The data is binned by 5 min to smooth outliers before the ratio to the MLT-resolved empirical plasma trough model20 is calculated for each point in time. Measurements inside the plasmasphere and plumes are identified by the criterion used by Sheeley20 and neglected. The derived ratios still show large variability, representing small-scale features or plasmaspheric plumes in the plasma trough (see Supplementary Fig. 4b,d). To attempt to reconstruct the average global picture of the density depletion, we calculate a moving average of the ratios using windows of 160 minutes and take the mean of the ratios derived from RBSP-A and -B (see Supplementary Fig. 4e). We tested different window sizes and this value gave us the most satisfying results considering the acceleration over all energies. The smoothed ratios are then used to decide which diffusion coefficients to use at each time point by choosing the nearest integer ratio level (see Supplementary Fig. 4f). It is, in general, possible that by averaging we underestimate the depletions and consequently the local acceleration. This potential additional local acceleration6 may be compensated by the EMIC wave scattering and loss, which can operate at these energies. However, including such events and studying this possibility would require a global and very accurate knowledge of the spatial distribution of density and EMIC waves, which is currently not available. While such effects can potentially quantitatively alter the results, they would not affect the main conclusions of the study, stating that acceleration to such high energies is not possible unless the density is globally decreased below statistical values.