Introduction

Magnetic tunnel junctions (MTJs) are the most popular spintronics devices used as core elements for magnetic random access memory (MRAM)1 or sensors for sensitively detecting magnetic fields2. MTJs with a crystallized MgO barrier layer are widely adopted due to their high tunnel magnetoresistance (TMR) ratio of more than a hundred percent3,4,5. This high TMR arises from the spin-dependent coherent tunneling of electrons6,7. In sputter-deposited CoFeB/MgO-based MTJs, which are indispensable for commercial applications, thermal annealing at a high temperature above ~300 °C is needed.

MgO is usually (001)-oriented after deposition on an amorphous CoFeB film8,9,10,11. The as-deposited amorphous CoFeB is known to crystallize into the bcc texture, accompanied by B diffusion into adjacent layers8,12,13,14,15, during annealing when using (001)-oriented MgO as a template9,10,11,13,16,17. This crystallization is essential for achieving a high TMR ratio, attributed to the phenomenon of coherent electron tunneling.

Conventional thermal annealing (CTA) using a heat treatment furnace is commonly employed for annealing MTJs. A typical annealing cycle lasts hours, and further waiting is often required for the sample to cool to room temperature. Rapid thermal annealing (RTA) with a halogen-lamp heater is also used for MTJ annealing18,19,20. RTA reduces the annealing time to hundreds of seconds and can yield near-100% TMR at 350–380 °C for 200–600 s19,20. Nonetheless, both CTA and RTA impose a high thermal budget, which can promote unwanted atomic interdiffusion and limit compatibility with temperature-sensitive device architectures.

Flash lamp annealing (FLA)21,22 (Fig. 1) provides a fundamentally different route. Intense, millisecond-scale Xe-lamp pulses can raise the MTJ surface temperature to extreme levels, exceeding 1000 °C in simulations, on timescales far shorter than those achievable with CTA or RTA. This ultrashort heating drastically cuts the thermal budget while potentially suppressing diffusion, enabling rapid crystallization even on thermally fragile substrates. In semiconductor processing, FLA has already proven effective for dopant activation while limiting diffusion22, suggesting its potential for broader application in advanced nanoscale devices.

Fig. 1: Schematic diagram of flash lamp annealing (FLA) for magnetic tunnel junctions (MTJs).
Fig. 1: Schematic diagram of flash lamp annealing (FLA) for magnetic tunnel junctions (MTJs).The alternative text for this image may have been generated using AI.
Full size image

A sequence of millisecond pulses from a Xe lamp rapidly heats the substrate surface to a high temperature, followed by a rapid decrease in surface temperature after irradiation. Using this process, annealing of MTJs fabricated on the substrate surface can be completed within a few seconds.

In this article, we demonstrate ultrafast annealing of CoFeB/MgO-based MTJs using flash lamp annealing (FLA). Optimized FLA achieves a TMR ratio of ~100% with a total irradiation time of only 1.7 s. In contrast, CTA requires hours of high-temperature processing to achieve comparable performance. This difference corresponds to a reduction in annealing time by orders of magnitude, while maintaining structural integrity and suppressing excessive interdiffusion. Using high-resolution transmission electron microscopy (HRTEM) and energy-dispersive X-ray spectroscopy (EDX), we carefully investigated the crystallinity and atomic diffusion in MTJs in their as-prepared state, after CTA, and after FLA under varied pulse conditions. The results of these analyses provide a clear basis for optimizing ultrafast annealing conditions toward high-throughput, energy-efficient spintronic device fabrication.

Results

Magnetic tunnel junctions for experiments

A CoFeB/MgO-based pseudo-spin valve structure was used for the experiments. The following layers, listed from the substrate side, were deposited on a thermally oxidized Si substrate with a thickness of 380 μm: Ta(3.2)/Ru(11)/Ta(3.2)/Co20Fe60B20(4.1)/MgO(1.7)/Co20Fe60B20(2.1)/Ta(3.2)/Ru(2.2), where the numbers in parentheses indicate the layer thicknesses in nanometers (see Methods). The stack was patterned into an elliptically shaped magnetic tunnel junction (MTJ) (see Fig. 2a). The in-plane area of the MTJ pillar (A) is 157 μm2. The CTA was performed at 450 °C for three hours in the absence of an external magnetic field.

Fig. 2: Magnetic tunnel junctions (MTJs) for experiments and experimental conditions and setup for flash lamp annealing (FLA).
Fig. 2: Magnetic tunnel junctions (MTJs) for experiments and experimental conditions and setup for flash lamp annealing (FLA).The alternative text for this image may have been generated using AI.
Full size image

a Schematic of an MTJ device. The inset shows a photograph of a device from the top electrode side. b MR curve before (as-made). c MR curve after CTA. A 118% TMR is obtained after annealing at 450 °C for 3 h. d Light irradiation sequence. The inset shows a typical light irradiation profile of a single pulse bunch. Ten light pulses, each with a duration of 10 μs, are delivered within a single pulse bunch. The total irradiation time increases with the number of pulse bunches n. e Photograph of a sample holder. A field of 420 mT was applied at the sample position by a pair of magnets (embedded in the dotted-line region). The inset shows a photograph of the backside of the holder. f Time dependence of the simulated wafer surface temperature (Tsurface, red) and the thermocouple temperature near the sample position (Tholder, blue) for n = 33. The inset shows the dependence of the simulated maximum Tsurface on n.

The magnetoresistance (MR) curves for the MTJs before (as-made) and after CTA are shown in Fig. 2b, c. An external magnetic field H was applied along the long axis of the MTJs. The vertical axis represents the MR ratio, defined as (R − RP)/RP, where R is the resistance of the MTJ and RP is the resistance when the magnetizations of the top and bottom CoFeB layers are aligned parallel at high H (see Methods). While the as-made MTJ exhibited only a small TMR effect, the CTA-treated device showed a maximum TMR ratio of 118%.

Flash lamp annealing

Here, the effect of FLA on the TMR ratio is discussed. FLA was performed using a Pulse Forge 3300 (NovaCentrix Corp.). Figure 2d shows a typical light irradiation profile. Ten light pulses, each with a duration of 10 μs, are delivered within 2 ms. We define this series of pulses as a single pulse bunch. In the following experiments, n pulse bunches were repeatedly irradiated at 50 ms intervals in air. Thus, the total irradiation time ttotal was 52n − 50 ms. Several as-made MTJs fabricated on the same wafer were fixed on a sample holder made of Al (see Fig. 2e) for light irradiation under each n condition. It should be noted that an external magnetic field of ~420 mT was applied along the long axis of the MTJs by built-in permanent magnets (SmCo) (see also Methods).

The temperature of the holder near the sample position (Tholder) during and after light irradiation was monitored using a thermocouple. The time dependence of Tholder for n = 33 is shown in Fig. 2f. Tholder increased to ~180 °C during FLA and decreased to below 50 °C within 3–4 s after light irradiation, suggesting that the annealing effect induced by the residual heat from the sample holder is not dominant. The temperature of the surface side of the wafer (Tsurface), corresponding to the MTJ pillar part, was estimated to substantially increase during FLA, although direct temperature monitoring of the wafer surface is difficult. Thus, we simulated Tsurface by assuming a similar layer structure (see Methods). The simulated Tsurface is also shown in Fig. 2f. The simulated maximum Tsurface produced by FLA is plotted in the inset, where T > 1000 °C is expected for n > 30.

Figure 3a shows typical MR curves for n = 10–40 (see Methods). A clear TMR is observed for n > 30. The TMR ratio and RPA as a function of n are summarized in Fig. 3b, c. The results for the as-made (n = 0) and CTA MTJs are also plotted in the figures. For the FLA MTJs, the TMR ratio abruptly increases at n = 31, and the maximum value approaching 100% is achieved at n = 33 (ttotal ≈ 1.7 s), whereas RPA is almost constant. The TMR ratio gradually decreases, but RPA increases when n is further increased.

Fig. 3: MR curves for flash lamp annealed MTJs and n dependence of the TMR and RPA.
Fig. 3: MR curves for flash lamp annealed MTJs and n dependence of the TMR and RPA.The alternative text for this image may have been generated using AI.
Full size image

a MR curves for MTJs subjected to FLA with different n values. The corresponding total irradiation time (ttotal) is also indicated. b TMR ratio as a function of n, and for CTA (red star). A clear TMR signal is observed for n > 30. The maximum TMR ratio of ~100% is obtained at n = 33. c RPA as a function of n, and for CTA (red star).

Microstructure observations

To determine the effect of FLA on the microstructure of the layers, HRTEM and EDX measurements were performed. Cross-sectional specimens were prepared by the focused ion beam microsampling technique. HRTEM images of as-made, CTA, and FLA (n = 33, 40, and 42) MTJs are shown in Fig. 4a–e. In the as-made MTJ (Fig. 4a), the CoFeB layers are in an amorphous state, whereas in the CTA MTJ (Fig. 4b), the layers become crystallized with a (001)-oriented texture8,10,12,13,15,16,23,24,25, which most likely plays a key role in enhancing the TMR ratio due to coherent tunneling. In contrast, in the FLA MTJ with n = 33 (Fig. 4c), crystallization is observed in the top CoFeB layer, but in the bottom layer it appears limited to ~1 nm immediately below the MgO layer. When n is further increased (Fig. 4d, e), the crystallized area in the bottom CoFeB layer extends across the entire layer.

Fig. 4: HRTEM and EDX.
Fig. 4: HRTEM and EDX.The alternative text for this image may have been generated using AI.
Full size image

ae HRTEM images of as-made, CTA, and FLA (n = 33, 40 and 42) MTJs. The layers from the Ta layer underneath the bottom CoFeB layer to the topmost Ru layer are shown in the large squares, and the CoFeB/MgO/CoFeB layers are enlarged in the insets. fj EDX line profiles of the CTA, as-made, and FLA (n = 33, 40 and 42) MTJs. The results for O, Mg, Fe, Co, Ru, and Ta are summarized in the same panel, while the result for B is shown in a separate panel on the right side in each figure. The horizontal dashed lines indicate the peak positions of Fe in the top and bottom CoFeB layers.

The Ta layer underneath the bottom CoFeB layer is crystallized before annealing, but it is nearly amorphous after CTA13 or progressively amorphous when n is increased after FLA. This change may be related to the formation of TaB14,16.

Figure 4f–j shows the EDX line profiles of the as-made, CTA, and FLA (n = 33, 40, and 42) MTJs. The results for O, Mg, Fe, Co, Ru, and Ta are summarized in the same panel of each figure, and the result for B is displayed in a separate panel on the right side of each figure. The horizontal dashed lines shown in Fig. 4f–j indicate the peak positions of Fe in the bottom CoFeB layer. Horizontal lines are also drawn at the peak position of Fe in the top CoFeB layer for the CTA and FLA (n = 33) MTJs (see the horizontal dotted lines in Fig. 4g, j). Importantly, the B peak in the top CoFeB layer shifts markedly toward the top Ta layer after CTA or FLA, indicating that the annealing process triggers B diffusion towards the Ta layer8,14,15,26. A closer examination of the net count of B near the horizontal dashed lines confirms higher values in the FLA MTJs than in the CTA MTJ (see dotted circles in Fig. 4g, j). This observation suggests that less B diffusion occurs, consistent with its much shorter annealing time22. The residual B near the CoFeB/MgO interfaces12,14,15,27 and incomplete crystallization of CoFeB may contribute to the lower TMR ratio observed in the FLA MTJs compared with the CTA MTJ. The decreased B diffusion suggested here can be one of the possible causes of the incomplete crystallization of the bottom CoFeB layer in the FLA MTJ with n = 33 because B diffusion is known to promote CoFeB crystallization15.

Although the net count of B around the bottom CoFeB layer remains high in the MTJs with n = 40 and 42 (see Fig. 4i, j), almost complete crystallization of the layer is observed (see Fig. 4d, e). The difference in the time scales of thermally activated B diffusion and CoFeB crystallization may be an important factor in the n dependence of the bottom CoFeB crystallization. A temperature gradient inside the MTJ pillar may be another possible reason for the n dependence of the bottom CoFeB crystallization, but this effect is likely minor in the present “nanolayer” stacks.

As an additional observation, in the FLA MTJs with n = 33, the net count of O remains low throughout the stack except for MgO, indicating that the MTJ is not oxidized despite irradiation in air. In contrast, for the FLA MTJs with n = 40 and 42, an overall increase in the O profile is observed in all the layers, indicating that the metallic layers are being oxidized. The decrease in the TMR ratio and increase in RPA for the FLA MTJs with n > 33 may be attributed to oxidation of the top and bottom CoFeB layers due to excess FLA. In addition, in the FLA MTJ with n = 42, the Fe peak of the top CoFeB disappears, likely due to mixing with the top Ta layer.

Discussion

A high TMR ratio approaching 100% was achieved in CoFeB/MgO-based MTJs with a total irradiation time of only 1.7 s using FLA. Compared with CTA, B diffusion in FLA-treated MTJs is reduced, and CoFeB crystallization is likely to begin before B diffusion is complete. The oxidation of metallic layers depends on the number of pulse bunches n. These results indicate that precise control of crystallization, B diffusion, and oxidation will be crucial for further optimization of FLA conditions. For instance, milder FLA treatments with reduced Xe-lamp power or increased pulse intervals beyond n = 33 may yield improved outcomes. Under optimized conditions, the MTJ annealing process can be completed in well under one minute.

FLA can thus significantly reduce the processing time for spintronic devices, which is highly beneficial for MRAM and magnetic sensor manufacturing. Although the present experiments were conducted using a research-scale FLA system, flash lamp annealing is inherently compatible with large-area processing, and uniform annealing over wafer-scale substrates is expected to be feasible with appropriate system design and scaling. Industrial CTA systems capable of processing 300 mm wafers under applied magnetic fields are already commercially available, and large-area FLA tools for semiconductor applications are likewise established, albeit typically without magnetic field integration. The combination of these two existing technological frameworks — large-area magnetic-field annealing platforms and wafer-scale FLA modules — suggests a plausible pathway toward implementing field-assisted FLA at the wafer scale.

Flash-like transient heating approaches have also been explored in magnetic multilayers for exchange bias setting, using pulsed laser irradiation28,29 and current-induced Joule heating30. While laser annealing is also a rapid thermal processing method, its application to MTJ processing remains relatively limited31,32 and FLA offers advantages for large-area processing and scalability. Laser annealing typically employs highly localized irradiation with very high power density and very short pulse durations (typically in the nanosecond–microsecond range), whereas FLA provides millisecond-scale transient heating over a much larger area. As a result, laser annealing is well suited for localized processing, while FLA enables uniform heating of a larger device area. In addition to substantial time savings, it may reduce the total energy cost of annealing. With further study of the temperature gradient between the substrate and devices, FLA could also be applied to spintronic devices fabricated on flexible33,34,35 or stretchable substrates with low thermal tolerance.

Methods

Preparation of MTJs

The MTJ layers were deposited by RF sputtering using Ar and Xe as process gases. The thickness of each layer was determined by HRTEM with an accuracy of approximately ±10%. An MTJ pillar with an elliptical shape (20 μm × 10 μm) was defined by photolithography followed by Ar ion milling. The bottom Ta/Ru/Ta layers served as the bottom electrode. The top electrode, consisting of Cr(3)/Cu(30)/Au(3–5), was fabricated by evaporation and lift-off processes (see Fig. 2a). The top and bottom electrodes, except for the MTJ pillar region, were electrically isolated by a 30-nm-thick Al2O3 layer deposited in an atomic layer deposition chamber.

Flash lamp annealing setup

The MTJ samples were positioned within a 7 mm × 10 mm region at the center of the 76 mm × 76 mm irradiation field. The irradiation non-uniformity across this sample area is below ~1%. The magnetic-field uniformity over the sample holder was evaluated by magnetic-field simulation. Within the sample region (~5 mm × 5 mm), the magnetic field variation remains within ±5% of the median value.

Because the pseudo spin-valve structure used in this study does not contain an antiferromagnetic layer, exchange bias setting is not required. The magnetic field present during FLA originates from permanent magnets embedded in the sample holder and is not expected to play a central role in the present results.

Simulation of surface temperature

The surface temperature (Tsurface) during FLA was calculated using simulation software (NovaCentrix SimPulseTM 3.8). For the simulation, a simplified layer structure was assumed: Si(380)/SiO2(0.5)/Ru(0.03)/Cu(0.04), where the numbers in parentheses denote layer thicknesses in micrometers.

MR measurements

The MR measurements were performed using a four-wire method. The resistance R was measured at a bias voltage V of +10 mV, with the top electrode connected to the positive terminal. The data in Fig. 3a–c correspond to the MTJ that exhibited the highest TMR ratio among at least five devices tested under the same n condition. For example, at n = 33, the TMR ratio ranged from 56% to 97%, with RA values between 27 and 36 kΩ·(μm)2. Similar device-to-device variations were observed for other n values. The data points shown in Fig. 3b were obtained from independent experimental runs, and the enhancement of TMR around n = 31–36 was consistently observed across these runs despite the device-to-device variations described above.