Main

Magic-angle twisted bilayer graphene (MATBG) has attracted extensive research interest owing to its remarkable tunability, offering a versatile platform for the study of strongly correlated electronic phenomena1,2,3,4,5,6,7,8,9,10,11,12,13. Many electronic states have been discovered in this low-density electron system, such as superconductivity, strongly correlated insulating states, pseudogap phases, topological phases and orbital magnetism12,13. The intricate interplay of strong electronic Coulomb interactions, which can override kinetic energy in flat-band systems, probably underlies many of these phenomena12,13. Yet, the origin of the unusual superconducting state remains enigmatic, with proposed pairing mechanisms including strong electronic correlations14,15,16,17, electron–phonon interactions18,19,20,21,22, spin fluctuations23 and skyrmions24.

In recent decades, angle-resolved photoemission spectroscopy (ARPES) has emerged as an important tool for investigating quantum materials, enabling the direct visualization of the electronic structure in momentum space25,26. However, the modest dimensions (usually 1–10 μm) of two-dimensional material devices and the prevalent twist angle inhomogeneity in MATBG devices27 pose challenges for conventional ARPES techniques owing to their limited spatial resolution (usually 50–500 μm). Fortunately, recent advancements in high-throughput X-ray optics28 have empowered us to perform high-quality ARPES measurements with submicrometre spatial precision (μ-ARPES), making it well suited for unravelling the intricate electronic structure in MATBG devices29,30.

In this study, we utilize μ-ARPES measurements to investigate and compare the electronic structures of superconducting MATBG devices (where MATBG is unaligned with its hexagonal boron nitride (hBN) substrate) and non-superconducting twisted bilayer graphene (TBG) systems—both hBN-aligned MATBG and TBGs deviating from the magic angle. Remarkably, in superconducting MATBG devices (unaligned with hBN), a distinct set of replicas of the flat band emerges at higher binding energies. These replicas show robust spectral intensity features that emulate the energy and momentum characteristics of the original flat band. Notably, these replicas manifest across the entire momentum range of the moiré Brillouin zone, maintaining a uniform energy separation of 150 ± 15 meV. In contrast, when subjected to the same experimental conditions, the non-superconducting TBGs—either hBN-aligned MATBG devices or TBGs deviating from the magic angle—do not show flat-band replicas. These experimental findings naturally suggest a strong connection (see Supplementary Information section XIII for more discussions) between the microscopic mechanisms responsible for the formation of flat-band replicas and the occurrence of superconductivity in MATBG.

The observation of replicated bands at higher binding energies in the single-particle ARPES spectrum goes beyond the predictions of non-interacting band theories and often indicates a strongly correlated origin31,32,33. Through a comprehensive approach involving theoretical analysis and non-perturbative many-body simulations19,34, we found that in superconducting MATBG (hBN unaligned) devices, these observed replicas can be attributed to the strong coupling between flat-band electrons to a transverse optical phonon mode at the graphene K point, facilitated by an intervalley scattering process (discussed below, in Supplementary Information sections VIIIX and in ref. 34). Remarkably, such coupling diminishes in non-superconducting TBG devices, including hBN-aligned MATBG and TBGs deviating from the magic angle (discussed in Supplementary Information sections XIXII). In summary, our discoveries provide valuable insights into the intricate interplay between flat-band electrons and the bosonic degrees of freedom within MATBG, shedding light on the complex electronic configurations from which superconductivity emerges.

In this study, we first utilized superconducting (hBN unaligned) and non-superconducting (hBN aligned) MATBG devices characterized in a previous investigation11, in which the twist angles and superconducting and spectroscopic properties of these devices were validated through scanning tunnelling microscopy (STM) and scanning tunnelling spectroscopy (STS) analyses (for a concise summary, see Supplementary Information section I). Furthermore, we conducted measurements on non-superconducting TBG devices with twist angles that slightly deviate from the magic angle to enhance the breadth and depth of our comparative analysis.

Measurement schematic and sample geometry

A schematic representation of the μ-ARPES measurement is depicted in Fig. 1a, where a gold pad is connected to the MATBG sample to ensure proper grounding and prevent charge accumulation during measurements. The optical images showing the device geometries for hBN-unaligned and hBN-aligned MATBG samples can be seen in Fig. 1b(i) and Fig. 1c(i), respectively. Notably, the μ-ARPES technique can directly resolve the MATBG band structure, as evidenced in the photoemission intensity maps for each sample, which distinguishes the MATBG regions from the hBN substrate (Fig. 1b(ii)) or single-layer graphene regions (Fig. 1c(ii)).

Fig. 1: μ-ARPES measurement and MATBG device geometries.
figure 1

a, Schematic of the μ-ARPES measurement geometry. b, Basic characterization of the MATBG (twist angle 1.06°) device A unaligned to its hBN substrate (8° rotational misalignment). (i) Optical image, with the substrate hBN, MATBG region (marked by white dashed lines) and the gold (Au) contact labelled. (ii) ARPES spectral intensity map covering the area enclosed by yellow dashed lines in (i) that highlights the MATBG region of the device. The spectral intensity is integrated within a binding energy of 0.2 eV, visualizing only conductive graphene and gold regions while omitting the hBN and silicon dioxide (SiO2) regions. (iii) STM topographic image10 (Vb = −80 mV, I = 300 pA) and a simulation of the hBN-unaligned MATBG lattice, showing two sets of moiré patterns with different periodicity: the larger one (about 13.3 nm) from the graphene/graphene moiré and the smaller one (about 1.76 nm, marked by the yellow bar) from the graphene/hBN moiré, respectively. c, The same as in b, but for the MATBG (1.08°) device C aligned with its hBN substrate (0.5 ± 0.1° rotational misalignment). (i) Optical image with monolayer graphene regions (G and G′; marked by red and green dashed lines) and the MATBG region (marked by purple dashed lines) labelled. The orange-coloured regions of the hBN are the result of two overlapped hBN flakes used to encapsulate this sample’s graphite back gate. (ii) ARPES spectral intensity map showing both monolayer graphene and MATBG regions corresponding to regions labelled in (i). (iii) STM topographic image11 (Vb = −300 mV, I = 100 pA) and lattice simulation, showing a commensurate moiré structure from MATBG and graphene/hBN as both have periodicity around 13 nm. Vb, setpoint sample bias; I, setpoint tunnelling current. Scale bars, 20 μm (b(i) and c(i)) and 10 nm (b(iii) and c(iii)).

Although two of the MATBG devices we studied share a nearly identical twist angle of about 1.08°, an optimal value associated with the maximum superconducting transition temperature (Tc) in transport measurements1,35, superconductivity occurs only in the hBN-unaligned device and not in the hBN-aligned devices (see ref. 11 and further details available in Supplementary Information section I). This disparity could be attributed to the influence of distinct moiré potentials stemming from different forms and degrees of coupling between MATBG and its hBN substrate, as illustrated in Fig. 1b(iii) and Fig. 1c(iii). In the unaligned sample (Fig. 1b(iii)), the hBN substrate introduces a high-frequency spatial background to the intrinsic MATBG moiré potential owing to the large angle (about 8°) mismatch between the hBN and MATBG, which does not significantly alter the electronic structure of MATBG. Conversely, in the aligned sample (Fig. 1c(iii)), the moiré periodicity (about 13 nm) introduced by the hBN substrate closely matches that of intrinsic MATBG (13.8 nm) owing to its small twist angle (about 0.5°) relative to the bottom graphene layer. In this aligned sample, the graphene/hBN moiré potential and the resulting C2-symmetry-breaking have a profound influence on the electronic structure of pristine MATBG, as is discussed later.

Flat-band replicas in superconducting MATBG

In Fig. 2, we present ARPES data from two superconducting MATBG (hBN-unaligned) devices A and B. The measurements encompass the vicinity of the K points of the individual graphene layers spanning several moiré Brillouin zones (Fig. 2a). A three-dimensional band-structure map of device A is depicted in Fig. 2b, and six representative band dispersion spectra across the moiré Brillouin zones are shown in Fig. 2c. Near the Fermi level (EF), we observe the flat band of MATBG, which extends throughout the entire moiré Brillouin zone and aligns well with single-particle calculations, as reported previously29,30. Remarkably, we observe previously unresolved replicas of this flat band at higher binding energies (Fig. 2c), which show similar bandwidth, momentum range and spectral distribution to the original flat band.

Fig. 2: Observation of flat-band replicas in superconducting MATBG (hBN unaligned).
figure 2

a, Illustration of the MATBG moiré Brillouin zones around the K point of top and bottom monolayer graphene Brillouin zones. The grey lines show the moiré Brillouin zones of MATBG. Cuts 1–6 mark the coordinates of ARPES spectra in c. b, Three-dimensional intensity plot of ARPES spectra in the vicinity of the graphene K point, presenting an overview of the band structure of MATBG. The ARPES measurements were conducted at room temperature (for details, see Supplementary Information section III). c, ARPES dispersion plots taken on superconducting hBN-unaligned MATBG device A (top rows, marked by cuts 1–6 in a and b) and their associated second partial derivative plots (bottom rows; Methods). The red and cyan arrows in top and bottom rows highlight the flat band and its first and second replicas, respectively. d, Top: integrated ARPES spectra intensity in the green dashed line box in c. Bottom: the orange curve represents the same data shown in the top panel with the removal of a smooth background (fourth-degree polynomial). The blue curve represents the integrated intensity of the second-derivative plot within the same green dashed line box. Both the orange and the blue curves show an energy spacing between the flat band and replicas of 150 ± 15 meV. (Details can be found in Supplementary Information section II). e, ARPES dispersion plots taken on a second superconducting hBN-unaligned MATBG, device B, where flat band and replicas are observed. (Details can be found in Supplementary Information section III). Scale bars, 0.1 Å−1. a.u., arbitrary units.

To provide enhanced clarity, we have captured these features in a series of ARPES dispersion spectra, cutting across the moiré Brillouin zone at different momentum locations (refer to labels ‘cut 1’ to ‘cut 6’ in Fig. 2a,b). These observed replicas and the original flat band show a uniform energy separation of 150 ± 15 meV, with intensities diminishing rapidly towards higher binding energies (Fig. 2c,d and detailed in Supplementary Information section II). Similar replica features are also observed in other superconducting MATBG devices (Fig. 2e and Supplementary Information section III for device B; Supplementary Information section V for STS signatures on device F; and Supplementary Information section IV for data from the literature30). These observed replica features cannot be explained by typical band hybridization. First, they consistently manifest at uniform binding energies, unlike in typical band crossing and hybridization processes in which the characteristic energy varies. Second, their presence is not restricted to high-symmetry momenta within the moiré Brillouin zone. Finally, but most importantly, these replicas are absent in non-superconducting TBG devices, a point that is elaborated on in the following.

Absence of replicas in non-superconducting samples

To compare our measurements on superconducting MATBG devices (devices A and B), we also investigated non-superconducting TBG devices, including both hBN-aligned MATBG (device C) and TBGs with the twist angle slightly deviating from the magic angle (devices D and E). As illustrated in Fig. 3a–c, in the hBN-aligned MATBG, we find no indications of the replica features, despite the similarity of its twist angle and strain to the hBN-unaligned MATBG devices shown in Fig. 2. Instead, the band structure of this sample shows characteristic signatures of band hybridization between the Dirac bands of the two graphene layers, further modified by the commensurate MATBG/hBN moiré potential (Fig. 1b(iii)). It is noteworthy that this distinct electronic structure observed in hBN-aligned MATBG, compared with hBN-unaligned MATBG (for a more comprehensive analysis, refer to Supplementary Information section VI), underscores the significant impact of the moiré-commensurate hBN substrate36, giving rise to unique emergent phenomena, such as the quantum anomalous Hall effect5. Moreover, typical signatures of band hybridization are also evident in other non-superconducting TBG devices (devices D and E and also ref. 29) without any discernible signatures of replica band features (Fig. 3d,e).

Fig. 3: Absence of flat-band replicas in non-superconducting TBG devices.
figure 3

a, Illustration of the MATBG moiré Brillouin zones around the K point of top and bottom monolayer graphene. The grey lines show the moiré Brillouin zones of MATBG. Cuts 1–6 mark the coordinates of ARPES spectra in c. b, Three-dimensional intensity plot of ARPES spectra in the vicinity of the graphene K point, presenting an overview of the band structure of hBN-aligned MATBG. Here the electronic structure of MATBG is modified by the commensurate alignment of hBN compared with the unaligned case illustrated in Fig. 2. (A detailed comparison can be found in Supplementary Information section VI). c, ARPES dispersion plots and second-derivative plots taken along the same momentum directions (marked in a and b) as those in Fig. 2 for the hBN-unaligned MATBG sample. The flat band is labelled by red and cyan arrows. No signs of replicas of the flat band were observed, in contrast to the data shown in Fig. 2. d,e, ARPES dispersion plots (direction with respect to moiré Brillouin zones in a is illustrated in the bottom-left corner) and second-derivative plots taken on the TBG devices with a twist angle of 0.87° (d) and 1.4° (e), smaller or larger than the magic region. No signature of flat-band replicas is evidenced. Scale bars, 0.1 Å−1.

Strong electron–phonon coupling in MATBG

The observation of replica bands at higher binding energies in single-particle ARPES spectra typically signifies the presence of strong electron–boson coupling in the system31,32,33. In TBG, the concurrent emergence of these replica flat bands and superconductivity further indicates a shared underlying mechanism. This connection closely parallels the observation of shake-off replica bands in FeSe/SrTiO3 (refs. 31,37,38), which probably arise from the strong coupling of electrons in monolayer FeSe to an optical phonon mode in SrTiO3. This strong electron–phonon coupling (EPC) was proposed to account for the much-enhanced superconducting transition temperature in this composite system compared with FeSe alone31,37,38. Despite the presence of various bosonic modes in MATBG, few are consistent with our observations. For instance, collective spin fluctuations, which couple with electrons to form magnetic polarons, can be ruled out, as they would yield unevenly spaced replica bands39. Plasmons, which couple with electrons to form plasmarons, are also unlikely candidates because the energy separation of the resulting replicas would be well below 150 meV (refs. 40,41) owing to the low carrier density in intrinsic MATBG/hBN1. In contrast, phonons, which couple with electrons to form polarons, are expected to have mode energies around 150 meV, matching the energy spacing of the observed replicas, and thus stand as the most plausible origin31,32,33,37,38.

Given the negligible electronic kinetic energy of the flat bands in MATBG (approximately 10–15 meV) compared with the large mode energy (approximately 150 meV), we initially examined the intrinsic phonon spectra of graphene, without considering the renormalization caused by polaronic dressing. To induce replica features in both moiré Brillouin zones, the relevant phonon modes should reside around the Γ or K (K′) point in the original graphene Brillouin zone, allowing them to couple with the corresponding intra- or intervalley electron–hole processes, respectively. In Fig. 4a, we emphasize three phonon branches (distinguished by green, red and blue colours) with energies around 150 meV near the K point, indicating their potential to mediate intervalley EPC. Notably, our frozen-phonon calculations reveal that the in-plane transverse optical (red line) mode shows the highest EPC strength dominant over the other two (in-plane longitudinal optical and in-plane longitudinal acoustic) relevant modes (as detailed in Fig. 4b and Supplementary Information sections VII and VIII). This substantial difference in EPC strength is attributed to the symmetry of flat-band wavefunctions and two-centre approximation34, consistent with previous findings42,43.

Fig. 4: Intervalley EPC in MATBG.
figure 4

a, Calculated phonon spectra of single-layer graphene. Three phonon modes, in-plane longitudinal optical mode (iLO), in-plane transverse optical mode (iTO), in-plane longitudinal acoustic mode (iLA), with approximately 150 meV energy around the K point are highlighted. iTA is in-plane transverse acoustic mode, ZO is out-of-plane optical mode and ZA is out-of-plane acoustic mode. b, Calculated effective EPC strength of iTO, iLO and iLA phonon modes as a function of average lattice distortion. The EPC strength of the iTO mode is almost an order of magnitude larger than the other two modes. Inset: real-space lattice distortion magnitude induced by A1 and B1 modes of iTO phonons, which peaks at the AA-stacking regions and diminishes at the AB-stacking regions. (Details can be found in Supplementary Information section VIII). c, Schematic of the intervalley EPC in MATBG. Flat-band electrons from two moiré Brillouin zones at the K and K′ points are exchanged through the scattering of iTO phonons, which generates shake-off replica bands separated by energy intervals of the iTO phonon energy ωph. d, Simulated photoemission spectra of the flat band and replicas. The top-right panel shows the zoomed-in second-derivative spectra of the flat bands, where the solid lines are the calculated dispersion of bare bands without EPC. (Details can be found in Supplementary Information section IX). e, Electron removal (solid blue line, occupied states) and addition (dashed grey line, unoccupied states) spectral function obtained with the presence of band inversion induced by the hBN substrate potential Δ. The flat-band replica is suppressed with increasing Δ (Supplementary Information section XII). f, The replica Poisson factor p, which reflects the relative intensity of ARPES replica, as a function of both hBN substrate potential Δ and the flat-band bandwidth w (Supplementary Information sections X and XI).

The in-plane transverse optical phonon mode scatters flat-band electrons between the moiré Brillouin zones at the K and K′ points (depicted in Fig. 4c), resulting in a series of evenly spaced replica bands akin to those produced by forward-scattering phonons in each moiré Brillouin zone. The observed ARPES spectra can be well reproduced (Fig. 4d, with details in Methods and Supplementary Information section IX), with the energy separation determined by the phonon energy, which significantly surpasses the electronic bandwidth of about 15 meV (refs. 31,32,33).

Using the same EPC model, we further simulate the ARPES spectrum with the presence of valley-dependent potentials induced by the aligned hBN substrate. As shown in Fig. 4e, the polaronic replicas are significantly suppressed with the presence of the hBN potential denoted as Δ (for details, see Supplementary Information section X), in agreement with our experimental observation. In addition, to address the disappearance of polaronic replicas at larger and smaller twist angles in TBG, we conducted simulations to explore the relative replica intensity at varying bandwidths. As shown in Fig. 4f, the Poisson factor experiences a pronounced reduction as the bandwidth increases from the flat-band limit (w = 0) to finite values. Such a suppression reflects a similar mechanism to that influenced by the hBN potential Δ, highlighting the competition between the single-particle electronic Hamiltonian and the valley-crossing EPC. This competition originates from the non-commuting nature of EPC matrices with the hBN potential or kinetic energy terms and reveals a universal phenomenon in materials with strong EPC (for detailed discussions, refer to Supplementary Information sections XXII).

Discussion and conclusion

We now discuss how this experimentally identified EPC may further contribute to electronic pairing in MATBG. Our raw estimation indicates that MATBG lies in the strong-coupling regime (for details and an explanation of strong coupling, see Supplementary Information section VIII), consistent with the observation of a pseudogap above Tc (ref. 11). We emphasize that the involvement of phonons does not necessarily lead to conventional s-wave superconductivity, given the topological nature of multiple flat bands34. Previous studies have shown that intervalley EPC can facilitate d-wave pairing in the intra-Chern-band channel19,34,44, which opens the possibility of various pairing scenarios, including time-reversal breaking chiral d-wave pairing or gapless nematic d-wave pairing45, with the latter consistent with previous STS observations of a nodal tunnelling gap11.

Owing to the intervalley nature of the EPC, a local Coulomb interaction in the chiral limit commutes with the EPC terms. As a result, the polaronic replicas induced by EPC are not affected by the presence of any local Coulomb interactions (Supplementary Information section XII). The coexistence of both strong EPC and Coulomb repulsions is anticipated to introduce further complexity to the pairing symmetry in MATBG and may collaborate in stabilizing superconductivity, as recently discussed for cuprates46,47,48. At low temperatures, these pairing properties would be reflected by electronic properties within the energy window of each replica. Unfortunately, this range lies beyond the reach of the current μ-ARPES set-up owing to the temperature limitation (about 20 K) and resolution constraints (about 20 meV). However, the nature of electronic many-body states should not affect the quantification of polaronic replica features at the energy scale of 150 meV, thus is unlikely to substantially alter the interpretation presented above.

Our findings underscore the importance, although not necessarily the dominating role, of intervalley phonon modes in superconducting MATBG, which act to dress the electronic states of the flat bands in ways that closely resemble those found in iron-based high-Tc superconductors31,37,38. Going forward, we anticipate that forthcoming experimental endeavours will provide additional insights into the connection between the microscopic mechanisms driving replica flat bands and those underpinning superconductivity in twisted graphene systems. For instance, similar replica flat bands may emerge in superconducting magic-angle twisted trilayer graphene49, whereas they may be absent in non-superconducting twisted monolayer–bilayer graphene50. Furthermore, we expect that the distinctive capabilities of μ-ARPES (and more advanced nano-ARPES) will find widespread use in investigating strong correlation effects in moiré systems.

Methods

Sample fabrication

MATBG devices A, B and C are devices B, F and C from ref. 11 (the full description of the fabrication procedure can be found in this reference). In brief, devices were fabricated using a ‘tear-and-stack’ method51 in which a single graphene sheet is torn in half by van der Waals interaction with hBN. The two halves were rotated relative to each other and stacked to form MATBG. Graphene, graphite (in the aligned device) and hBN were picked up with polyvinyl alcohol. Then, to flip the heterostructure upside down, the heterostructure was pressed against an intermediate structure consisting of polymethyl methacrylate/transparent tape/Sylgard 184, and the polyvinyl alcohol was dissolved by water injection. The heterostructure was then transferred to a SiO2/Si chip with pre-patterned titanium/gold electrodes. Residual polymer was dissolved in dichloromethane, water, acetone and isopropyl alcohol. These chips were annealed in ultrahigh vacuum at 170 °C overnight and at 400 °C for 2 h in previous STM and STS measurements11. The other devices shared a similar fabrication and preparation method.

Spatial- and angle-resolved photoemission spectroscopy

Synchrotron-based μ-ARPES measurements were performed at Beamline 7.0.2 (MAESTRO) of the Advanced Light Source, USA and Beamline 07U of the Shanghai Synchrotron Radiation Facility (SSRF), China. The samples were annealed in ultrahigh vacuum at 300 °C for 3 h and measured under ultrahigh vacuum below 3 × 10−11 torr. The photon energy of the incident beam was 95 eV and the measurement was performed at room temperature. Data were collected using an R4000 analyser upgraded with deflectors (Advanced Light Source) and a DA30 analyser (SSRF). The incoming photon beam was focused down to a 2-μm spot size by using a capillary mirror28 and a 1-μm spot size using a Fresnel zone plate (SSRF). The total energy and angle resolutions were 20 meV and 0.1°, respectively.

Laplacian (sum of second partial derivatives) plots were used to highlight the non-dispersive bands, as presented in Figs. 2 and 3. These plots were generated by summing the second partial derivatives of the original ARPES spectra along both axes in pixel units, then transforming them into energy and momentum space.

Model calculation

Estimation of EPC

We adopted the following non-interacting-electron tight-binding Hamiltonian to calculate the electronic structure and EPC of MATBG

$$H=\,-\,\sum _{Ii\alpha \,,\,Jj\beta }t({{\bf{R}}}_{Ii\alpha }-{{\bf{R}}}_{Jj\beta }){c}_{Ii\alpha }^{\dagger }{c}_{Jj\beta }$$
(1)

where \({c}_{{I}i\alpha }^{\dagger }\) and \({c}_{{J}j\beta }\) are creation and annihilation operators for the pz orbital of the iα/jβ carbon atom in the I/Jth moiré superlattice (α and β are joint indices for sublattices and layers), and R is the corresponding atomic coordinate. The impact of atomic coordinates on the hopping parameter t is approximated by the Slater–Koster formula, whose dependence on atomic coordinates sets the stage for the EPC estimation. The Slater–Koster parameters are specified in Supplementary Information section VII.

To evaluate the coupling in the mini-Brillouin zone, we further projected the coupling matrix using the truncated atomic plane wave (TAPW) method43,52. Thus, a moiré phonon near the ΓM point can be approximated by in-plane transverse optical/in-plane longitudinal acoustic/in-plane longitudinal optical phonons at graphene K/K′ points (see Supplementary Information section VIII for details). We set the moiré phonon energy ω0 = 150 meV, the mass of carbon atoms mc = 2.0 × 10−26 kg, and the characteristic phonon length \({l}_{{\rm{p}}}=\sqrt{\hbar /(2{m}_{{\rm{c}}}{\omega }_{0})}=34.0\,{\rm{m}}{\text{\AA }}.\) The TAPW electrons and projected EPC constitute the model Hamiltonian for the TBG system

$$\begin{array}{l}H\,=\,\sum _{\bar{{\bf{k}}}\sigma }{{\bf{c}}}_{\bar{{\bf{k}}}\sigma }^{({\rm{T}}{\rm{A}}{\rm{P}}{\rm{W}})\dagger }{h}_{\bar{{\bf{k}}}}{{\bf{c}}}_{\bar{{\bf{k}}}\sigma }^{({\rm{T}}{\rm{A}}{\rm{P}}{\rm{W}})}\\ \,\,-\frac{1}{\sqrt{{N}_{{\rm{m}}}}\,}\sum _{\bar{{\bf{k}}}\sigma \bar{{\bf{q}}}\nu }{{\bf{c}}}_{\bar{{\bf{k}}}+\bar{{\bf{q}}}\sigma }^{({\rm{T}}{\rm{A}}{\rm{P}}{\rm{W}})\dagger }{M}_{\bar{{\bf{q}}}\nu }{{\bf{c}}}_{\bar{{\bf{k}}}\sigma }^{({\rm{T}}{\rm{A}}{\rm{P}}{\rm{W}})}({a}_{\bar{{\bf{q}}}\nu }+{a}_{-\bar{{\bf{q}}}\nu }^{\dagger })+\sum _{\bar{{\bf{q}}}\nu }{\omega }_{0}{a}_{\bar{{\bf{q}}}\nu }^{\dagger }{a}_{\bar{{\bf{q}}}\nu }\end{array}$$
(2)

where \({{\bf{c}}}_{\bar{{\bf{k}}}\sigma }^{({\rm{TAPW}})}\) is the column vector of electron annihilation operators with moiré momentum \(\bar{{\bf{k}}}\) and spin σ in the TAPW basis and \({h}_{\bar{{\boldsymbol{k}}}}\) is the electronic hopping matrix for a specific moiré momentum \(\bar{{\bf{k}}}\). The size of its moiré mini-Brillouin zone is denoted as Nm to distinguish from the system size N. For folded phonon branches, \({a}_{\bar{{\bf{q}}}\nu }\) is the annihilation operator with moiré momentum \(\bar{{\bf{q}}}\) and ν labels the index of branches. The average distance between phonons and electrons is small compared with the moiré length scale. Therefore, the coupling matrix \({M}_{\bar{{\bf{q}}}\nu }\) can be estimated by the momentum-independent M0, which can be evaluated by the frozen-phonon calculations.

As the experimentally relevant electrons lie in low-energy flat bands, we further project the TAPW orbitals onto the four flat-band orbitals, which can be expressed as the projection operator \({{\bf{c}}}_{\bar{{\bf{k}}}\sigma }^{({\rm{flat}})}={{P}_{\bar{{\bf{k}}}}^{\dagger }{\bf{c}}}_{\bar{{\bf{k}}}\sigma }^{({\rm{TAPW}})}\), where P is the flat-band projection operator. In the projected Hamiltonian, the EPC matrix becomes a 4 × 4-dimensional \({\widetilde{{M}_{\bar{{\bf{k}}}}}(\bar{{\bf{q}}})=P}_{\bar{{\bf{k}}}{\boldsymbol{+}}\bar{{\bf{q}}}}^{\dagger }\,{M}_{\bar{{\bf{q}}}}\,{P}_{\bar{{\boldsymbol{k}}}}\). On the basis of its numerical distribution (Supplementary Information section VIII), we find that \(\widetilde{{M}_{\bar{{\bf{k}}}}}(\bar{{\bf{q}}})\) can be approximated by \({Q}_{\bar{{\bf{k}}}{\boldsymbol{+}}\bar{{\bf{q}}}}\,{g}_{\bar{{\bf{q}}}}\eta {Q}_{\bar{{\bf{k}}}}^{\dagger }\), where Q is the similarity transformation, g is the coupling strength and η is a constant diagonal matrix. Thus, the projected Hamiltonian can be written in an EPC-diagonal basis

$$\begin{array}{l}{H}_{{\rm{flat}}-{\rm{band}}}=\sum _{\bar{{\bf{k}}}\sigma }{{\bf{d}}}_{\bar{{\bf{k}}}\sigma }^{\dagger }{Q}_{\bar{{\bf{k}}}}^{\dagger }{\varepsilon }_{\bar{{\bf{k}}}}{Q}_{\bar{{\bf{k}}}}{{\bf{d}}}_{\bar{{\bf{k}}}\sigma }\\ \,\,\,\,\,-\frac{1}{\sqrt{{N}_{{\rm{m}}}}\,}\sum _{\bar{{\bf{k}}}\sigma \bar{{\bf{q}}}\nu }{g}_{\bar{{\bf{q}}}}{{\bf{d}}}_{\bar{{\bf{k}}}+\bar{{\bf{q}}}\sigma }^{\dagger }\eta {{\bf{d}}}_{\bar{{\bf{k}}}\sigma }({a}_{\bar{{\bf{q}}}\nu }+{a}_{-\bar{{\bf{q}}}\nu }^{\dagger })+\sum _{\bar{{\bf{q}}}\nu }{\omega }_{0}{a}_{\bar{{\bf{q}}}\nu }^{\dagger }{a}_{\bar{{\bf{q}}}\nu }\end{array}$$
(3)

where \({{\bf{d}}}_{\bar{{\bf{k}}}\sigma }={Q}_{\bar{{\bf{k}}}}^{\dagger }{{\bf{c}}}_{\bar{{\bf{k}}}\sigma }^{({\rm{flat}})}\) and the diagonal matrix \({\varepsilon }_{\bar{{\boldsymbol{k}}}}\) represents the energy of the flat bands.

ARPES spectral simulation for TBG

The strong EPC for the flat-band electrons leads to non-perturbative polaronic dressing effects. We consider the Lang–Firsov transformation for the coupled Hamiltonian

$${U}_{{\rm{LF}}}={{\rm{e}}}^{-\frac{1}{\sqrt{{N}_{{\rm{m}}}}{\omega }_{0}}{\sum }_{\bar{{\bf{R}}}\sigma \bar{{\bf{q}}}\nu }{{\rm{e}}}^{-{\rm{i}}\bar{{\bf{R}}}\cdot \bar{{\bf{q}}}}{{g}_{\bar{{\bf{q}}}}{\bf{d}}}_{\bar{{\bf{R}}}\sigma }^{\dagger }\eta {{\bf{d}}}_{\bar{{\bf{R}}}\sigma }({a}_{\bar{{\bf{q}}}\nu }-{a}_{-\bar{{\bf{q}}}\nu }^{\dagger })}$$
(4)

where \({{\bf{d}}}_{\bar{{\bf{R}}}\sigma }\) is the real-space annihilation operator of electron in the EPC-diagonal basis. Owing to the separation of energy scales for electrons and phonons, we can employ the polaron ansatz for the ground-state wavefunction \(| {\varPsi }_{{\rm{G}}}\rangle ={U}_{{\rm{LF}}}^{\dagger }| {\psi }_{{\rm{e}}}\rangle \bigotimes | {\psi }_{{\rm{ph}}}\rangle \), where |ψe and |ψph are the electronic and phononic wavefunctions. Moreover, as both the transformed coupling strength \({g}_{\bar{{\bf{q}}}}\) and temperature are much less than the phonon energy ω0 = 150 meV, we further assume that the \(|{\psi }_{{\rm{ph}}}\rangle \) can be approximated by a vacuum state \(|{0}_{{\rm{ph}}}\rangle \). Thus, the electronic part \(| {\psi }_{{\rm{e}}}\rangle \) is determined by an effective Hamiltonian

$$\begin{array}{c}\langle {0}_{{\rm{ph}}}| {U}_{{\rm{LF}}}H{U}_{{\rm{LF}}}^{\dagger }| {0}_{{\rm{ph}}}\rangle =\sum _{\bar{{\bf{R}}}\bar{{{\bf{R}}}^{{\prime} }}\sigma }{{\bf{d}}}_{\bar{{\bf{R}}}\sigma }^{\dagger }\langle {0}_{{\rm{ph}}}| {h}_{\bar{{\bf{R}}}\bar{{{\bf{R}}}^{{\prime} }}}^{* }| {0}_{{\rm{ph}}}\rangle {{\bf{d}}}_{\bar{{{\bf{R}}}^{{\prime} }}\sigma }\\ -\sum _{\bar{{\bf{R}}}\bar{{{\bf{R}}}^{{\prime} }}\sigma {\sigma }^{{\prime} }\bar{{\bf{q}}}}\frac{{N}_{v}\,{| {g}_{\bar{{\bf{q}}}}| }^{2}}{{N}_{{\rm{m}}}{\omega }_{0}}{{\rm{e}}}^{{\rm{i}}(\bar{{{\bf{R}}}^{{\prime} }}{\boldsymbol{-}}\bar{{\bf{R}}})\cdot \bar{{\bf{q}}}}({{\bf{d}}}_{\bar{{\bf{R}}}\sigma }^{\dagger }\eta {{\bf{d}}}_{\bar{{\bf{R}}}\sigma })({{\bf{d}}}_{\bar{{{\bf{R}}}^{{\prime} }}{\sigma }^{{\prime} }}^{\dagger }\eta {{\bf{d}}}_{\bar{{{\bf{R}}}^{{\prime} }}{\sigma }^{{\prime} }})\end{array}$$
(5)

The \({h}_{\bar{{\bf{R}}}\bar{{{\bf{R}}}^{{\prime} }}}^{* }\) is the phonon-dressed electronic hopping matrix, whose specific form is shown in Supplementary Information section X. In the flat-band limit of TBG, this matrix is close to identity and, therefore, is irrelevant in determining the polaronic dressing. The ground state of the above equation determines the \(| {\psi }_{{\rm{e}}}\rangle \) in \(| {\varPsi }_{{\rm{G}}}\rangle \).

The photoemission spectrum also involves excited states (denoted as \(| \varPhi \rangle \))

$$A(\bar{{\bf{k}}},\omega )={\rm{Im}}\left\{\frac{1}{{N}_{{\rm{m}}}}\sum _{\bar{{\bf{R}}}\bar{{{\bf{R}}}^{{\prime} }}}{{\rm{e}}}^{-{\rm{i}}(\bar{{\bf{R}}}{\boldsymbol{-}}\bar{{{\bf{R}}}^{{\prime} }})\cdot \bar{{\bf{k}}}}\sum _{\sigma \alpha ,\Phi }\langle {\varPsi }_{{\rm{G}}}| {c}_{\bar{{\bf{R}}}\sigma \alpha }^{\dagger ({\rm{flat}})}| \varPhi \rangle \langle \varPhi | {c}_{\bar{{{\bf{R}}}^{{\prime} }}\sigma \alpha }^{({\rm{flat}})}| {\varPsi }_{{\rm{G}}}\rangle \frac{1}{\omega -{\rm{i}}\varGamma +{E}_{\varPhi }-{E}_{{\rm{G}}}}\right\}$$
(6)

where Γ is the Lorentzian broadening. With the aforementioned ground-state ansatz, the intensity of the Mth replica peak is explicitly determined as

$$\,\frac{1}{M!}\exp \left(-\sum _{\bar{{\bf{q}}}}\frac{{N}_{v}{| {g}_{\bar{{\bf{q}}}}| }^{2}}{{N}_{{\rm{m}}}{\omega }_{0}^{2}}\right){\left(\sum _{\bar{{\bf{q}}}}\frac{{N}_{v}{| {g}_{\bar{{\bf{q}}}}| }^{2}}{{N}_{{\rm{m}}}{\omega }_{0}^{2}}{{\rm{e}}}^{{\rm{i}}\bar{{\bf{R}}}\cdot \bar{{\bf{q}}}}\right)}^{M}$$
(7)

It follows a Poisson distribution with the factor \(\sum _{\bar{{\bf{q}}}}\frac{{N}_{v}\,{| {g}_{\bar{{\bf{q}}}}| }^{2}}{{N}_{{\rm{m}}}\,{\omega }_{0}^{2}}\approx 0.11\), according to the frozen-phonon simulations. Focusing on the relative intensity of replica features and ignoring the interacting nature of electrons inside each replica, we produce the spectral simulation in Fig. 4d.

Variational non-Gaussian ansatz for more general models

The Lang–Firsov transformation is suitable for the flat-band TBG model. The experiments presented in this paper also include TBG under more complicated conditions, including with hBN substrates and finite bandwidth away from the magic angle. These generalized cases can be modelled in the form of

$$H=\,\sum _{\bar{{\bf{k}}}\sigma }{{\bf{d}}}_{\bar{{\bf{k}}}\sigma }^{\dagger }{h}_{\bar{{\bf{k}}}}{{\bf{d}}}_{\bar{{\bf{k}}}\sigma }-\frac{1}{\sqrt{{N}_{{\rm{m}}}}\,}\sum _{\bar{{\bf{k}}}\sigma }{{\bf{d}}}_{\bar{{\bf{k}}}+\bar{{\bf{q}}}\sigma }^{\dagger }{M}_{\bar{{\bf{q}}}}{{\bf{d}}}_{\bar{{\bf{k}}}\sigma }({a}_{\bar{{\bf{q}}}}+{a}_{\bar{{\bf{q}}}}^{\dagger })+\sum _{{\bf{q}}}{\omega }_{{\bf{q}}}{a}_{\bar{{\bf{q}}}}^{\dagger }{a}_{\bar{{\bf{q}}}}$$
(8)

where \({h}_{\bar{{\bf{k}}}}\) describes the band structure in general. The derivation and specific forms of Hamiltonian with hBN and bandwidth are detailed in the Supplementary Information sections X and XI.

It is important to note that the first term in equation (8) does not commute with the second term in this generalized case, making the Lang–Firsov transformation unsuitable for this case. To simulate the polaronic dressing in this generalized Hamiltonian, we employ a variational ansatz of the ground state \(| {\varPsi }_{{\rm{G}}}\rangle ={U}_{{\rm{NGS}}}^{\dagger }(\lambda )| {\psi }_{{\rm{e}}}\rangle | {0}_{{\rm{ph}}}\rangle \) with

$${U}_{{\rm{NGS}}}^{\dagger }(\lambda )=\exp \left[\frac{\lambda }{\sqrt{{N}_{{\rm{m}}}}}\sum _{\bar{{\bf{R}}}\sigma \bar{{\bf{q}}}\nu }\,{{\rm{e}}}^{-{\rm{i}}\bar{{\bf{R}}}\cdot \bar{{\bf{q}}}}{{\bf{d}}}_{\bar{{\bf{R}}}\sigma }^{\dagger }\eta {{\bf{d}}}_{\bar{{\bf{R}}}\sigma }({a}_{\bar{{\bf{q}}}\nu }-{a}_{-\bar{{\bf{q}}}\nu }^{\dagger })\right]$$
(9)

where λ is the variational parameter in contrast to the fixed g/ω0 of the Lang–Firsov transformation for TBG. Owing to the high phonon energy compared with any energy scales in equation (8), we still assume the post-transformation phonon state is vacuum. Therefore, the variational ansatz gives the total energy as a function of λ:

$$\begin{array}{l}{E}_{{\rm{tot}}}(\lambda )=\langle {0}_{{\rm{ph}}}| \langle {\psi }_{{\rm{e}}}| {U}_{{\rm{NGS}}}(\lambda )H{U}_{{\rm{NGS}}}^{\dagger }(\lambda )| {\psi }_{{\rm{e}}}\rangle | {0}_{{\rm{ph}}}\rangle \\ \,\,\,=\,{E}_{{\rm{kin}}}-\frac{{N}_{v}}{{N}_{{\rm{m}}}}(2g\lambda -{\omega }_{0}{\lambda }^{2})\sum _{\bar{{\bf{R}}}{\bar{{\bf{R}}}}^{{\prime} }\sigma {\sigma }^{{\prime} }\bar{{\bf{q}}}}{{\rm{e}}}^{{\rm{i}}(\bar{{{\bf{R}}}^{{\prime} }}-\bar{{\bf{R}}})\cdot \bar{{\bf{q}}}}\\ \,\,\,\,\langle {\psi }_{{\rm{e}}}| ({{\bf{d}}}_{\bar{{\bf{R}}}\sigma }^{\dagger }\eta {{\bf{d}}}_{\bar{{\bf{R}}}\sigma })({{\bf{d}}}_{\bar{{{\bf{R}}}^{{\prime} }}{\sigma }^{{\prime} }}^{\dagger }\eta {{\bf{d}}}_{\bar{{{\bf{R}}}^{{\prime} }}{\sigma }^{{\prime} }})| {\psi }_{{\rm{e}}}\rangle \end{array}$$
(10)

with the normalized kinetic energy Ekin follows the expression in Supplementary Information sections X and XI. Unlike the analytically solvable Lang–Firsov transformation, the variational parameter λ is obtained by numerical optimization of Etot(λ). The self-consistent equations for different situations of the generalized Hamiltonian are derived in Supplementary Information sections X and XI. This numerically determined λ describes the relative strength of the polaronic dressing, where λ reproduces g/ω0 when H takes the flat-band TBG form in equation (3). The Poisson factor for the relative replica intensity is then obtained by p = Nvλ2.