Main

Sulfur is an essential element in amino acids, vitamins and protein cofactors. Bacteria assimilate inorganic sulfur through the uptake and reduction of sulfate1. However, under sulfate unavailability, bacteria scavenge organic sulfur via methionine-salvage pathways by recycling metabolic by-products to methionine2,3. Rhodospirillum rubrum utilizes an anaerobic methionine-salvage pathway to convert (2-methylthio)ethanol (MT-EtOH) to methanethiol, direct precursor to methionine, via methylthio-alkane reductase4,5. In vivo, this enzyme produces methanethiol from several volatile organic sulfur compounds (VOSCs) besides MT-EtOH, including dimethyl sulfide (DMS) and ethyl methyl sulfide (EMS), releasing in a 1:1 stoichiometry ethylene (C2H4), methane (CH4) or ethane (C2H6), respectively4,5. The release of these small hydrocarbons, which are potent greenhouse gases, to the atmosphere can exacerbate the climate crisis. However, these hydrocarbons also offer a biological, sustainable alternative to fossil fuel-based products, like natural gas and plastic precursors. Specifically in the field of renewable plastic production, the methylthio-alkane reductase stands out as a biogenic source of C2H4 (ref. 6). Unlike all other C2H4-forming enzymes, including the 1-aminocyclopropane-1-carboxylic acid oxidase (ACCO), the ethylene-forming enzyme (EFE) and the NADH:Fe(III)EDTA oxidoreductase, the methylthio-alkane reductase is oxygen independent, requires simple substrates and does not release carbon dioxide (CO2) as a side product6. This makes this enzyme an exceptional option for the production of this important chemical building block, currently commercially derived exclusively from fossil fuels. Moreover, methylthio-alkane reductases explain the observed accumulation of C2H4 in waterlogged anoxic soils7 and, thus, are potentially involved in agricultural yield losses due to inhibitory effects of C2H4 on plant growth8.

Methylthio-alkane reductases have been phylogenetically classified as nitrogen fixation-like (Nfl) enzymes of the nitrogenase superfamily4,9 (Fig. 1). Nitrogenases are the only enzymes known to reduce the stable triple bond of dinitrogen (N2) to produce bioavailable ammonia (NH3)10 and distinguish themselves from Nfl enzymes in their catalytic metal centres11. Nfl enzymes such as the dark-operative protochlorophyllide a oxidoreductase (DPOR, encoded by bchLNB) or the Ni2+-sirohydrochlorin a,c-diamide reductase (encoded by cfbCD), involved in carbon–carbon double-bond reductions, contain only [Fe4S4]-clusters and tetrapyrrole binding sites12. By contrast, nitrogenases are known to contain large and complex metalloclusters that allow N2 fixation, including the P-cluster, an [Fe8S7]-cluster and the iron-molybdenum cofactor (FeMoco), a [MoFe7S9C-(R)-homocitrate]-cluster located in the active site of the molybdenum (Mo)-nitrogenase (encoded by nifHDK)11,13,14 (Fig. 1). FeMoco is assembled by the maturase Nif(EN)2, a nitrogenase family member that converts an Fe-only precursor, the [Fe8S9C]-cluster (L-cluster), into FeMoco by replacing a terminal Fe atom with Mo and attaching (R)-homocitrate15,16 (Fig. 1). Distinct activities of nitrogenases and Nfl enzymes are attributed to these differences in metallocluster content and, thus, there is great interest to identify the metal centres dictating the carbon–sulfur (C–S) bond reduction activity in methylthio-alkane reductases.

Fig. 1: Structural diversity of the nitrogenase and Nfl protein family.
figure 1

a, A schematic overview illustrating structures from key nitrogenase and Nfl proteins with their associated metalloclusters responsible for electron transfer and catalysis. Electrons are sequentially transferred through transient interaction of the homodimeric reductase components to the metal cofactors of the heterotetrameric catalytic components. All reductase components harbour an [Fe4S4]-cluster, which donates electrons to the subunit bridging metallocofactor of the catalytic component. For the DPOR (Bch(NB)2) and FeMoco maturase Nif(EN)2, this is an [Fe4S4]-cluster15,72, while the Mo-nitrogenase catalytic component (Nif(DK)2) harbours a P-cluster ([Fe8S7]-cluster) as an electron relay to the active site11. In DPOR, the substrate protochlorophyllide a (Pchlide) sits directly at the active site72,73. Nitrogenases harbour more complex metalloclusters in their active sites, such as FeMoco11, and Nif(EN)2 harbours the [Fe8S9C]-cluster15. The metallocluster composition for the methylthio-alkane reductase as well as the stoichiometry of the reaction was still unresolved. The protein name of each subunit is shown and derived from the corresponding gene name. Genes for each nitrogenase(-like) enzyme are usually organized in an individual operon encoding the subunits of reductase and catalytic components. Not shown here is the very distant homologue Ni2+-sirohydrochlorin a,c-diamide reductase (CfbD), which catalyses the second last step of coenzyme F430 biosynthesis, a tetrapyrrole cofactor in methanogens45. b, Main catalytic reactions. DPOR reduces the C17=C18 double bond of Pchlide to chlorophyllide a (Chlide) in the chlorophyll a biosynthetic pathway72,73. The Nif(EN)2 maturase converts the precursor [Fe8S9C]-cluster into FeMoco by inserting Mo and (R)-homocitrate15. Mo-nitrogenase reduces protons and N2 to NH3 and H2 (ref. 11). The methylthio-alkane reductase is proposed to reduce MT-EtOH to methanethiol and C2H4 (ref. 4). Protein phylogenetic relationships are depicted based on Extended Data Fig. 10. c, Legend of the metalloclusters found in nitrogenase and Nfl proteins shown in a.

Here, we determined the molecular structure of the methylthio-alkane reductase complex by anaerobic single-particle cryogenic electron microscopy (cryo-EM) yielding a global resolution of 2.75 Å. We found that the catalytic component contains P-clusters and [Fe8S9C]-clusters, making it the only Nfl enzyme with large nitrogenase cofactors, but unable to reduce N2. Our findings suggest that the protein scaffold, rather than the metalloclusters alone, dictates the differing catalytic activities of nitrogenases and methylthio-alkane reductases, advancing our understanding of the origin and evolution of N2 fixation.

Results

Structural characterization of the methylthio-alkane reductase

To reveal the molecular machinery responsible for CS bond reduction of methylthio-alkane reductases (encoded by marHDK), we heterologously produced subunits MarH, MarD and MarK from R. rubrum and its associated maturation enzyme MarB in an engineered Rhodobacter capsulatus strain17,18 encoding a disrupted Mo- (ΔnifD::SpR) and iron (Fe)-nitrogenases (ΔanfHDGK::gmR; for more details, see Methods and Supplementary Table 3). The reductase and catalytic components of the methylthio-alkane reductase were purified separately using affinity chromatography (Fig. 2a). We determined the oligomeric state of each component by mass photometry: the reductase component forms a MarH2 homodimer with a measured mass of 69 kDa (Fig. 2d; theoretical mass 64 kDa), while the catalytic component forms a Mar(DK)2 heterotetramer with a measured mass of 227 kDa (Fig. 2d; theoretical mass 217 kDa). In conclusion, methylthio-alkane reductase components have the same stoichiometry as the Mo-nitrogenase components.

Fig. 2: Purification and characterization of the methylthio-alkane reductase.
figure 2

a, SDS–PAGE of Strep-tagged MarD (57.5 kDa), MarK (50.8 kDa) and His-tagged MarH (31.9 kDa). MW, molecular weight; M, marker. Purification of the methylthio-alkane reductase complex was reproduced independently with similar results 26 times. b, Fe content of Mar(DK)2 and MarH2 determined by ICP-OES (n = 6; independent experiments). c, (R)-homocitrate abundance in Nif(DK)2 and Mar(DK)2 determined by HPLC–MS (n = 6; independent experiments). d, Mass photometry histogram of MarH2 (top) and Mar(DK)2 (bottom). Event counts are plotted against their corresponding molecular mass. e, In vitro activities of the methylthio-alkane reductase (Mar) and Mo-nitrogenase (Nif) for the reduction of MT-EtOH, EMS, DMS, acetylene (C2H2) and protons under an argon (Ar) atmosphere (n = 3; independent experiments). f, The methylthio-alkane reductase cannot reduce 15N2 in vitro. Total turnover numbers for the formation of 15NH4+ by Nif(DK)2 or Mar(DK)2 depending on reductase component, ATP and sodium DT, or the strong reductant Eu(II)-DTPA detected by NMR; for more details, see Extended Data Fig. 7 (n = 2; independent experiments). Columns depict the mean, dots the individual measurements and error bars the propagated instrumental error. g, Electron density map of the MarDK2H2-complex at a global resolution of 2.75 Å (EMD-50553), contoured at level 10. Reductase component is coloured in grey (H1 and H2), and the catalytic component is coloured in yellow (D1) and green (K1 and K2). h, Single-particle cryo-EM 2D classes of the methylthio-alkane reductase with ratios of catalytic:reductase component of 1:1 and 1:2 (for more details, see Extended Data Fig. 3). In b, c and e, columns show the mean, dots the individual measurements and error bars the standard deviation. n.d., not detected.

The purified enzymes were active for the reduction of MT-EtOH, EMS and DMS as measured by the release of C2H4, C2H6 and CH4, respectively (Fig. 2e and Extended Data Fig. 1). While the Mo-nitrogenase is unable to reduce VOSCs, the methylthio-alkane reductase converts MT-EtOH at a rate of 2.3 nmol C2H4 × mg−1 Mar(DK)2 × min−1, EMS at 7.9 nmol C2H6 × mg−1 Mar(DK)2 × min−1 and DMS at 6.5 nmol CH4 × mg−1 Mar(DK)2 × min−1 (Fig. 2e). The methylthio-alkane reductase activity is ATP dependent and requires the addition of both reductase and catalytic components as well as sodium dithionite (DT) as an electron donor (Extended Data Fig. 1e). Intriguingly, we detected the formation of molecular hydrogen (H2) for all tested substrates and in the absence of substrate under an argon atmosphere (Fig. 2e and Extended Data Fig. 1a–d).

While Mo-nitrogenase cannot reduce VOSCs, methylthio-alkane reductase can reduce the alternative nitrogenase substrate acetylene (C2H2) to C2H4 (Fig. 2e and Extended Data Fig. 1d). Moreover, methylthio-alkane reductase can fully reduce C2H2 to C2H6, a characteristic of the alternative nitrogenases, vanadium (V)- and Fe-nitrogenases19,20, which contain V or Fe in the position of Mo in FeMoco, termed FeVco and FeFeco21. This suggests the presence of FeFeco/FeVco-like cofactors in the methylthio-alkane reductase.

To reveal the metalloclusters of Mar(DK)2 involved in catalysis, we determined the molecular structure of the methylthio-alkane reductase complex by anaerobic single-particle cryo-EM. We were unable to obtain the structure of the catalytic component alone, due to severe preferential orientation bias and susceptibility to denaturation of MarD through contact with the gas–water interface. We overcame this issue by forming a complex of Mar(DK)2 with MarH2, using the non-hydrolysable MgATP mimic MgADP-AlF3, thus protecting the catalytic component from denaturation. Reference-free two-dimensional (2D) class averages revealed the existence of two major oligomeric states: an octameric Mar(DK)2(H2)2-complex, and a hexameric Mar(DK)2H2-complex (Fig. 2h). Attempts to resolve either complex in three dimensions were still hampered by severe preferential orientation bias, leading to uninterpretable reconstructions (Extended Data Fig. 2c–f). However, we realized that the preferred orientations of the two oligomeric states were perpendicular to each other. This insight prompted a revised classification strategy, where instead of segregating particles based on oligomeric state, we combined all particles thus resolving structural regions present across both complexes. (Extended Data Fig. 2g,h). This approach led to a substantial improvement in the reconstructed density, but the MarD subunits remained poorly resolved due to particle heterogeneity and the intrinsic flexibility of the N-terminal region. Given our goal to resolve the active site metalloclusters of the methylthio-alkane reductase, we prioritized solving a three-dimensional (3D) class that represented a stable state of one MarD subunit, at the expense of resolving the second copy. By applying strict 3D classification on 25 classes using a local mask on the proximal MarD, we selected the class with the best-resolved N terminus of MarD—specifically, the class in which the active site cluster is most deeply buried within the protein. This enabled a focused refinement of the MarDK2H2 fragment of the entire Mar complex, yielding a final reconstruction with a global resolution of 2.75 Å. (Fig. 2g and Extended Data Fig. 3). The resulting map enabled us to build a structural model with well-resolved metalloclusters (Fig. 3a), revealing the molecular basis of the methylthio-alkane reductase specificity.

Fig. 3: Methylthio-alkane reductase contains nitrogenase cofactors in the active site.
figure 3

a, Protein model of the MarDK2H2-complex with highlighted ligand positions (PDB: 9FMG). Colouring as in Fig. 2g. be, Magnified views of the methylthio-alkane reductase MgADP-AlF3 moieties (b), the [Fe4S4]-cluster (c), the P-cluster (d) and the [Fe8S9C]-cluster (e). Cryo-EM electron density maps are displayed as a grey mesh at contour level 8. Ligands are represented as ball-and-sticks with carbon atoms coloured in dark grey, nitrogen in blue, oxygen in red, phosphorus in orange, aluminium in light grey, fluoride in pale cyan, magnesium in green, sulfur in yellow and iron in dark orange. Amino acid residues coordinating the metalloclusters are shown in stick representation. fh, EPR spectroscopy of MarH2 and Mar(DK)2 metalloclusters: EPR S = 1/2 signals of the [Fe4S4]1+ cluster in DT-reduced MarH2 with and without MgATP recorded at 10 K (f); EPR spectra at 10 K of Mar(DK)2 samples poised at the indicated redox potentials show two main species between −550 and −150 mV, and with simulations in red lines (the signal appearing upon oxidation is similar to that of the [Fe8S9C]-cluster of A. vinelandii Nif(EN)2 (blue trace)) (g); EPR spectra of reduced (DT-free) MarH2 (upper trace), oxidized (IDS-free) Mar(DK)2 (middle trace) and a mixture of both (lower trace) at 10 K (h). Reduced MarH2 can transfer electrons to oxidized Mar(DK)2 leading to the disappearance of the broad [Fe8S9C]-cluster-like g = 1.91 EPR signal, with concomitant appearance of a sharp rhombic species. EPR conditions: microwave frequency 9.35 GHz; modulation frequency, 100 kHz; 1.5 (f and g) and 1.0 (h) mT modulation amplitude; microwave power 0.2 mW. An isotropic radical was subtracted from all spectra to facilitate simulation and comparison.

Methylthio-alkane reductase contains nitrogenase cofactors

Analysing the structure, we observed a canonical reductase component MarH2, as detailed in Supplementary Discussion 1, containing one MgADP-AlF3 moiety per monomer (Fig. 3b and Extended Data Fig. 4) and a single [Fe4S4]-cluster in its dimeric interface coordinated by Cys97MarH and Cys133MarH (Fig. 3c). Presumably, electrons are transferred from the [Fe4S4]-cluster to the P-cluster located at a distance of ~15.0 Å, bridging the proximal MarDK dimer (Fig. 3b). The P-cluster is symmetrically coordinated by Cys32MarK, Cys56MarK, Cys121MarK, Cys66MarD, Cys91MarD and Cys158MarD, featuring a centrally shared sulfide (Fig. 3d) and is most likely present in the reduced state (PN), as detailed in Supplementary Discussion 2. Most excitingly, we observed strong electron density at the active site of the methylthio-alkane reductase, located ~14.4 Å away from the P-cluster (Figs. 3e and 4b). The density neatly accommodates the [Fe8S9C]-cluster from Nif(EN)2, the FeMoco precursor, with no density corresponding to any organic moiety. In addition, we were unable to detect (R)-homocitrate by high-performance liquid chromatography–mass spectrometry (HPLC–MS) of Mar(DK)2 (Fig. 2c). To determine the metal composition of these metalloclusters, we performed inductively coupled plasma optical emission spectroscopy (ICP-OES; Fig. 2b). ICP-OES analysis revealed 32.6 ± 5.3 Fe atoms per catalytic component, consistent with the presence of two P-clusters ([Fe8S7]-clusters) and two [Fe8S9C]-clusters per Mar(DK)2 heterotetramer, and neither Mo nor V was detected. The reductase component MarH2 contained 3.5 ± 0.9 Fe atoms, indicating the presence of a labile [Fe4S4]-cluster. Thus, methylthio-alkane reductase is the only Nfl enzyme described to harbour the large and complex nitrogenase P- and [Fe8S9C]-clusters.

Distinct metallocluster coordination and surrounding

The core of the [Fe8S9C]-cluster consists of a trigonal prism formed by Fe2–Fe7, which are bridged by three belt sulfides. Due to the presence of all conserved NifB motifs in MarB and the presence the radical SAM domain responsible for carbide insertion4, we assume a similar role of MarB and NifB in the assembly of the nitrogenase precursor cluster22. Thus, an interstitial carbide was placed in the centre of the cluster, although the resolution of the electron density map is insufficient to locate the carbide. The metallocluster is anchored to the protein scaffold by two amino acid residues, Cys270MarD to Fe1 and His429MarD to Fe8 (Fig. 4a), analogous to the coordination of Fe1 and Mo of FeMoco by Cys275AvNifD and His442AvNifD, respectively23 (Fig. 4c). However, His429MarD is located at an extended loop specific to MarD, which is not conserved in bona fide nitrogenases (Loop I in Fig. 4a,f and Extended Data Fig. 5). This explains why previous sequence analysis of the methylthio-alkane reductase gene cluster had failed to identify this ligand4. The octahedral coordination of Mo in the Mo-nitrogenase, which involves three sulfides, (R)-homocitrate and His442AvNifD, is replaced in the [Fe8S9C]-cluster of the methylthio-alkane reductase by a tetrahedral coordination due to the absence of (R)-homocitrate. As a result, the anchoring to the protein adopts an axial geometry, and the bond distance between His429MarD and Fe8 of the [Fe8S9C]-cluster is shorter (1.9 Å) compared with the bond distance between His442AvNifD and the Mo atom of FeMoco (3.1 Å in PDB: 7UTA (ref. 23)). This is also the case when compared with the bond distance between His423RcAnfD and Fe8 of FeFeco from the Fe-nitrogenase (2.7 Å in PDB: 8OIE (ref. 17)). The observed coordination of the methylthio-alkane reductase [Fe8S9C]-cluster by just two residues (Cys270MarD and His429MarD) indicates that this cluster indeed necessitates a carbide in its centre, because the geometry of an [Fe8S9]-cluster with six trigonal Fe atoms would not be possible without the structural support of a shared central ligand.

Fig. 4: Comparison of active sites from methylthio-alkane reductase and Mo-nitrogenase.
figure 4

a,c, The active site of the methylthio-alkane reductase (a) features an extended loop (Loop I, highlighted in pink) containing His429MarD, which coordinates the [Fe8S9C]-cluster. Conversely, for Mo-nitrogenase (c; PDB: 7UTA) a different structural domain (Loop II, highlighted in green) contains His442AvNifD, which coordinates FeMoco. Both panels display key amino acid residues potentially involved in metallocluster stabilization and catalysis in stick representation. Metalloclusters and (R)-homocitrate (HCA) are shown as ball-and-sticks. Nt, N terminus; Ct, C terminus. b,d, Substrate channels predicted by CAVER36 accessing the active sites of the methylthio-alkane reductase (b) and the Mo-nitrogenase (d) are depicted, with the first and second most likely channels coloured in dark and light grey, respectively. Mo-nitrogenase channels correspond to the proposed substrate channels by Igarashi and Seefeldt (IS channel)33 and Morrison et al. (AI channel)34. Residues that gate the proximal pocket of the Mo-nitrogenase active site are shown in stick representation35. Distances between the three metalloclusters, [Fe4S4]-cluster, P-cluster and active site cofactor, are indicated by dotted lines. e, Overlay of [Fe8S9C]-cluster and FeMoco from the alignment of MarD and NifD, highlighting the displacement of the S2B belt sulfur position between both clusters. f, Side view of the active site of the methylthio-alkane reductase emphasizing S2B and S5A as potential substrate recognition and reduction sites. Distances to S2B and S5A from relevant amino acids are shown as a dashed grey line. Colouring as in Fig. 3a.

Beyond the distinct cluster composition, the direct protein environment of methylthio-alkane reductases also deviates from nitrogenases. Substrate binding to nitrogenase metalloclusters has been proposed to occur between Fe2 and Fe6, where the bridging sulfide S2B is replaced by the substrate24,25,26,27,28. The conserved residues Val70AvNifD, Gln191AvNifD and His195AvNifD surround S2B and are essential for N2 reduction29 (Fig. 4c). The cavity left by the small side chain of Val70AvNifD allows substrate access to Fe6 of FeMoco30. His195AvNifD forms a hydrogen bond with S2B in the resting state and possibly donates protons during catalysis, while Gln191AvNifD undergoes structural rearrangements that allow S2B displacement by the substrate during turnover26,31. The residues lining the active site of the methylthio-alkane reductase differ substantially (Extended Data Fig. 6a): Cys73MarD and Gln74MarD replace Val70AvNifD, while tryptophan (Trp195MarD) and phenylalanine (Phe199MarD) residues substitute Gln191AvNifD and His195AvNifD, respectively (Fig. 4a). However, because the S2B site of the [Fe8S9C]-cluster of the methylthio-alkane reductase is rotated by ~48° compared with the S2B site of FeMoco taking as the vertex its interstitial carbide, only Trp195MarD is close enough to S2B to form a hydrogen bond (Fig. 4e,f). Trp195MarD, His194MarD and His375MarD surround the S2B site of the [Fe8S9C]-cluster at distances of 3.9–4.7 Å and thus could be involved in substrate recognition and binding as well as proton transfer during turnover (Fig. 4a,f). The [Fe8S9C]-cluster in Mar(DK)2 is also stabilized by Arg428MarD, which forms a hydrogen bond with the belt sulfur S5A (Fig. 4f). Interestingly, Arg428MarD is located at the extended Loop I of MarD, adjacent to the [Fe8S9C]-cluster coordinating His429MarD, and replacing Gly424AvNifD, a residue strictly conserved in all nitrogenases32 (Extended Data Fig. 5). Consequently, Arg428MarD together with His429MarD might be a hallmark of methylthio-alkane reductases and distinguish them from nitrogenases. In summary, the methylthio-alkane reductase active site suggests a different substrate recognition and proton transfer to nitrogenases.

Substrate channels determine specificity

Differences between methylthio-alkane reductase and Mo-nitrogenase extend to substrate channels: Mo-nitrogenase substrate channels proposed by Igarashi and Seefeldt (IS)33 and Morrison et al. (AI)34 are restricted from accessing the proximal pocket of FeMoco due to hydrogen bond gating by Tyr229AvNifD and Ser278AvNifD35 (Fig. 4d). By contrast, the two most likely methylthio-alkane reductase substrate channels predicted by CAVER36 do not show steric impediments. Restricted access to the active site of Mo-nitrogenase could explain its inability to utilize VOSCs, despite being more efficient at reducing smaller substrates (Fig. 2e). The most likely substrate channel in the methylthio-alkane reductase (channel I) originates at the interface of MarD and MarK, passes the belt sulfide S5A on the way to S2B and occupies the void left by the absent (R)-homocitrate (Fig. 4b,f). The second channel (channel II) originates at the surface of MarD opposite to channel I and ends at S2B of the [Fe8S9C]-cluster (Fig. 4b,f). Nevertheless, given the symmetric geometry of the [Fe8S9C]-cluster, substrate binding to any of the three belt sulfurs, S2B, S5A and S3A, cannot be discarded at this point. Further investigations are needed to decipher the binding of methylthio-alkanes to the [Fe8S9C]-cluster during catalysis.

The [Fe8S9C]-cluster does not reduce N2

Based on our finding that the methylthio-alkane reductase contains an [Fe8S9C]-cluster in conjunction with the recent observations that [Fe8S9C]-cluster-containing nitrogenase maturase Nif(EN)2 can reduce N2 to NH3 to a slight extent37, we aimed to determine the N2 reduction activity of methylthio-alkane reductases. No formation of 15NH4+ beyond the background could be detected by nuclear magnetic resonance (NMR) in 15N2-assays mimicking physiological conditions containing catalytic component Mar(DK)2, reductase component MarH2, ATP and DT, in contrast to Mo-nitrogenase assays under the same conditions (Fig. 2f and Extended Data Fig. 7). Assays with just the catalytic component Mar(DK)2 and the strong reductant europium-II-diethylenetriamine pentaacetic acid (Eu(II)-DTPA) also did not display 15NH4+ formation above background levels (Fig. 2f and Extended Data Fig. 7). This result contrasts with the observation that Nif(EN)2 can reduce one to three molecules of N2 in a reductase component- and Eu(II)-DTPA-dependent manner37. Thus, our results reveal that the presence of an [Fe8S9C]-cluster is not the sole determinant for N2 reduction; rather, the coordination and the immediate surroundings of the [Fe8S9C]-cluster may determine its activity.

Spectroscopy and redox states of metalloclusters

Nitrogenases and Nfl enzymes exhibit very distinct EPR spectroscopic features13, which define the redox state and electronic structure of the metalloclusters. At 4–20 K, DT-reduced MarH2 exhibited the typical physical spin mixture of a S = 1/2 (g = 2.03, 1.933 and 1.84) and a S = 3/2 species of the [Fe4S4]1+ cluster, characteristic for NifH2 (ref. 38), of which the relative content of the two spin states is ATP dependent, as detailed in Supplementary Discussion 3 (Fig. 3f and Extended Data Fig. 8b,c). Next, we analysed the catalytic component Mar(DK)2: upon DT reduction, and in a dye-mediated redox titration with a midpoint potential (Em) of −390 mV versus normal hydrogen electrode, a relatively sharp S = 1/2 signal (gaverage = 1.96) appeared (Fig. 3g and Extended Data Fig. 8a). Although the Mar(DK)2 properties did not match those of the Mo-nitrogenase Nif(DK)2 (Supplementary Discussion 4, Extended Data Fig. 9 and Supplementary Table 3), it was very similar to the signal assigned to a reduced P-cluster in the catalytic component of V-nitrogenase39. Because the fully intact DT-reduced PN cluster is diamagnetic and the fully intact P2+ oxidized state has a g = 12 integer spin signal in Nif(DK)2 (ref. 40) and Vnf(HDK)2 (ref. 41), this signal probably derives from a subclass of incompletely matured P-clusters containing an [Fe4S4]1+ half41,42. Upon oxidation with 5,5′-indigodisulfonate (IDS) or by dye-mediated redox titration with Em = −250 mV versus normal hydrogen electrode, a peculiar broad S = 1/2 signal (simulated with gxyz = 1.967, 1.926 and 1.83) with gaverage = 1.91 appeared, totally unlike P1+ and P3+ EPR signals13,40. This EPR signal has previously been observed for the [Fe8S9C]-cluster-containing nitrogenase maturase Nif(EN)2 (ref. 37) (Fig. 3g). Our findings show that the gaverage = 1.91 signal is the fingerprint for the active site metallocluster of Mar(DK)2 and indicate that a carbide is indeed present in the [Fe8S9C]-cluster of the methylthio-alkane reductase. By mixing reduced (but DT-free) MarH2 with oxidized (but IDS-free) Mar(DK)2 and MgATP, we could demonstrate that the structurally defined electron transfer pathway (Fig. 3a) guides an electron to the species with gaverage = 1.91 (Fig. 3h). Concomitantly, the signal assigned to the reduced P-cluster partially appeared, in agreement with its low redox potential.

Evolution of nitrogenase metalloclusters

Our structural and biophysical data conclusively show that methylthio-alkane reductases contain metalloclusters until now exclusively associated with nitrogenases. This strongly suggests that their last common ancestor probably contained a P-cluster and a large active site cluster, as previously hypothesized43. The fact that we find ligating cysteines in methylthio-alkane reductases that are in homologous positions to equivalent cysteines in nitrogenases strongly suggests that these ligation sites already existed in their last common ancestor. To test this prediction, we first inferred a phylogeny of the subunits assembling the catalytic component of the wider nitrogenase family, composed of nitrogenase-like proteins (NflDK) including the methylthio-alkane reductase (MarDK), Mo-nitrogenase (NifDK), FeMoco maturase (NifEN), dark-operative protochlorophyllide a oxidoreductase (BchNB), chlorophyllide a oxidoreductase (BchYZ) and Ni2+-sirohydrochlorin a,c-diamide reductase (CfbD). Nitrogenases and their relatives are heterotetrameric proteins comprising two paralogues—usually called D and K. This arrangement is a result of an ancient gene duplication event of an ancestrally likely homomeric protein, represented today by CfbD2 (ref. 43). Subsequent operon duplications then produced the different paralogous heterotetramers known in this family. CfbD is an outgroup to all other sequences, which roots the tree between the K and D paralogues of the catalytic components of methylthio-alkane reductase and nitrogenase. BchNB and CfbD (just under 30% of all sequences from our alignment) neither have all cysteines conserved nor contain a P- or [Fe8S9C]-cluster. Our initial tree search yielded a maximum-likelihood tree in which all Bch proteins are a sister group to all nitrogenase(-like) proteins including methylthio-alkane reductases (Extended Data Fig. 10e). This implies that the duplication from homomers to paralogous heteromers occurred twice, once in nitrogenase(-like) proteins and once in Bch proteins. The support for this unparsimonious split was low, so we inferred another tree in which we constrained the topology such that Bch and nitrogenase(-like) proteins including methylthio-alkane reductases underwent only one initial duplication from a homomer, followed by subsequent operon duplications (Extended Data Fig. 10a). This constrained tree was not rejected by the approximately unbiased test, and we view it as more plausible. We then used ancestral sequence reconstruction to infer sequences of the two subunits (D and K) that together assembled into the last common ancestor of nitrogenases and methylthio-alkane reductases for the unconstrained and constrained trees. Our reconstructions of these two very deep nodes were not confident enough across their entire sequence to resurrect this ancient protein experimentally (average posterior probability across sites 0.70). Posterior probabilities were much higher, however, around the crucial metallocluster binding sites. We confidently infer the six cysteines that would ligate a P-cluster with a posterior probability of 1 for both trees, which are in positions homologous to their positions in methylthio-alkane reductases and nitrogenases. For the residues ligating the [Fe8S9C]-cluster, we confidently infer a cysteine homologous to Cys270MarD (posterior probability of 0.99), but not a histidine equivalent to His429MarD or His442AvNifD, because this ligand is located in different loops in nitrogenases and methylthio-alkane reductases (Extended Data Fig. 10). Our results indicate that this common ancestor had a P-cluster and probably an [Fe8S9C]-cluster ligated differently than extant nitrogenases and methylthio-alkane reductases. This ancestral reductase was probably catalytically active and contained a complex metallocluster buried in its active site, rather than being a maturase with a surface-exposed cluster like Nif(EN)2 (ref. 15), although its activity is unresolved, so far. The discovery of nitrogenase clusters in the methylthio-alkane reductase provides a new perspective on how the protein scaffold controls the maturation, insertion and reactivity of these great clusters of biology44 and strongly suggests that their history predates the origin of biological N2 fixation by bona fide nitrogenases as we know them today. This is consistent with the recent discovery that [Fe8S9C]-clusters are part of the activation complex of methyl-coenzyme M reductase, which probably preceded the evolution of [Fe8S9C]-cluster-bearing nitrogenase(-like) enzymes45.

Discussion

We aimed to identify the factors determining the unique activities of methylthio-alkane reductases. We revealed that these enzymes contain large nitrogenase metalloclusters, including a P-cluster bridging the MarDK subunits and an [Fe8S9C]-cluster in the active site of MarD, instead of an [Fe4S4]-cluster and a substrate-binding site. Consequently, methylthio-alkane reductase is the only Nfl enzyme with such metalloclusters that cannot perform N2 fixation. The absence in methylthio-alkane reductase of conserved nitrogenase amino acid residues Val70AvNifD, Gln191AvNifD and His195AvNifD involved in the N2-reduction mechanism29 could explain this observation. Moreover, complete nitrogenase activity might require (R)-homocitrate, suggested to be crucial for proton transfer during Mo-nitrogenase catalysis46,47,48. While Mo-nitrogenase is probably more reactive and efficient in electron transport, it cannot break the weaker C–S bond of VOSCs reduced by methylthio-alkane reductases. Our structural data suggest that the specificity of methylthio-alkane reductases for methylated sulfur compounds is probably due to wider substrate channels in comparison with nitrogenases, allowing unrestricted access of these larger compounds to the active site. The insights gained in this study highlight the catalytic potential of the [Fe8S9C]-cluster beyond ammonia formation and open the door to engineering the methylthio-alkane reductase for the biotechnological production of small hydrocarbons such as methane and ethylene.

Methods

Chemicals

Chemicals were acquired from Thermo Fisher Scientific, Carl Roth, Sigma-Aldrich and Honeywell. Gas bottles were purchased from Air Liquide.

Bacterial strains and growth conditions

Modified Rhodobacter capsulatus B10S49 strain MM024617 (ΔnifD::SpR ΔanfHDGK::gmR ΔdraTG ΔmodABC ΔgtaI) was used for the heterologous expression and purification of the methylthio-alkane reductase from R. rubrum. R. capsulatus MM0246 does not encode for a methylthio-alkane reductase4 and was designed for the optimal plasmid-based production of nitrogenase-like enzymes under the Fe-nitrogenase anfH promoter. The advantageous genetic modifications of this production strain include the disruption of native Mo-nitrogenase nifD and Fe-nitrogenase anfHDGK genes with a spectinomycin or a gentamycin resistance cassete, respectively17, the deletion of the nitrogenase posttranslational modification machinery draTG responsible for the reversible inhibition of the nitrogenase reductase component by ADP ribosylation50, the deletion of the modABC transporter responsible for the cellular uptake of Mo, a strong repressor of the anfH promoter51, and the deletion of the quorum sensing gene gtaI responsible for the formation of the capsule, which improves cell pelleting of R. capsulatus during collection52. R. capsulatus was cultured anaerobically under photoheterotrophic conditions (60 W krypton lamps) in modified R. capsulatus Minimal Medium V (RCV) liquid medium17 with the addition of 10 mM serine and 20 µg ml−1 streptomycin sulfate.

Escherichia coli DH5α (Thermo Fisher Scientific) and E. coli ST18 (ref. 53) were used for expression plasmid cloning and conjugation into R. capsulatus by diparental mating54. A list of all strains used in this study is found in Supplementary Table 4.

Molecular cloning

The marBHDK gene cluster (Rru_A0793–Rru_A0796) was amplified from R. rubrum DNA by PCR with Q5 high-fidelity DNA polymerase (New England Biolabs) and cloned under the R. capsulatus anfH promoter via Golden Gate assembly55 using NEBridge Golden Gate Assembly Kit (New England Biolabs) into a broad-host range plasmid pOGG024 (ref. 56) (pMM0165). To purify the methylthio-alkane reductase subunits separately, affinity tags were added to the sequences via Gibson Assembly57 using NEBuilder HiFi DNA Assembly Master Mix (New England Biolabs; pMM0181). A hexahistidine tag was added to the N terminus of MarH and a Strep-tag II to the C terminus of MarD, to purify reductase and catalytic components58. The sequence of pMM0165 and pMM0181 were confirmed by whole-plasmid sequencing (Plasmidsaurus). Primers and plasmids used in this study are listed in Supplementary Tables 5 and 6.

Protein purification and characterization

Purification of Strep-tagged Mar(DK)2 and His-tagged MarH2 from R. capsulatus MM0422 was started by inoculating at an optical density at 660 nm (OD660) of 0.05 modified RCV medium17 with 10 mM serine and 50 µg ml−1 kanamycin sulfate. Liquid cultures were cultivated anaerobically under illumination in 1.2 atm of 100 % argon atmosphere at 30 °C until reaching a final OD660 of ~3.2. Cultures were collected inside an anaerobic COY tent (COY Laboratory Products), with a >95% Ar <5% H2 atmosphere and were spiked with 2 mM sodium DT. Cell pelleting was performed in a Beckman Coulter centrifuge (Beckman Coulter) at 10,000g, for 30 min at 10 °C. The supernatant was discarded, and the cell pellet was resuspended in binding buffer (500 mM NaCl, 50 mM Tris pH 8.5, 10 % glycerol, 2 mM sodium DT, 0.01% Tween 20, 50 mM arginine–glutamate and 20 mM imidazole) supplemented with 0.2 mg ml−1 bovine pancreatic deoxyribonuclease I and one cOmplete EDTA-free protease inhibitor tablet (Sigma-Aldrich) per 30 ml of pellet. Cells were lysed in a French press FA-078AE (Thermo Fisher Scientific) at 20,000 psi. The soluble fraction was separated by ultracentrifugation at 119.028g for 1 h at 4 °C, and filtered before loading onto a Strep-Tactin XT 4Flow high-capacity column (IBA Lifesciences) and a HisTrap HP column (Cytiva) connected to an ÄKTA pure chromatography system (Cytiva) inside an anaerobic COY tent. Binding buffer containing 50 mM biotin was used for the elution of Strep-tagged Mar(DK)2, while binding buffer containing 250 mM imidazole was used for the elution of His-tagged MarH2. Eluted proteins were buffer exchanged in Sephadex G-25 packed PD-10 desalting columns (Cytiva) to desalting buffer (150 mM NaCl, 50 mM Tris pH 7.8, 10% glycerol and 2 mM sodium DT), and concentrated using Amicons Ultra 15 ml filters (Merck) of 30 kDa molecular weight cut-off. Protein yields were determined using Quick Start Bradford 1× Dye Reagent (Bio-Rad Laboratories) following the manufacturer’s instructions. Purified protein was stored in liquid nitrogen until further use. The purity of the proteins was determined by sodium dodecyl sulfate–polyacrylamide gel electrophoresis (SDS–PAGE), the metal content by ICP-OES and the molecular mass by mass photometry as previously described17. In short, mass photometry measurements were carried out on microscope coverslips (1.5 H, 24 × 50 mm; Carl Roth) with CultureWell Reusable Gaskets (CW-50R-1.0, 50–3 mm diameter × 1 mm depth). Measurements were set up in gaskets assembled on microscope coverslips on the stage of a TwoMP mass photometer (Refeyn) with immersion oil. Samples were measured in anaerobic measurement buffer (150 mM NaCl, 50 mM Tris (pH 7.8), 10% glycerol and 10 mM sodium DT). Then, 0.5 µl of sample (500 nM stock concentration, 4 mM sodium DT) was removed from an anaerobic vial, quickly added to 19.5 µl of measurement buffer and mixed on the stage of the mass photometer. Measurements were started ~5 s after removing protein from the anaerobic environment. Data were acquired for 60 s at 100 frames per second using AcquireMP v2.3 (Refeyn). Mass photometry data sets were processed and analysed using DiscoverMP v20222 R1 (Refeyn).

Homocitrate determination

Semi-quantitative determination of homocitrate was performed using high-resolution electrospray liquid chromatography–mass spectrometry. The chromatographic separation was performed on an Agilent Infinity II 1290 HPLC system (Agilent Technologies), using a Kinetex EVO C18 column (150 × 2.1 mm, 3 μm particle size, 100 Å pore size; Phenomenex) connected to a guard column of similar specificity (20 × 2.1 mm, 3 μm particle size; Phenomenex) at a constant flow rate of 0.2 ml min−1 with mobile phase A being 0.1% formic acid in water and phase B being 0.1% formic acid in methanol at 40 °C. The injection volume was 5 µl. The profile of the mobile phase consisted of the following steps and linear gradients: 0–0.5 min constant at 0% B; 0.5–6 min from 0% to 90% B; 6–7 min constant at 90% B; 7–7.1 min from 90% to 0% B; 7.1–12 min constant at 0% B.

An Agilent 6546 Q-TOF mass spectrometer (Agilent Technologies) was used in negative ionization mode with a dual electrospray ionization source: electrospray ionization spray voltage 2,000 V, nozzle voltage 500 V, sheath gas 300 °C at 11 l min−1, nebulizer pressure 45 psig and drying gas 170 °C, at 5 l min−1. The instrument was used in low molecule size mode with a screening window of 50–1,700 m/z. For online mass calibration, 112.9856 and 1033.9881 were used.

Compounds were identified on the basis of their accurate mass within a mass window of 5 ppm and their retention time compared with standards. Chromatograms were integrated using MassHunter software (Agilent). Relative abundance was determined based on the peak area by applying a mass extraction window of 10 ppm.

Methylthio-alkane reductase in vitro activity assays

Enzymatic activity of methylthio-alkane reductase and Mo-nitrogenase was determined by in vitro activity assays with the substrates MT-EtOH, EMS, DMS, protons (under Ar) and acetylene (C2H2, 10% in Ar) measuring production of ethylene (C2H4), ethane (C2H6), methane (CH4), molecular hydrogen (H2) and C2H4 as well as C2H6, respectively. The reductase component (0.15 mg) was added to the catalytic component (0.1 mg) in 700 µl of reaction mix containing 50 mM Tris pH 7.8, 3.5 mM ATP, 7.87 mM MgCl2, 44.6 mM creatine phosphate, 0.2 mg ml−1 creatine phosphokinase, 5 mM DT and 15 mM of the corresponding VOSC in 1.2 atm Ar, if indicated. Due to the tendency of MarH2 to precipitate at high concentrations, all assays were conducted at a catalytic:reductase component molar ratio of 1:5. The reactions took place in 10-ml sealed gas chromatography (GC) vials, at 30 °C and shaking at 200 rpm for 10 min after the addition of the catalytic component. Samples were quenched with 300 µl of 400 mM Na2EDTA pH 8.0 up to a total volume of 1 ml. Content of the headspace was determined using a Clarus 690 GC system (PerkinElmer), as previously described17.

NMR 15NH4 quantification

In vitro nitrogen fixation activity was determined for methylthio-alkane reductase and Mo-nitrogenase by running physiological assays as described above or Eu(II)-DTPA assays, which were performed in a 1.5-ml volume containing 5 mg of the catalytic component Mar(DK)2, 20 mM Eu(II)-DTPA, 25 mM Tris pH 8.0 and 2 mM DT. Both assays conditions were performed under 50% 15N2 (Merck) and 50% Ar at 2 atm in 10 ml GC vials, at 30 °C, shaking at 250 rpm for 5 h (physiological assays) or 15 h (Eu(II)-DTPA assays) quantifying 15NH4 with NMR. Negative controls were prepared under Ar or 14N2 atmospheres. Proteins were removed from the samples by ultrafiltration at 3,000g and 30 min using Amicon Ultra filters (3 kDa cut-off, Merck). Subsequently, 500 µl of filtrate were combined with 50 µl of 10 M H2SO4. Then, 50 µl of d6-DMSO was added as a locking agent to produce a final volume of 600 µl. Two reference samples of 30 mM NH4+ were prepared using natural abundance 15N-NH4Cl in assay buffers.

To quantify the concentration of 14NH4+ in each sample, one-dimensional 1H spectra with excitation sculpting water suppression were acquired using the zgesgp sequence from the Bruker standard sequence library on an Oxford Instruments 600 MHz magnet equipped with an Avance III console and a TXO Helium cooled cryoprobe. To quantify the concentration of 15NH4+, 15N heteronuclear single quantum coherence (HSQC) one-dimensional spectra were acquired on the same hardware using the hsqcetf3gp pulse sequence. Ninety-degree 1H pulse times were calibrated on water. The 1H carrier was set on-resonance at water to ensure optimal water suppression. The 15N carrier was set on-resonance for the ammonium signal at 21 ppm. The 90° 15N pulse time was calibrated on the high signal-to-noise Mo-nitrogenase sample and assumed constant for the other experiments where calibration was impossible due to low signal-to-noise ratios.

Spectra with varying numbers of scans were recorded for all samples dependent on the observable signal. The signal was rescaled linearly by the number of scans and the receiver gain. The spectra were then processed with zero-filling, an exponential window function and Fourier transformation using the nmrGlue python package59.

The 15NH4+ peak was then fitted in the 15N HSQC spectrum using a single Lorentzian function within the predefined grey window in Extended Data Fig. 7. The fitted peak was integrated and scaled against the reference peak to provide a 15NH4+ concentration. A region containing only noise was defined between 0 ppm and 1 ppm, and the standard deviation of this region was multiplied by the fitted full-width-at-half-maximum parameter to estimate the peak integral error.

The 14NH4+ peaks were fitted in the water-suppressed spectrum by first finding all local maxima above a spectrum-specific threshold, adding extra manual locations where needed, and fitting the resulting n Lorentzian functions to the data. The 14NH4+ triplet was identified within these fitted peak locations by identifying the characteristic 14N–1H coupling constant of 52.5 Hz. The integrals of the triplet were summed, and the fitting error was estimated as the standard deviation of the individual triplet peak integrals. These values were scaled against the respective buffer reference samples to provide a concentration of 14NH4+ within the sample.

Two sources of NH4+ signal were assumed: protein degradation and nitrogen fixation. All protein was produced in unenriched media, so it was assumed to contain a natural abundance 996:4 ratio of 14N and 15N. Therefore, we assumed that protein degradation contributes a natural abundance ratio of 14NH4+ and 15NH4+ molecules. To account for this, we subtracted a 4/996 fraction of each 14NH4+ concentration from the corresponding 15NH4+ concentration. The subtracted 15NH4+ signal was then used to calculate the total turnover number.

Cryo-EM sample preparation, collection and processing

MgADP-AlF3-trapped Mar(DK)2(H2)2-complex was prepared by starting a 5-ml reaction with 22 nmol of MarH2 and 3 nmol of Mar(DK)2 in the presence of 100 mM MOPS pH 7.3, 50 mM Tris pH 7.8, 100 mM NaCl, 5 mM DT, 4 mM NaF, 0.2 mM AlCl3, 8 mM MgCl2 and 1 mM ATP. After 3 h incubation at 30 °C and 200 rpm, particles with the appropriate molecular mass were separated via size-exclusion chromatography in a HiLoad 26/600 Superdex 200 pg column (Cytiva) and buffer exchanged to 50 mM Tris pH 7.8, 200 mM NaCl and 5 mM DT. The protein complex with a measured molecular mass of ~368 kDa (Extended Data Fig. 2a,b; theoretical mass 344 kDa) was concentrated in Amicon Ultra filters (30 kDa cut-off, Merck) to 0.3 mg ml−1. Grids were prepared with a Vitrobot Mark IV (Thermo Fisher Scientific) placed inside an anaerobic COY vinyl tent (COY Laboratory Products) with a >95% N2 <5% H2 atmosphere, at 26 °C and 19% humidity. Four microlitres of sample were added to freshly glow-discharged 300 mesh 2/1 µm C-flat copper grids (Electron Microscopy Sciences) and blotted for 10 s at blot force 4 at 100% relative humidity and 4 °C before being plunge frozen in a liquid mix of 37% (v/v) ethane and 63% (v/v) propane.

Cryo-EM data collection was carried out on a Titan Krios G3i electron microscope (Thermo Fisher Scientific) operated at 300 keV in EFTEM mode and equipped with a BioQuantum-K3 imaging filter (Gatan). Data were collected in electron counting mode with EPU (version 3.6) software (Thermo Fisher Scientific) at a nominal magnification of 105,000×, corresponding to a calibrated pixel size of 0.837 Å per pixel, with a total dose of 50 e Å−2 per image, 50 fractions and 2.6 s exposure time. The energy filter slit width was set to 30 eV, and the defocus range was −1.0 µm to −2.5 µm at increments of 0.3 µm.

Cryo-EM data analysis was carried out in CryoSPARC v4.4.1 (ref. 60) (Extended Data Fig. 3). A total of 16,951 micrographs were subjected to patch motion correction and patch contrast transfer function estimation before a template picking using the projections of the Fe-nitrogenase cryo-EM volume as templates (EMD-16890)17. An initial set of 10,625,194 particles were extracted at a box size of 380 pixels and used for three consecutive rounds of 2D classification into 200 classes. Particles from the best 2D classes displaying features of at least one MarH2 bound to Mar(DK)2 were reextracted at a box size of 380 pixels, yielding a total of 3,209,215 particles. These were subjected to ab initio reconstruction into three classes. The best ab initio class with 1,286,797 particles was used for non-uniform refinement, yielding a global reconstruction of 2.45 Å with very poor features around both MarD subunits. Hard 3D classification with a local mask on the proximal MarD subunit and 5 Å target resolution resulted in 25 different classes, including classes completely or partially lacking MarD. Particles of the largest class (116,370 particles) displayed the best alignment with reduced particle orientation bias and were subjected to local refinement with a local mask on the proximal MarD and non-uniform refinement, yielding electron density maps with resolutions of 2.86 Å and 2.77 Å, respectively. The final electron density map was obtained by performing a local refinement using a solvent mask yielding a final resolution of 2.75 Å. The resulting map was autosharpened in Phenix v1.21.1 (ref. 61). Details on cryo-EM data collection, model building and refinement statistics can be found in Supplementary Table 1.

Model building and refinement

The initial rigid-body fit of an AlphaFold2 (ref. 62) multimer model of the Mar(DK)2H2-complex was performed in ChimeraX-1.7.1 (ref. 63). Afterwards, manual fitting and refinement of all ligands was performed in Coot v0.8.9.2 (ref. 64) using the restraint files from the Coot ligand database (ADP, AF3 (AlF3), SF4 ([Fe4S4]-cluster), CLF (P-cluster) and S5Q ([Fe8S9C]-cluster)) allowing bond length deviation between 0.044 Å and 0.124 Å for the metalloclusters as specified in restraint files. Then, automatic model refinements were performed with phenix.real_space_refine of the Phenix software package (Phenix v1.21.1)61 using the same restrain files. For regions of the map that were not resolved well enough, the corresponding parts of the model were removed. Additional cycles of manual refinements in Coot as well as automatic refinements with phenix.real_space_refine were performed to obtain the final model, always applying ligand restraints.

Substrate channel estimation

Substrate access channels to the active site of the methylthio-alkane reductase were calculated using the tool CAVER Analyst 2.0 BETA36. The starting point for channel calculating was the proposed binding site S2B of the [Fe8S9C]-cluster. The probe radius, shell radius and shell depth were set to 0.6, 3.0 and 4.0 Å, respectively. The two most likely substrate channels, with a maximum bottleneck radius of 27.5 Å and 25.6 Å, respectively, were selected for analysis (Fig. 4b,f).

EPR spectroscopy

All EPR samples were prepared under anaerobic conditions in a Coy anaerobic glove box. All protein samples were buffer exchanged with Sephadex G-25 packed PD-10 desalting columns (Cytiva) into 500 mM NaCl, 50 mM Tris pH 8.5 and 10% glycerol to remove excess DT before sample preparation.

For the redox titrations, samples were titrated in the presence of a cocktail of 40 µM of each redox mediator through the addition of different volumes of buffered DT or potassium ferricyanide solutions (1.5–100 mM) until the desired redox potential was reached. The mediator mix consisted of 2,6-dichlorophenolindophenol (E0 = +217 mV), phenazine methosulfate (E0 = +55 mV), methylene blue (E0 = +11 mV), resorufin (E0 = −51 mV), IDS (E0 = −125 mV), 2-hydroxy-1,4-napthaquinone (E0 = −152 mV), sodium anthraquinone 2-sulfonate (E0 = −225 mV), phenosafranin (E0 = −252 mV), safranin O (E0 = −289 mV), neutral red (E0 = −329 mV), benzyl viologen (E0 = −358 mV) and methyl viologen (E0 = −449 mV). Upon stabilization of the redox potential, a 300-µl sample was withdrawn, transferred to an EPR quartz tube, capped and shock frozen in liquid nitrogen. The InLab Argenthal microelectrode (Ag/AgCl, E = +207 mV versus H2/H+, with combined Pt counter electrode) was calibrated with a quinhydrone saturated pH 7 reference buffer solution (E = +285 mV versus H2/H+ at 25 °C).

MarH2 samples were prepared by gentle addition of anaerobic stock solutions of the compounds of interest (2 mM DT, 5 mM MgATP, final concentration) to the protein, incubated for at least 3 min and frozen as for the redox titration. For the turnover experiment Mar(DK)2 was oxidized by addition of 2 equiv. IDS and incubated for 15 min. DT-reduced MarH2 and the IDS-oxidized Mar(DK)2 were desalted. Directly after desalting, 300-µl samples of MarH2 and oxidized and Mar(DK)2 were frozen separately. DT-removed MarH2 and IDS-removed Mar(DK)2 were mixed in ratio of 1.2:1 reductase:catalytic component with MgATP (5 mM final concentration) incubated for 1 min, concentrated with an Amicon Ultra filter (100 kDa cut-off, Merck) to a final volume of 300 µl and shock frozen. Nif(DK)2 was purified as described18, desalted and treated with 2 mM DT, 0.5 mM IDS or 50 µM IDS.

Cw-EPR spectra were recorded with a commercial Bruker spectrometer composed of an X-band E580-10/12 bridge and a 4122HQE resonator, for regular (perpendicular mode, for Fig. 3 and Extended Data Fig. 8) and an ER4116 dual mode cavity for parallel mode measurements (for Nif(DK)2 and trials to detect g = 12 or g = 16 signals in Mar(DK)2) measurements. The system was equipped with an Oxford Instruments temperature controller and ESR 900 cryostat, which was cryocooled by a Stinger (Cold Edge Technologies) linked to helium compressor (Sumitomo F-70). Spectral simulations were performed with GeeStrain5 (ref. 65) or in Excel (Gaussian curves representing the absorption-shaped highest g-value of the very anisostropic doublets of the S = 3/2 system of MarH2, or the first derivative of a Gaussian curve for the isotropic g = 4.3 EPR signal of the middle doublet of high spin ferric iron). Except for Fig. 3f and Extended Data Fig. 9, five-point moving averages were used to limit the noise of EPR spectra.

Phylogenetic analysis and ancestor prediction

For the phylogeny in Extended Data Fig. 10, sequences from Methanolobus bombayensis Ni2+-sirohydrochlorin a,c-diamide reductive cyclase complex component D (CfbD, WP_209620388.1), R. rubrum methylthio-alkane reductase MarDK operon (MarD, WP_011388552.1; MarK WP_011388551.1), A. vinelandii nitrogenase NifDKEN operon (NifD, WP_012698832.1; NifK, WP_012698833.1; NifE, WP_012698838.1; NifN, WP_012698839.1), R. capsulatus chlorophyllide a oxidoreductase BchYZ operon (BchY, WP_023914767.1; BchZ WP_023914765.1) and DPOR BchNB operon (BchN, WP_023913723.1; BchB WP_013066408.1) were used as queries across major archaeal and bacterial lineages, collecting a total of 996 sequences using the web interface of NCBI pblast. Sequences were collected iteratively between 2021 and 2024, and filtered to retain only those with E-values ≤1 × 10−30 and query coverage >70%. Sequences were initially assigned as bona fide nitrogenase components through synteny, for cases where NifDKEN are in synteny, because operons of other relatives of this family contain only DK paralogues. Further classification during dataset assembly was achieved through reciprocal best-blast hits. Ultimately, the classification on our tree is based on branching relationships of known sequences in each clade. Sequences were aligned using Muscle v5 (ref. 66) and further refined using clipKIT67. We have also realigned our sequences three times each with two different algorithms (geneafpair and globalpair algorithms with option -maxiterate 3000) in MAFFT, and verified that the alignment is robust—specifically, that all reported cysteines are consistently aligned in each case. The resulting phylogenetic trees were constructed using Iqtree2 (ref. 68) under the LG + G4 model, with 35,000 ultrafast bootstraps and 1500 SH-aLRT replicates69. Model selection for evolution was based on the corrected Akaike information criterion (-m MFP -merit AICc option in Iqtree2). The resulting tree was constrained to have BchY and BchN sister to NifDE and nitrogenase-like proteins (NflD) including MarD, and BchB and BchZ clades sister to NifKN, NflK and MarK. Every other node was polytomized. The resulting maximum likelihood analysis of the constrained tree shows similar likelihood values as for the unconstrained tree and it was not rejected by any statistical tree topology test performed with iqtree. This topology was chosen for analysis owing to the more parsimonious history in terms of gene duplication: all nitrogenase relatives other than CfbD are heterotetramers made from two paralogues. In our constrained topology, the tree implies that a single gene duplication of an ancestral homomer produced this arrangement. Subsequent operon duplications would then produce the different types of nitrogenase-like complex, each also consisting of two paralogues (including the maturases of nitrogenases). In the unconstrained tree, all BchNBYZ paralogues form a monophyletic sister group to all other nitrogenase-like genes. This would imply that Bch separately underwent a duplication of an ancestral homomer, which then underwent an operon duplication to yield BchNB and BchYZ. This implies one additional gene duplication compared with our constrained tree and, thus, less parsimony. Supporting values for this topology were calculated with 500 Felsenstein bootstrap analysis using Iqtree2 and the approximate likelihood ratio test (aLRT) using PhyML. Ancestral sequence reconstruction for the nodes of interest was performed using a wrapper script that uses Iqtree2 for ancestral sequence inference using the LG + G4 model and PAUPv4 for a Fitch parsimony inference of which sites should be gaps in our alignment.

AlphaFold Structure Prediction

The structure of the last common nitrogenase and methylthio-alkane reductase ancestor AncD and AncK was predicted by AlphaFold 262 using ColabFold70 with default parameters (without templates, model_type: alphafold2_multimer_v3, num_relax: 1).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.