Introduction

Photosynthesis converts solar energy into chemical energy and ultimately provides energy for all life on Earth. Among phototrophic organisms, purple bacterial photosynthetic metabolism is anoxygenic, and their light reactions occur in pigment–protein complexes within intracytoplasmic membranes (ICM). These complexes include a light-harvesting 1 complex (LH1), light-harvesting 2 complex (LH2), a reaction center (RC), cytochrome bc1 (Cyt bc1), and ATP synthases (ATPases). The RC is surrounded by the LH1 complex to form the LH1–RC core complex present in all purple phototrophic bacteria. By contrast, the LH2 complex is located outside the LH1–RC and is called the peripheral antenna and present in only some species1,2. LH1 and LH2 complexes are encoded by pufBA genes within the photosynthetic gene cluster and pucBA genes located outside the photosynthetic gene cluster, respectively, but exhibit conserved amino acid sequences and structures3,4,5.

In recent years, many structures of LH1–RC and LH2 complexes have been solved using single-particle cryo-electron microscopy (cryo-EM), and essential structural principles have emerged that have significantly advanced our understanding of these complexes from a functional standpoint. Complexes from different species exhibit significant diversity in both composition and topology. All core complexes consist of LH1 surrounding the RC to form monomeric or dimeric open or closed structures, whereas all LH2 complexes are closed circular structures that vary considerably in their specific details. Most LH1–RC complexes exist as monomers with the RC surrounded by a closed LH1 ring composed of 16–17 pairs of αβ-polypeptides6,7,8,9,10,11,12,13 or an open ring composed of 10–15 pairs of αβ-polypeptides plus auxiliary small transmembrane polypeptides11,14,15,16,17,18,19,20. Dimeric LH1–RC core structures are present in some Rhodobacter species21,22 and the alkaliphilic purple bacterium Rhodobaca (Rca.) bogoriensis23 and are characterized by continuous S-shaped LH1 arrays surrounding two RCs. Additionally, the LH1–RC core complex of Gemmatimonas (G.) phototrophica consists of two concentric rings surrounding the reaction center including an inner ring of 16 subunits and an outer ring of 24 subunits24.

In contrast to LH1 complexes, all characterized LH2 complexes form closed rings composed of 7–9 pairs of αβ-polypeptides25,26,27,28,29. Nevertheless, LH2 complexes share several structural features in common with LH1 complexes; both are large oligomers formed from multiple pairs of αβ-heterodimers and non-covalently bound molecules of bacteriochlorophyll (BChl) and carotenoid. Because of this structural similarity, it has been proposed that a common ancestor gave rise to both LH1 and LH2 complexes, with LH2 genes arising from gene duplications in an LH1-containing ancestor3,30,31. Although an appealing hypothesis, there is currently little experimental evidence to firmly support this scenario, leaving this important aspect of photosynthetic evolution unresolved.

Roseospirillum (Rss.) parvum is a mesophilic marine purple bacterium within the Alphaproteobacteria32, and its unusual absorption spectrum indicates that its light-harvesting complexes contain two types of BChl a with Qy transitions at 909 nm (corresponding to LH1-Qy) and 805 nm (corresponding to LH2-Qy but lacking the characteristic Qy at ~850 nm of classical LH2 complexes)33. Compared with the thoroughly investigated LH1 complexes from Thermochromatium (Tch.) tepidum34 and Thiorhodovibrio (Trv.) frisius strain 9708 in which calcium ions bound within the complex are critical to the red shift of LH1-Qy, the red-shifted photocomplexes of Rss. parvum result from a mechanism distinct from that of calcium binding based on preliminary analyses of amino acid sequence alignments35. Moreover, the Rss. parvum LH1–RC complex also differs from previously reported properties of the LH1–RC complex from the BChl a-containing filamentous and thermophilic phototroph Roseiflexus (Rfl.) castenholzii, a complex that absorbs at 804 and 880 nm14,20.

To explore the structural basis for the unusual absorption of Rss. parvum, we detail here the cryo-EM structures of the two photosynthetic complexes produced by this unusual phototroph. In addition to its LH1–RC core complex, Rss. parvum produces an LH1-only complex that we have designated LH1′. Although encoded by pufBA, LH1′ is smaller than LH1 in the Rss. parvum core complex, does not encircle a RC, is ring-shaped rather than elliptical, and functionally resembles LH2 of purple bacteria. Collectively, our results provide structural details of hitherto unseen photocomplexes, reveal mechanisms that underlie the unique absorption properties of both Rss. parvum photocomplexes and offer fresh insight into the evolution of purple bacterial photosynthetic complexes.

Results

Overall structure of the Rss. parvum LH1–RC and LH1′ complexes

The LH1–RC core complex and the LH1′ complex purified from Rss. parvum using one-step solubilization followed by DEAE chromatography and characterized by a series of biochemical methods, exhibited absorption maxima (Qy) at 908 and 905 nm, respectively (Supplementary Fig. 1); the cryo-EM structures of these complexes were then determined at resolutions of 2.65 Å and 2.35 Å, respectively (Table 1 and Supplementary Figs. 24). The LH1 complex of Rss. parvum consists of 16 pairs of helical αβ-polypeptides (αβ-dimer), 47 BChls a, and 16 trans-spirilloxanthins, forming a closed elliptical structure surrounding the RC (Fig. 1a, b). The long and short dimensions of the outer LH1-β polypeptide ring (distances between the outer edge of opposite helices) were 116.3 Å and 109.7 Å, respectively (Fig. 1b). The RC is accommodated in the LH1 ellipsoid and fits the shape of the inner LH1-α ring with a height of approximately 126.2 Å (Fig. 1a). The structure of the Rss. parvum RC subunits are nearly identical to those of corresponding subunits from the Tch. tepidum RC with a root-mean-square deviation (RMSD) value of 0.786 Å (Supplementary Fig. 5a). However, a major difference between them is that the Rss. parvum RC contains a complete N-terminus of its Cyt subunit (Supplementary Fig. 4c, Supplementary Fig. 5a, and see later discussion).

Table 1 Cryo-EM data collection, refinement and validation statistics of the Rss. parvum LH1–RC and LH1′ complexes
Fig. 1: Structural overviews of the LH1–RC core complex and LH1′ complex from Rss. parvum.
figure 1

a Side view of the LH1–RC complex parallel to the membrane plane with the periplasmic C-subunit above and the cytoplasmic H-subunit below. b Top view of the LH1–RC from the periplasmic side of the membrane. c Tilted view of the cofactors arrangement in LH1–RC complex. Dotted circle indicates the position that lacks a B803 molecule. d Side view of the LH1′ complex parallel to the membrane plane. e Top view of the LH1′ complex from the periplasmic side of the membrane. f Tilted view of the cofactors arrangement in LH1′ complex. The dimensions of the two complexes are represented and the color scheme is shown below the panels. Phospholipids are omitted for clarity. Abbreviations: BPhe a, bacteriopheophytin a; Spx, spirilloxanthin; MQ-8, menaquinone-8.

Cofactors in the Rss. parvum RC include 4 BChls a, 2 bacteriopheophytins (BPhe) a, 4 heme molecules, one 15-cis-spirilloxanthin and one menaquinone (MQ)-8 at the QA site; no ubiquinone (UQ) was observed at the QB site (Supplementary Fig. 5b, Supplementary Fig. 6). In contrast to most purple bacterial LH1–RC complexes, the BChls a in the Rss. parvum LH1 ring are distributed in a double-layered ring, similar to that in the LH1–RC complex of Rfl. castenholzii. One group of Rss. parvum BChls consists of closely stacked dimeric BChls a with a Qy maximum at 908 nm (B908), whereas the other group is composed of monomeric BChl a, with maximum absorption at 803 nm (B803) (Fig. 1c, Supplementary Fig. 1c). Each pair of LH1 αβ-polypeptides binds two B908 molecules on the periplasmic side, one B803 molecule on the cytoplasmic side, and one trans-spirilloxanthin that spans the transmembrane region. Notably, however, there is no B803 located between the first and sixteenth pairs of αβ-polypeptides, and this likely forms the channel necessary for quinone exchange (Fig. 1c).

The overall structure of the Rss. parvum LH1′ complex is a closed double ring consisting of 14 pairs of αβ-polypeptides encoded by pufBA, 42 BChls a, and 14 trans-spirilloxanthins (Fig. 1d, e). The inner ring formed by 14 α-polypeptides has a diameter of 58.8 Å, and the outer ring formed by β-polypeptides has a diameter of 103.8 Å (Fig. 1e). Like in its LH1, BChls a in the Rss. parvum LH1′ complex are also arranged in a double-layered ring, with the bacteriochlorophyll ring clustered near the periplasmic side composed of 14 pairs of dimeric BChls a with maximum absorption at 905 nm (B905) (Fig. 1f, Supplementary Fig. 1d). The latter Qy is slightly blue-shifted compared to B908 of LH1 but redshifted ~50 nm compared to the maximum absorption of LH2 complexes, most of which absorb near 850 nm. The bacteriochlorophyll ring clustered near the cytoplasmic side of the membrane consists of 14 monomeric BChls a, with maximum absorption at 803 nm (B803) (Fig. 1f, Supplementary Fig. 1d), similar to all previously reported LH1 and LH2 complexes. Additionally, 14 trans-spirilloxanthin molecules are positioned between each pair of LH1′ αβ-polypeptides with no apparent interactions with these polypeptides or with BChls a.

Unique features of the Rss. parvum LH1 complex

Within each pair of Rss. parvum LH1 αβ-polypeptides, B908 molecules are coordinated with α-His33 and β-His40, and the C31-acetyl groups of BChl a molecules form hydrogen bonds with α-Trp44 and β-Trp50 (Fig. 2a, b). Sequence alignment reveals that the conserved alanine, glycine or valine at position 36 of the β-polypeptide is replaced by cysteine in Rss. parvum36, although cysteine is rarely found in purple bacterial light-harvesting antenna proteins (Fig. 3). Base on the structure, we found that the thiol group of β-Cys36 can form two hydrogen bonds with the C131-keto and C132-ester groups of the α-B908 molecule (Fig. 2b). These additional hydrogen bonds further strengthen the interaction between pigments and their neighboring polypeptides, stabilizing the conformation of BChls a. In addition to β-Cys36, there is another Cys residue at position α-Cys36, which is similar to that found in the LH1 α1 polypeptide of Halorhodospira (Hlr.) halochloris37 (Fig. 3). Although the thiol group of α-Cys36 in Rss. parvum is distant from the bacteriochlorophyll ring and does not form hydrogen bonds, it is in the vicinity of the conserved His coordinating α-BChl a, which likely also contributes to the redshift of the LH1 Qy band.

Fig. 2: Structural details of the LH1 ring.
figure 2

a Arrangement of cofactors in an LH1 unit. The coordinated amino acid residues with BChl a molecules are shown by sticks and labeled for clarity. The color codes of polypeptides and pigments are the same as in Fig. 1. b Hydrogen bonding site around B908 in LH1 αβ-polypeptide. Both the residues involved in hydrogen bond interaction and hydrogen bond length are labeled. c Hydrogen bonding site around B803 in LH1 αβ-polypeptide. An aromatic residue involved in hydrogen bonding with B803 is also labeled. d Comparison of BChl a molecules orientation of Rss. parvum (green) with those of Rba. sphaeroides (light pink, PDB:7PBW), Rbl. acidophilus (pale yellow, PDB:1NKZ), Phs. molischianum (blue, PDB:1LGH), Mch. purpuratum (red, PDB:6ZXA), Rfl. castenholzii (cyan, PDB:8IUG) and G. phototrophica (pale cyan, PDB:7O0U). Their αβ-polypeptides are shown by grey white cartoon and carotenoid molecules are omitted for clarity. e Side view of a surface representation showing a channel (red dashed circle) in the LH1 ring which lacks a B803 molecule. Other channels are sealed by the B803 molecules (cyan). f The quinone-exchange channel between Rss. parvum LH1 α1- and α16-polypeptides. The surface representation LH1 α1- and α16-polypeptides are colored in pale cyan and are shown by the yellow ribbon simultaneously. The residues forming this hydrophobic channel are shown by magenta sticks.

Fig. 3: Sequence alignments of α- and β-polypeptides from phototrophic bacterial LH1 complexes.
figure 3

The sequences are aligned relative to the histidine (or asparagine) residues coordinating BChl molecules. The conserved residues that coordinate and/or hydrogen-bond (mainly C31-acetyl) to dimeric BChl molecules are shown in red and green, respectively. The non-conserved residues that coordinate and/or hydrogen-bond (mainly C31-acetyl) to monomeric BChl molecules (Qy absorption at 800 or 820 nm) are shown in blue and orange, respectively. The acidic residues of α- and β-polypeptides are colored in purple. The cysteine residues in Rss. parvum and Hlr. halochloris are colored in magenta. The WxxDxI or WxxDxV motifs present in Ca2+-binding LH1 polypeptides are marked by the dashed box.

A total of 15 monomeric B803 molecules exist in the Rss. parvum LH1 complex, and the porphyrin plane of B803 is nearly perpendicular to that of B908 (Fig. 1c). Nevertheless, the orientation and position of the B803 bacteriochlorophyll ring differs (Fig. 2d). The average distance between Mg in adjacent LH1 B803 molecules is 19.77 Å (Supplementary Table 1), shorter than that in all known LH2 complexes and indicating relatively strong coupling between monomeric molecules of BChl a in Rss. parvum. Although the conformation of the Rss. parvum dimeric BChl a is similar to that in the Rfl. castenholzii LH1–RC and G. phototrophica LHh (outer ring), the orientation of the Rss. parvum monomeric BChl a molecules differs as shown in Fig. 2d and is more similar to that of LH2 complexes25,26,27,28,29. In the Rfl. castenholzii LH1–RC14, the B804 molecule is coordinated with β-His26, whereas in G. phototrophica LHh24, there is no evident coordination for the central Mg2+ of the B800 BChl a. By contrast, in Rss. parvum, the central Mg2+ of the B803 molecule is coordinated with an α-N-terminal methionine that is not acylated (Fig. 2a, c). Furthermore, a strong hydrogen bond (2.6 Å) is formed between the C31-acetyl group of B803 and the nearby β-Tyr25 that further stabilizes B803 and likely affects its Qy absorption (Fig. 2c).

Another distinctive characteristic of the Rss. parvum LH1–RC and LH1′ complexes is the presence of several negative charges on the periplasmic side of the membrane (Supplementary Fig. 7a, b). A similar distribution is also observed in Hlr. halochloris LH1–RC complex38, which has a higher content of acidic residues compared to other phototrophic bacteria (Fig. 3). Supplementary Fig. 7c, d illustrate the pH-dependent spectral changes observed in the purified LH1–RC and LH1′ complexes. As the pH decreases (below 7.5), the intensity of the LH1 Qy band at 908 nm undergoes a significant decrease and gradual blue-shift. When the pH drops by two full units to 5.5, the LH1 Qy band at 893 nm does not further blue-shift. At the same time, the LH1 Qy band at 803 nm remains virtually unaffected by this pH decrease (Supplementary Fig. 7c). Similarly, the LH1′ Qy at 905 nm blue-shifted to 890 nm at pH 5.5 or 5.0 (Supplementary Fig. 7d). All of these changes are nearly reversible when the pH returns to neutrality (Supplementary Fig. 8a, b). Moreover, it was found that relatively slight spectral changes (6 and 9 nm) occurred when the LH1–RC and LH1′ complexes were suspended in an acidic buffer (pH 5.0) containing 0.03% DDM, and these changes were also reversible (Supplementary Fig. 8c, d).

As previously mentioned, due to the absence of the 16th B803 molecule, a pore forms between the first and sixteenth pairs of αβ-polypeptides in Rss. parvum LH1 (Figs. 1c, 2e). This likely functions as a channel for quinone exchange and as such is surrounded by multiple hydrophobic amino acids including Leu18, Val19, Ala22, and Val26 in α1 and Phe12, Thr17, Ala20, Leu21, and Phe24 in α16 (Fig. 2f). Furthermore, the tail of the α-B908 molecule and the polyene chain of spirilloxanthin, which are bound to the first pair of αβ-polypeptides, lie close to this channel, contributing to its hydrophobicity.

The Rss. parvum RC complex

The structure of the Rss. parvum RC is similar to known purple bacterial RCs; however, the N-terminal domain of the Cyt subunit differs. Hydrophobicity analysis of the Cyt subunit revealed a transmembrane helix from Ala23 to Met45 (Supplementary Fig. 9a), consistent with the cryo-EM map, where a helix formed by 17 amino acids (Val27–Val43) inserts into the membrane, functioning as an anchor (Supplementary Fig. 4c, Supplementary Fig. 5a). This arrangement has also been observed in the RC Cyt subunits of Rfl. castenholzii14 and the acidophilic purple bacterium Rhodopila (Rpi.) globiformis13. Structurally, this unique configuration not only tightly binds the Cyt subunit to the membrane but also stabilizes the entire LH1–RC complex. Positionally, the N-terminal transmembrane helix is located near the first and second LH1 α-polypeptides, enhancing the connectivity of the entire complex without affecting the binding of B908 and B803 molecules (Fig. 4a, b). Comparison of the Rss. parvum RC with the Rpi. globiformis RC reveals slight differences in the length and position of the transmembrane helix of the Cyt subunit, but both are embedded in the membrane at similar angles and their N-terminal loop regions are highly similar (Fig. 4c, d). Previous studies have shown that the C-terminus of the LH1–RC protein PufX in Rhodobacter (Rba.) sphaeroides overlaps with the N-terminal loop region of the Cyt subunit in Rpi. globiformis in terms of conformation and position, suggesting a potential evolutionary link between PufX and the N-terminal region of the RC-bound Cyt subunit13. Similarly, aligning the Rss. parvum RC with the Rba. sphaeroides LH1–RC also reveals overlap between the position and conformation of the C-terminus of PufX (Leu55-Gly69) and the N-terminal loop region of the Rss. parvum Cyt subunit (Met44-Leu58) (Fig. 4e, f). Additionally, the Cyt subunit of Rfl. castenholzii also has a complete N-terminus, but this portion is located in a different position than is PufX (Supplementary Fig. 9b). In the Rss. parvum RC, only MQ-8 was identified at the QA site, whereas no quinones were observed at the QB site (Supplementary Fig. 6). This observation was common in early crystal structure studies of RC complexes39,40,41 mainly due to the dissociation of QB from its binding site during the purification process, and this is likely the explanation for the absence of a quinone in this site in the Rss. parvum RC.

Fig. 4: The features of the RC subunits in Rss. parvum and structural similarities between PufX and N-terminus of the RC-bound Cyt subunits.
figure 4

a, b The features of the N-terminus of Rss. parvum Cyt subunit. The color codes of αβ-polypeptides are gray, while all other color codes are the same as in Fig. 1. c Superposition of the Cα carbons of RC subunits between Rpi. globiformis (PDB: 7XXF) and the Rss. parvum (this work). The N-terminus of the Cyt subunits of Rss. parvum and Rpi. globiformis are shown by light blue and pale-yellow cartoons, respectively. The other portions of the Rss. parvum RC are colored the same as those in Fig. 1, while those of Rpi. globiformis are colored gray, except for the heme groups, which are shown as yellow sticks. d Expanded view marked in (c) showing that the N-terminal loop region (Met44-Asp65) of the Rss. parvum Cyt subunit overlaps well with that (Phe27-Tyr48) of Rpi. globiformis. For clear presentation, (d) has been slightly rotated relative to the dash box in (c). e Superposition of the Cα carbons of the RC-M subunits between the Rba. sphaeroides monomeric LH1–RC (PDB: 7F0L) and Rss. parvum RC. PufX is shown as a cyan cartoon and the other portions of the complex are shown as gray ribbons, except for Protein-U, which is shown as a hot pink ribbon. f Expanded view marked in (e) showing that the C-terminus (Leu55–Gly69) of PufX overlaps positionally and conformationally with the N-terminal loop region (Met44-Gly58, main chain shown) of the Rss. parvum Cyt subunit.

Unique features of the Rss. parvum LH1′ complex

Although the Rss. parvum LH1′ shows features common in LH2 complexes, genome analysis failed to reveal pucBA, the genes that normally encode LH2 complexes. Because there is only one set of pufBA in the Rss. parvum genome, the LH1′ αβ- polypeptides must also be encoded by pufBA. From a structural point of view, however, Rss. parvum LH1 and LH1′ are not identical; the major differences are that LH1′ does not encircle a RC and is smaller than LH1. As regards the latter, there are 14 αβ-polypeptides in the LH1′ complex compared with 16 in LH1 (Fig. 1d, e). Other than these two major differences, the arrangement and orientation of pigment molecules and the binding mode of BChl a molecules and their hydrogen bonding interactions with surrounding polypeptides in the Rss. parvum LH1′ complex are similar to those in the LH1 complex.

In the Rss. parvum LH1′ complex, the average Mg–Mg distances were 9.30 Å within a dimer and 8.32 Å between dimers (Supplementary Table 1), almost the same as those in the LH1–RC complex. However, unlike the LH1–RC complex that lacks one B803 molecule to form an open cavity, both layers of bacteriochlorophyll rings in the LH1′ complex are intact, similar to that of classical LH2 complexes. The average Mg–Mg distance between B803 BChl monomers was 20.05 Å (Supplementary Table 1), which is slightly shorter than that of classical LH2 complexes. In summary, then, although encoded by the same genes as those that encode LH1, Rss. parvum LH1′ is smaller than its LH1 and neither contacts nor interacts with the RC. Thus, Rss. parvum LH1′ functionally resembles LH2 complexes, and its unique structure may be a prerequisite for capturing wavelengths above 900 nm, radiation virtually unabsorbed by classical LH2 complexes.

Discussion

The presence of two light-harvesting antenna complexes in purple phototrophic bacteria is not uncommon, but when present, the two antenna complexes are typically encoded by distinct gene sets. Here we have solved the high-resolution structures of the LH1–RC complex and an accompanying RC-free complex we refer to as LH1′, both encoded by the genes pufBA. Together, the two complexes constitute the unique photosystem of Rss. parvum.

These two complexes show several highly unusual features, and in particular, a significant red shift in Qy transition from that of LH1 and LH2 complexes of most purple bacteria. There are several factors that could account for this red shift as well as some factors known to trigger a red shift that are likely not involved; both factors are considered here. Although the Qy transitions of the Rss. parvum LH1 and LH1′ complexes lie beyond 900 nm—more than 30 nm and 50 nm red-shifted compared with other LH1 and LH2 complexes, respectively, the average Mg–Mg distances in the Rss. parvum LH1 complex were 9.30 Å within a dimer and 8.25 Å between dimers (Supplementary Table 1). These values are similar to those of Rhodospirillum (Rsp.) rubrum and Rba. sphaeroides9,16 and the orientations of the Qy dipole moments of dimeric BChl a molecules are identical, indicating that exciton coupling is unlikely to be the decisive trigger of the Rss. parvum Qy red shift. In addition, although the LH1 Qy absorption of Rss. parvum is similar to that of Tch. tepidum42, in this thermophilic purple sulfur bacterium a Ca2+-coordination network contributes significantly to the Qy red shift6,43,44; such a network also contributes to the LH1 red shifts observed in Trv. frisius strain 9708 and Allochromatium (Alc.) tepidum12. However, other than the Mg2+ in BChl a and unbound Mg2+ in the RC, no Ca2+ or other metal ions were detected in the LH1–RC complex of Rss. parvum and so the mechanism underlying the Qy red shift in Rss. parvum is clearly distinct from that in these calcium-dependent purple bacteria.

The red shift in both Rss. parvum photocomplexes is likely related to hydrogen-bonding interactions between BChls and neighboring proteins and specific polypeptide sequences that collectively create an environment that affects the Qy transition. Previous studies have shown that an interacting triangle formed with BChl a–Trp47–His48–BChl a in Trv. frisius strain 970 enhances hydrogen-bonding interactions between BChl a and polypeptides and promotes a red shift of the LH1 Qy8,45. In Rss. parvum, besides the conserved hydrogen-bonding interactions between α-Trp44 and β-Trp50 with the BChl a C31-acetyl group, the sulfhydryl group of β-Cys36 can form two additional hydrogen bonds with the C131-keto and C132-ester groups of α-B908 molecules; these enhanced hydrogen-bonding interactions may play an important role in the Rss. parvum Qy red shift.

In some LH1–RC complexes, the extended C-terminal domains of the LH1 αβ-polypeptides are homogeneously arranged to form rigid structures, which may be one of the significant factors contributing to the red shift of the LH1 Qy band8,46. Although the N/C-terminus of Rss. parvum LH1αβ-polypeptides was not fully resolved in the two complexes, the C-terminal chain of the LH1 β-polypeptides was slightly longer and contained several neutral and nonpolar amino acids (Fig. 3). Together, these provide a strong hydrophobic environment on the periplasmic side that stabilizes BChl a and likely contributes to the Qy red shift. Furthermore, there were several negative charges on the periplasmic side of the two complexes of Rss. parvum (Supplementary Fig. 7a, b). Similar distributions were observed in the Hlr. halochloris LH1–RC complex, which included a large number of negatively charged amino acids at the C-terminus of the αβ-polypeptides38,46. As the pH decreased, the Qy band of LH1–RC and LH1′ complexes of Rss. parvum and the LH1–RC band of Hlr. halochloris exhibited reversible spectral changes likely associated with the distribution of electrostatic charges around BChl molecules44,46 (Supplementary Fig. 8). However, the Qy shift is a complicated process, and it was found that both pH and detergents can play a role in it47,48 (Supplementary Fig. 8). Additionally, the unusual cysteine residues of Rss. parvum, especially α-Cys36, located near the conserved His and virtually identical in position to the cysteine residue of LH1 α1 in Hlr. halochloris46, represents a potential candidate for modulating the point charge surrounding BChl a molecules, which is also responsible for the pH-dependent reversible spectral changes in LH1 Qy absorption. Given that the subunit composition and pigment arrangement of the Rss. parvum LH1′ complex is very similar to that of its LH1–RC complex, the above factors also explain the red shift in the LH1′ complex. In addition, however, due to subtle differences in Mg-Mg distances and pigment-polypeptide interactions between the two complexes, the LH1′ Qy is blue-shifted by 3 nm compared with the LH1–RC complex, revealing a possible mechanism for energy transfer from LH1′ to the core complex.

Another unusual feature of the Rss. parvum LH1 complex is the binding of two layers of BChl a of which the B803 molecules close off the channel between adjacent LH1 αβ-polypeptides, as is also the case in the Rfl. castenholzii LH1–RC complex14. However, the absence of a B803 molecule between the 1st and 16th LH1 subunits of Rss. parvum creates a distinct pore, quite likely the channel necessary for quinone exchange (Fig. 2e, Supplementary Fig. 10a). Such a gap is analogous to the pore formed in the Rpi. globiformis and Blastochloris (Blc.) viridis LH1–RCs by the absence of the γ-like polypeptide13 or γ-polypeptide7, respectively (Supplementary Fig. 10b, c). In the Rfl. castenholzii LH1–RC complex, the gap created by the insertion of the Cyt-TM (C-subunit transmembrane helix) and TMx (the subunit X) between the 1st and 15th pairs of αβ-polypeptides is considered to be the quinone channel14,20 (Supplementary Fig. 10d); this is similar to the LH1 ring gap formed by PufX or protein-W in Rba. sphaeroides15,16,21,22,49 and Rps. palustris11, respectively. Additionally, the quinone channel in the Rss. parvum LH1 complex is formed by several hydrophobic amino acids in the 1st and 16th α-polypeptides (Fig. 2f). Sequence alignments show that amino acids near the quinone channels (formed by adjacent LH1 αβ-polypeptides) in LH1 complexes are not identical but do share similar hydrophobic properties (Supplementary Fig. 11). Therefore, this structural feature may be one of the inherent characteristics of core complex quinone channels in purple bacteria.

Previous studies have shown that the RC Cyt subunit in the aerobic purple bacterium Roseobacter (Rsb.) denitrificans has a complete N-terminus, and sequence alignments revealed a phylogenetic relationship between this portion of the Cyt subunit and PufX from both Rba. sphaeroides and Rba. capsulatus50. Moreover, structural studies of the Rpi. globiformis LH1–RC complex have revealed that the N-terminus of the Cyt subunit forms a helix to anchor the subunit into the membrane and shows a high similarity between its N-terminus and the C-terminus of PufX13. The Cyt subunit in the Rss. parvum RC also shows this pattern, with several amino acid residues at its N-terminus overlapping well with the C-terminus of PufX (Fig. 4e, f). Furthermore, a close overlap exists between the N-terminal 13/14 residues of the truncated Cyt subunit and the C-terminus of PufX (Supplementary Fig. 9c–g), and a long-standing hypothesis suggests that the RC of the common ancestor of phototrophic purple bacteria likely contained a full-length four-heme Cyt subunit that subsequently evolved through partial deletion of the pufC gene13.

The overall structure of the Rss. parvum LH1′ complex resembles the heterologously expressed circular LH1-only complex from Tch. tepidum, as both structures comprise 14 pairs of αβ-polypeptides encoded by pufBA51. However, there are also major differences between the two complexes. The Tch. tepidum LH1-only complex was expressed in a photosynthetically incompetent strain of Rsp. rubrum that lacked genes encoding its RC complex. By contrast, in Rss. parvum, a fully functional LH1–RC complex exists with the LH1′ complex being a separate and distinct entity, an arrangement typical of purple bacteria that contain peripheral (LH2) complexes. Despite the fact that the LH1-only complex from Tch. tepidum and Rss. parvum LH1′ are both circular and composed of an identical number of subunits, the complexes do not completely overlap due to differences in the orientation of the αβ-polypeptides and their associated pigments (Supplementary Fig. 12). This also suggests that the function of the Rss. parvum LH1′ complex is distinct from that of its LH1, and in this connection, we propose that LH1′ harvests light energy analogous to classical LH2s. If the Rss. parvum LH1′ does function as a peripheral antenna, it would be the largest LH2-like complex known. The lack of a B850 spectral component in the Rss. parvum LH1′ and the red shift of its Qy transition would also make it a unique peripheral antenna. Conversely, it is possible that LH1′ is a compact and functionally independent version of Rss. parvum LH1. There is a precedent for this, as an LH1-only complex has been obtained during purification of the LH1–RC complexes from Rsp. rubrum9,52 and Rfl. castenholzii53. Other studies using freeze-fracture electron microscopy to investigate the intracytoplasmic membranes of Allochromatium (Alc.) vinosum also revealed different forms of an LH2 complex, one of which was composed of 13 pairs of αβ-heterodimers54. Moreover, atomic force microscopy observations of the intracytoplasmic membranes of Pararhodospirillum (Par.) photometricum have shown the possible presence of an LH2 complex composed of 14 subunits55,56. However, all of these studies lacked confirmation from corresponding high-resolution structures. By contrast, our results with Rss. parvum are based on the atomic structure of the native 14-fold LH1′ complex, and thus the number of subunits per complex is known precisely. Regardless of how it is labeled—an LH1′ or an LH2—the Rss. parvum LH1′ complex likely communicates in some way with its LH1–RC to ultimately funnel light energy into the core complex. The absorption differences reported herein could form the basis of such communication, as energy transfer, albeit slight, from the Rss. parvum LH1′ (Qy 905 nm) to its LH1–RC (Qy 908 nm) would be an energetically “downhill” (and thus favorable) reaction.

All purple bacteria contain an LH1–RC complex while the peripheral LH2 antenna is present in many but not all species. Moreover, genes encoding LH1 are located within the photosynthetic gene cluster (PGC) while LH2 genes are typically located outside of the PGC. Therefore, one could argue that LH1 and LH2 originated from a common ancestor and that the LH1 complex emerged first and gave rise to the LH2 complex through gene duplication followed by horizontal gene transfer to certain species3. Although there is scant firm evidence to support this hypothesis, the fact that the Rss. parvum LH1–RC and LH1′ complexes are both encoded by pufBA provides fresh support for this scenario. Early evolving purple bacteria may have used pufBA to encode LH1 and produced only an LH1–RC complex. And then, possibly in response to the colonization of light-limiting environments and the need to exploit a greater breath of wavelengths, early forms of LH2 such as the Rss. parvum LH1′ complex evolved while still encoded by pufBA. In such a scenario, pucBA would have evolved from pufBA roots to encode smaller LH2 complexes composed of variable numbers of αβ-polypeptides that had different absorption properties25,26,27,28,29. Increasing the structural diversity of LH2 complexes would have broadened the light-harvesting range of purple bacteria throughout the near infrared region. The light-harvesting complexes of Rss. parvum not only fit well into such an evolutionary scenario but in addition enrich our understanding of purple bacterial light-harvesting complexes and expand the diversity of known light-harvesting antenna complexes in general.

Methods

Purification and characterization of the Rss. parvum LH1–RC and LH1′ complexes

Rss. parvum cells (purchased from DSMZ) were cultivated phototrophically at 30 °C for 7 days under incandescent light (50–60 μmol m–2 s–1) as described previously32. The cells were harvested by centrifugation at 5500 rpm (5353 × g) for 10 min and chromatophores were isolated by sonication (Ultrasonic Homogenizer JY92-IIN, SCENTZ, China) of pelleted cells suspended in 20 mM Tris-HCl buffer (pH 7.5) followed by ultracentrifugation (OptimaTM L-100 XP Ultracentrifuge, BECKMAN COULTER, USA) at 208,000 × g for 90 min. Chromatophores were treated with 1.0% (w/v) n-dodecyl β-D-maltopyranoside (β-DDM) at room temperature for 60 min and then centrifugated at 208,000 × g for 60 min. The resulting supernatant was loaded onto a DEAE anion-exchange column (Toyopearl 650S, TOSOH) equilibrated with 20 mM Tris-HCl buffer (pH 7.5) containing 0.03% (w/v) β-DDM. The LH1′ and LH1–RC complexes were eluted by a linear gradient of NaCl from 50 mM to 250 mM (Supplementary Fig. 1a). After washing the mixture of impurities, the LH1′ complex was eluted first, followed by the LH1–RC complex; purities were assessed by sucrose density gradient ultracentrifugation, SDS-PAGE, BN-PAGE and negative-stain electron microscopy (Supplementary Fig. 1, Supplementary Fig. 2a). Fractions with ratios of R908/280 and R905/280 larger than 2.0 and 2.4, respectively, were collected and concentrated with Amicon 100,000 molecular weight cutoff (MWCO) centrifugal filters (Millipore). Absorption spectra were measured using a UV-1900i spectrophotometer (Shimadzu). The purified complexes were diluted with buffers of different pH values (pH 5.0, 5.5, 6.5, 7.5) for pH-dependent spectroscopic measurements. Subsequently, reversible tests were conducted by adding a large amount of buffer containing 20 mM Tris-HCl (pH 7.5) and 0.03% (w/v) β-DDM until neutral pH was reached.

The complete genome of Rss. parvum

The complete genome sequence of Rss. parvum was sequenced using Nanopore Oxford Nanopore Technologies (ONT). After trimming low-quality reads by Trimmomatic v0.33 software57, about 911 Mb of clean sequences were used to assemble the genome by CANU v2.2 software58. Two versions of genomic sequences were generated by different parameters, resulting in sequence sizes of 3,589,066 and 3,578,963, respectively. To ensure accuracy, sequence correction was performed using RACON v1.5.0 software59 (https://github.com/lbcb-sci/racon). Both sequences had a length of above one cycles and were the same in alignment, which was employed using MINIMAP2 v2.24-r112260 for comparisons. Only one cycle was retained as the complete genome after removing overlapped bases, and the finished cycled genome had a length of 3,548,410. Additionally, the genome was polished using NextPolish v1.4.161 with Illumina paired-reads. Finally, a high-quality, complete, circular genome with a length of 3,555,720 bp was assembled. Prodigal V2.6.3 software62 (Assembly, Annotation, and Comparative Analysis of Bifidobacterial Genomes) was employed for annotation using the cycled assembly genome.

Cryo-EM data collection

Proteins for cryo-EM were concentrated to 4 mg/ml. Then 5 microliters of protein solution were applied on glow-discharged holey carbon grids (M01 Au300-R1.2/1.3, Nanodim Tech) previously treated with H2 and Ar mixtures in a Solarus plasma cleaner (Gatan, Pleasanton, USA) for 60 s and then blotted and plunged into liquid ethane at –182 °C using Vitrobot Mark IV (Thermo Fisher Scientific). The applied parameters were a blotting time of 4 s at 100% humidity and 4 °C. Data were collected on a Titan Krios (Thermo Fisher Scientific, Hillsboro, USA) electron microscope at 300 kV equipped with a K3 camera (Gatan, Pleasanton, USA) (Supplementary Fig. 2b). Movies were recorded using EPU software (Thermo Fisher Scientific) at a nominal magnification of 81 k in counting mode and a pixel size of 1.04 Å at the specimen level with a dose rate of 39 e- per physical pixel per second, corresponding to 36 e- per Å2 per second at the specimen level. The exposure time was 1.71 s, resulting in an accumulated dose of 61.6 e- per Å2. Each movie includes 32 fractioned frames.

Image processing of the LH1–RC complex

All of the stacked frames were subjected to motion correction, and defocus was estimated using cryoSPARC63. A total of 788,173 particles were selected from 8150 micrographs using the crYOLO64 (Supplementary Fig. 3). All of the picked particles were further analyzed with cryoSPARC63, and 400,267 particles were selected by 2-D classification and divided into four classes by 3-D classification resulting in only one good class containing 281,288 particles. The initial 3-D model was generated in cryoSPARC. The non-uniform refinement without any imposed symmetry (C1) produced a map at 2.65 Å resolution after contrast transfer function refinement, masking, and post-processing, according to the gold-standard Fourier shell correlation using a criterion of 0.14365,66 (Supplementary Fig. 3b). The local resolution maps were calculated on cryoSPARC.

Image processing of the LH1′ complex

All of the stacked frames were subjected to motion correction, and defocus was estimated using cryoSPARC61. A total of 826,886 particles were selected from 7387 micrographs using crYOLO64 (Supplementary Fig. 3). All of the picked particles were further analyzed with cryoSPARC63, and 521,006 particles were selected by 2-D classification. All particles were subjected to a low-pass filtering of 3 Å before 3D reconstruction. Four initial 3-D models were generated in cryoSPARC and then subjected to Hetero-Refinement, with one good class containing 397,127 particles selected. These particles were then reextracted without filtering, then the non-uniform refinement without any imposed symmetry (C1) produced a map at 2.72 Å resolution65. After applying C14 symmetry, contrast transfer function refinement, masking, and post-processing, a final map of 2.35 Å was produced, according to the gold-standard Fourier shell correlation using a criterion of 0.14366 (Supplementary Fig. 3b). The local resolution maps were calculated on cryoSPARC.

Model building and refinement of the LH1–RC and LH1′ complexes

The atomic model of the Tch. tepidum LH1–RC (PDB: 5Y5S) was fitted to the cryo-EM map obtained for the Rss. parvum LH1–RC using Chimera67. Amino acid substitutions and real space refinement for the polypeptides and cofactors were performed using COOT68. Whole regions of the LH1 αβ-polypeptides as well as both terminal regions of the Cyt subunit and LH1 αβ-subunit were modelled ab-initio based on the density. For LH1′ complex, the atomic model of LH1 αβ-polypeptide with cofactors was fitted to the cryo-EM map sequentially to obtain the final structure. The manually modified model was refined in real-space on PHENIX69, and the COOT/PHENIX refinement was iterated until the refinements converged. Finally, the statistics calculated using MolProbity70 were checked. Figures were drawn with the Pymol Molecular Graphic System (Ver2.5, Schrödinger)71, UCSF Chimera67 and UCSF Chimera X72. Surface charge distributions were calculated using the APBS Electrostatics plugin in the Pymol.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.