Introduction

Water channels named aquaporins (AQPs) are membrane proteins that facilitate the permeation of water across lipid bilayers to the exclusion of ions and protons1. AQPs that permeate not only water but also small solutes such as glycerol are named aquaglyceroporins1,2. Thirteen human AQPs identified to date have different permeabilities and are mainly classified as follows: AQP0, AQP1, AQP2, AQP4, and AQP5 are considered water-selective type AQPs, while AQP3, AQP7, AQP9, and AQP10 are aquaglyceroporins3,4. AQPs are regulated by several factors, including pH, phosphorylation, and protein interactions5,6,7,8. Functionally different AQPs underlying various physiological and pathological phenomena are distinctly distributed in the body1,2,3,4,5,6,7,8.

The permeability of AQPs is related to their channel pore properties. AQPs have a unique fold that forms a tetramer with one water-permeable pore per monomer5,9,10. The aromatic/arginine (ar/R) region located at the extracellular side of the channel pore forms a constriction filter11,12. While the arginine residue is highly conserved, the aromatic residues vary among AQPs13. The variety of ar/R filters relates to the different permeabilities of AQPs for water and solutes11,12,13,14,15,16,17. Below the ar/R filter, two half-membrane-spanning helices insert from both sides of the membrane and meet in the centre of the pore with a highly conserved asparagine-proline-alanine (NPA) motif9. The helical dipole moments of the short helices create a positive electrostatic field around the NPA motifs with the pore-facing asparagine residues9. These NPA motifs with an electrostatic field together with the ar/R filter are proposed to contribute to the permeabilities of AQPs without permeation of ions and protons5,9,18,19,20,21.

Structural analyses of many water-selective AQPs and aquaglyceroporins have been performed by X-ray and electron crystallography, and single-particle cryo-electron microscopy (cryo-EM) analysis22,23,24. Escherichia coli (E. coli) GlpF was the first structurally determined aquaglyceroporin10,21. The mammalian aquaglyceroporin AQP7 structure has also been solved25,26,27. Expression of AQP7 in human pancreatic cells and its junction structure via extracellular loops were recently reported28. AQP10-specific low-pH dependent glycerol release from adipocytes was demonstrated by functional and structural analyses29. The regulatory features governing the structures and functions of other aquaglyceroporins, such as AQP3, however, remain unclear. A homology model of the AQP3 structure, in which the ar/R filter forms an open pore, has been used for in silico studies30,31,32,33.

In this work, we perform cryo-EM structural analysis of rat AQP3, revealing two snapshots of AQP3: 1) ar/R filter forms a pore with a diameter of 2.8 Å, and 2) a previously unobserved narrowed pore conformation in which Tyr212, an aromatic residue of the ar/R filter, inserts into the pore. All-atom molecular dynamics (MD) simulations show that the AQP3 Tyr212-in conformation is stable and largely suppresses water permeability. AQP3 Y212F and Y212T mutant structures show that the pore-inserted conformation is due to the aromatic residue. Aquaglyceroporins AQP7 and GlpF contain Tyr223 and Phe200, respectively, at the site of Tyr212 in AQP3, yet neither structure exhibits the AQP3-like narrowed pore conformation. Unexpectedly, the cryo-EM structure of GlpF reveals a narrowed pore by the intracellular loop, similar to conformations observed in plant AQPs. Structural model B-factor suggests that the intracellular loop is relatively flexible. MD simulations support that the GlpF intracellular loop is flexible allowing water to pass through the region near the loop. Our study elucidates the structural variety of aquaglyceroporins.

Results

Structures of AQP3 in DDM micelles and in POPC or DMPC nanodiscs

To experimentally determine the structure of AQP3, we expressed rat AQP3 in insect cells and purified it using the detergent n-dodecyl-β-D-maltopyranoside (DDM) (Supplementary Fig. 1a, b). AQP3 was reconstituted into lipid nanodiscs34 by mixing DDM-solubilised AQP3 with MSP proteins and 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine (POPC), followed by the application of Bio-Beads to remove the DDM (Supplementary Fig. 1a, b). Cryo-EM maps of AQP3 in DDM micelles and in POPC nanodiscs (hereafter referred to as AQP3DDM and AQP3POPC, respectively) were obtained at resolutions of 3.12 Å and 2.94 Å, respectively (Fig. 1a, b and Supplementary Figs. 2 and 3a, b). As expected, AQP3 forms a tetrameric aquaporin fold (Fig. 1c). The monomer consists of six transmembrane helices (H1–H6) with intracellular N- and C-termini and five loop regions (loop A–E), of which intracellular loop B and extracellular loop E contain reentrant loops with short helices (HB and HE) that feature NPA motifs (Fig. 1d).

Fig. 1: Conformational differences between AQP3 in DDM micelles and AQP3 in POPC nanodiscs.
figure 1

Cryo-EM maps of AQP3DDM (a) and AQP3POPC (b). Regions of micelles and nanodiscs are shown as transparent using unsharpened maps. The monomers of AQP3DDM and AQP3POPC are highlighted in gray and rainbow colours, respectively. c The tetrameric modelled structure of AQP3POPC shown as a ribbon representation from the side and extracellular views. d Schematic topology of an AQP3 monomer coloured as in (b, c). The NPA motifs at the HB and HE short helices are indicated as NPA. The ar/R filter residues, Phe63 at H2, Tyr212 at reentrant loop E, and Arg218 at HE, are indicated as F, Y, and R, respectively. Overall structures of the monomers of AQP3DDM (e) and AQP3POPC (f) from the side and extracellular views. The reentrant loop E, ar/R filter residues, and asparagines at the NPA motifs are highlighted with several residues shown as sticks. The channel pore of the AQP3DDM monomer is indicated by an asterisk from the extracellular view (e, right panel). For clarity, loop B, loop C, and H6 are partially removed from the side views. g Superimposition of the regions around reentrant loop E of AQP3DDM and AQP3POPC from the side and extracellular views. The movement of Phe63 is indicated by a two-way arrow (right panel). The hydrogen bond between Tyr212 and Asn83 of AQP3POPC is indicated by a yellow dotted line (left panel). Cryo-EM maps and models of the pore regions of the monomers of AQP3DDM (h) and AQP3POPC (i).

Importantly, AQP3 exhibits significantly different conformations between structures in DDM micelles and POPC nanodiscs. The ar/R filter formed by Phe63, Tyr212, and Arg218 of AQP3DDM creates a pore diameter of 2.8 Å (Tyr212-out conformation; Fig. 1e and Supplementary Fig. 4a). For AQP3POPC, however, a conformational change of extracellular reentrant loop E is observed with Tyr212 inserted into the pore (Tyr212-in conformation; Fig. 1f). The pore of AQP3POPC is calculated to be occluded by this conformational change (Supplementary Fig. 4b). Tyr212 inserts into the centre of the pore near the NPA motifs, and the hydroxyl group of Tyr212 forms a hydrogen bond contact with Asn83 in the NPA motif of HB (Fig. 1g, left panel). The inserted Tyr212 on reentrant loop E pushes Phe63 outward at the ar/R filter (Fig. 1g, right panel). The reentrant loop E of AQP3POPC is kinked at Gly207 with a large conformational change compared with that of AQP3DDM (Fig. 1g, left panel). While the observed density map at the pore of AQP3POPC largely corresponds to Tyr212, that at the pore of AQP3DDM appears as a relatively continuous shape (Fig. 1h, i). It was difficult to assign a specific model in the density map at the pore of AQP3DDM (Fig. 1h).

Some additional differences were observed between the structures of AQP3DDM and AQP3POPC. Along with the loop E conformational changes in AQP3POPC, Phe56 at the extracellular side of the H2 helix flips upward and His53 at the adjacent monomer’s loop A becomes disordered (Supplementary Fig. 5a, b). In addition, the loop C residues His154 and Leu155 are also conformationally changed in AQP3POPC (Supplementary Fig. 5a, b). Furthermore, the position of the intracellular loop D shifts to the pore side in AQP3POPC compared with AQP3DDM (Fig. 1e, f and Supplementary Fig. 5c).

We measured the phospholipid contents in the purified proteins of AQP3DDM and AQP3POPC. Phospholipids were detected in AQP3DDM samples to a lesser extent than in AQP3POPC (Supplementary Fig. 6a), suggesting that native phospholipids from insect cells are not completely stripped away by DDM solubilisation. The shapes and sizes of the cryo-EM maps around the tetrameric proteins differ between AQP3DDM and AQP3POPC (Supplementary Fig. 6b,c). These cryo-EM maps may correspond to DDM or surrounding lipids, but it was difficult to assign specific models.

The structure of AQP3 in 1,2-dimyristoyl-sn-glycero-3-phosphocholine (DMPC) nanodiscs (referred to as AQP3DMPC) was also solved (Supplementary Figs. 1c, d and 3c). AQP3DMPC particles appeared as oval discs under the cryo-EM analysis without symmetry application (Supplementary Fig. 7a). All four monomers of AQP3DMPC exhibited the Tyr212-in conformation like AQP3POPC (Supplementary Fig. 7b). Cryo-EM maps like acyl chains of lipids were observed around some monomers of AQP3DMPC (Supplementary Fig. 7c).

MD simulations of the AQP3 Tyr212-out and Tyr212-in conformations

To examine structural stability and water distribution in the Tyr212-out and Tyr212-in conformations of AQP3, 500-ns MD simulations were conducted. The Tyr212-out conformation was observed in the purified protein of AQP3DDM, which contains not only DDM but also phospholipids (Supplementary Fig. 6a). The Tyr212-out conformation was examined in MD simulations both in DDM micelles and POPC bilayers to investigate the effects of surrounding environments. The Tyr212-in conformation was also examined in the same manner. The four systems were generated by burying the AQP3 tetramers of the Tyr212-out and Tyr212-in conformations in DDM micelles and POPC bilayers as follows: the modelled AQP3 tetramer of the Tyr212-out conformation in DDM micelles (referred to as ITyr212-out/DDM hereafter); that of the Tyr212-out conformation in the POPC bilayer (ITyr212-out/POPC); that of the Tyr212-in conformation in DDM micelles (ITyr212-in/DDM); and that of the Tyr212-in conformation in the POPC bilayer (ITyr212-in/POPC). These systems were used in MD simulations (Supplementary Fig. 8a–d and Supplementary Movies 14). The overall structures were not denatured during the MD simulations (Supplementary Fig. 8e). In the MD simulations for ITyr212-out/DDM and ITyr212-in/DDM, the dissociation of DDM molecules from the micelle was not observed, indicating that the system setup for DDM micelles is appropriately adjusted.

The distance between Tyr212 and Asn83 for each chain of the tetramers was monitored during the MD trajectories (Fig. 2a–f). In the simulation of the Tyr212-out conformation, the Tyr212–Asn83 distance for ITyr212-out/DDM decreased from 12.3 Å in the initial structure to ~6 Å at two chains of the tetramer, and that for ITyr212-out/POPC decreased to ~6 Å at two chains and decreased further to ~4 Å at one chain (Fig. 2a–c). In contrast, the Tyr212–Asn83 distance for ITyr212-in/DDM and ITyr212-in/POPC were maintained at ~4 Å during the simulations except for one chain of ITyr212-in/DDM, in which the distance became ~7 Å during the last 30 ns (Fig. 2d–f). These results showed that the Tyr212-out conformation is relatively flexible and that the flexibility is independent of the surrounding environments of DDM micelles and POPC bilayers. These results also showed that the inserted Tyr212 is stable in both DDM micelles and POPC bilayers. The fluctuations observed by MD simulations in the reentrant loop E of the Tyr212-out conformations were larger than those of the Tyr212-in conformations (Supplementary Fig. 8f). The experimental structural flexibility of the reentrant loop E represented by the B-factor of cryo-EM structure of AQP3DDM was smaller than that of AQP3POPC (Supplementary Fig. 8g). The stability of the reentrant loop E represented by the B-factor of AQP3DDM was inconsistent with fluctuations observed in the Tyr212-out conformations by MD simulations (Supplementary Fig. 8f, g).

Fig. 2: MD simulations for the Tyr212-out and Tyr212-in conformations of AQP3.
figure 2

Initial and monitored distance between Tyr212(OH) and Asn83(ND2) in MD simulations for the AQP3 Tyr212-out (ac) and Tyr212-in (df) conformations. The monitored distances are shown for ITyr212-out/DDM (b), ITyr212-out/POPC (c), ITyr212-in/DDM (e), and ITyr212-in/POPC (f). In the graphs of the monitored distance, the plots of the four chains of tetramers are shown (b, c, e, f). The reentrant loop E is represented by magenta (a, d). g Water distribution in pores during MD simulations for four systems. The pore axes are aligned with z-axes of the cryo-EM structures. The origin of the pore axis is set at the z-coordinate of the Cα atom of Asn83.

The water distribution in the AQP3 pores was calculated from the obtained MD trajectories. In calculating the water distribution, the number of water molecules in the pore axis was counted and averaged over the four pores of a tetramer and over all snapshots of the simulations (Fig. 2g). For ITyr212-in/DDM and ITyr212-in/POPC, the relative density of the water molecules equalled almost zero along the pore axis, spanning a distance of 2 to 8 Å, which is near the region of the inserted Tyr212 (Fig. 2g). Furthermore, the water permeability in the MD simulations was also measured by counting how many water molecules passed through the four pores in the intracellular or extracellular direction. The total number of water molecules passing through was 1989 for ITyr212-out/DDM, 1251 for ITyr212-out/POPC, 75 for ITyr212-in/DDM, and 5 for ITyr212-in/POPC. Water molecule orientations in the pore were also analysed. Water molecules are oriented in opposite directions in the two regions of the AQP3 pore separated by the NPA motif (Supplementary Fig. 8h). This finding is consistent with a previous report showing water molecule orientations in the GlpF pore21.

The essential findings from the MD simulations of AQP3 are the differences in structural stability and water permeability between Tyr212-out and Tyr212-in conformations. All-atom conventional MD was chosen as the method because the two conformations are quite similar except for loop E and large structural changes of a membrane protein are not focused on in this study. The results are based on the behaviours of the four chains in the AQP3 tetramer. In the analyses in the monomer unit, one MD trajectory of the tetramer can be considered to include four independent trajectories of the four chains. The four chains had different initial conformations and velocities of atoms in the production run because of the energy minimisation and equilibration runs before it. In the MD simulations for the Tyr212-out conformation, the Tyr212–Asn83 distance changed in some chains in the timescale of tens of nanoseconds (see Fig. 2b, c). Also, the differences in water distribution and in number of permeations were captured within a sampling range of 500 ns. Therefore, the simulation length appeared to be sufficient for the analyses of structural stability and water permeability focused on in this study.

Aromatic residue required for the pore insertion of AQP3 in POPC nanodiscs

To investigate the mechanism of the pore insertion observed in AQP3POPC, Y212F and Y212T mutants were constructed. The Y212F mutation lacks the hydroxyl group but retains the aromatic ring. The Y212T mutation lacks the aromatic ring. We expressed and purified AQP3 Y212F and Y212T mutants (Supplementary Fig. 1e, f). We obtained their cryo-EM structures in POPC nanodiscs (hereafter referred to as AQP3-Y212FPOPC and AQP3-Y212TPOPC, respectively; Supplementary Fig. 3d, e). The structure of AQP3-Y212FPOPC exhibited conformational changes of the reentrant loop E with the pore insertion of Y212F near the NPA motifs, similar to AQP3POPC (Fig. 3a). On the other hand, the structure of AQP3-Y212TPOPC revealed that Y212T does not insert into the pore, but covers the pore (Fig. 3b). The cryo-EM map corresponding to reentrant loop E residues from Gly207 to Thr212 of AQP3-Y212TPOPC is more disordered than that of the wild-type AQP3POPC and AQP3-Y212FPOPC (Fig. 3c–e). The unsharpened map of AQP3-Y212TPOPC supports that the reentrant loop E of AQP3-Y212TPOPC appears to cover the pore (Fig. 3e) and the position of Phe208 in AQP3-Y212TPOPC differs from the position of Phe208 in AQP3POPC and AQP3-Y212FPOPC (Fig. 3e). The pore of AQP3-Y212FPOPC is occluded like that of AQP3POPC (Supplementary Fig. 4b). The pore diameter of AQP3-Y212TPOPC was calculated to be wider than that of AQP3POPC and AQP3-Y212FPOPC (Supplementary Fig. 4b). These results suggest that an aromatic residue (Tyr or Phe) is necessary for the pore insertion of AQP3 in POPC nanodiscs.

Fig. 3: Structures of AQP3 Y212F and Y212T mutants.
figure 3

Overall structures of the monomers of AQP3-Y212FPOPC (a) and AQP3-Y212TPOPC (b) from the side and extracellular views. The reentrant loop E, ar/R filter residues, and asparagines at the NPA motifs are highlighted with several residues shown as sticks. For clarity, loop B, loop C, and H6 are partially removed from the side views. Extracellular views of cryo-EM maps and models around loop E regions of AQP3POPC (c), AQP3-Y212FPOPC (d), and AQP3-Y212TPOPC (e). Unsharpened maps are also presented (right panel). f Overall structure of the monomer of AQP3-Y212TDDM shown as in (ab). Modelled DDM is not shown. Extracellular views of cryo-EM maps and models around loop E regions of AQP3DDM (g) and AQP3-Y212TDDM (h). i Enlarged side view of cryo-EM map and model of pore region of AQP3-Y212TDDM with modelled DDM molecule coloured purple.

To evaluate whether or not the structural instability of the reentrant loop E of AQP3-Y212TPOPC appears in DDM micelles, structural analysis of AQP3 Y212T in DDM micelles (hereafter referred to as AQP3-Y212TDDM) was performed (Supplementary Fig. 3f). Reentrant loop E residues are not disordered in the cryo-EM structure of AQP3-Y212TDDM similar to those of AQP3DDM (Fig. 3f–h). Cryo-EM maps that might correspond to DDM or lipids appear around tetrameric AQP3-Y212TDDM and are not the same as those in AQP3-Y212FPOPC and AQP3-Y212TPOPC (Supplementary Fig. 6d–f).

The structure of AQP3-Y212TDDM exhibited two additional features at the pore. First, the Y212T mutation changes the position of reentrant loop E to narrow the pore around Y212T compared with that in AQP3DDM (Fig. 3g, h and Supplementary Fig. 4a). Second, the cryo-EM map at the channel pore of AQP3-Y212TDDM shows a clear density at the intracellular side, matching that in DDM (Fig. 3i). Such a clear shape is not observed in the cryo-EM maps of the AQP3-Y212FPOPC and AQP3-Y212TPOPC pores (Supplementary Fig. 9a, b).

Water and glycerol permeability of AQP3 and its mutants in POPC liposomes

Proteoliposomes were made by DDM-solubilised proteins and POPC to evaluate the water and glycerol permeability using a stopped-flow apparatus. We compared the protein and phospholipid contents of proteoliposomes among AQP3 wild-type, Y212F, and Y212T (Supplementary Fig. 9c and Supplementary Table 1). We confirmed that these prepared proteoliposomes exhibits similar protein/phospholipid weight ratios (Fig. 4a).

Fig. 4: Stopped-flow measurements of AQP3 and mutants.
figure 4

a The measured protein/phospholipid weight ratio of AQP3, AQP3 Y212F, and AQP3 Y212T proteoliposomes. Averaged values ± SEM and individual data points are plotted (n = 4 independently reconstituted liposome samples from the same purified proteins). Data are statistically analysed using multiple comparison Tukey test. P values are indicated. b, c Stopped-flow measurements under NaCl hyperosmotic conditions for AQP3, AQP3 Y212F, and AQP3 Y212T proteoliposomes and empty liposomes. Averaged water permeability Pf values ± SEM and individual data points are plotted (n = 4 independently reconstituted liposome samples from the same purified proteins) (b). Data are statistically analysed using multiple comparison Tukey test. P values are indicated. *P < 0.05 and ***P < 0.001. Representative time courses of normalised intensity change and fitted curves are shown (c). d, e Stopped-flow measurements under glycerol gradient conditions for AQP3, AQP3 Y212F, and AQP3 Y212T proteoliposomes and empty liposomes. Averaged glycerol permeability Pgly values ± SEM and individual data points are plotted (n = 4 independently reconstituted liposome samples from the same purified proteins) (d). Data are statistically analysed using multiple comparison Tukey test. P values are indicated. *P < 0.05, **P < 0.01, and ***P < 0.001. Representative time courses of normalised intensity change and fitted curves are shown (e). Source data are provided as a Source Data file.

AQP3 proteoliposomes shrank faster than empty liposomes under hyperosmotic conditions, indicating the water permeability of the reconstituted AQP3 in proteoliposomes (Fig. 4b, c and Supplementary Table 1). Both AQP3 Y212F and Y212T mutants also exhibited water permeability, and the permeability of AQP3 Y212T is slightly faster than that of AQP3 wild-type and significantly faster than that of Y212F (Fig. 4b, c and Supplementary Table 1). AQP3 also exhibited glycerol permeability under glycerol gradient conditions (Fig. 4d, e and Supplementary Table 1). Both AQP3 Y212F and Y212T mutants exhibited glycerol permeability, with Y212T exhibiting significantly faster glycerol permeability than AQP3 wild-type and Y212F (Fig. 4d, e and Supplementary Table 1).

Structure of AQP7 in POPC nanodiscs

We focused on the sequence conservation of Tyr212 in AQP3 among human AQPs. Only AQP7, an aquaglyceroporin, has an aromatic residue, Tyr223 (Fig. 5a). The cryo-EM structure of human AQP7 in detergent micelles using glyco-diosgenin was recently reported28. Therefore, we purified and performed cryo-EM structural analysis of human AQP7 only in POPC nanodiscs (termed AQP7POPC; Supplementary Figs. 1g, h and 3g). AQP7POPC did not exhibit the narrowed pore conformation like AQP3POPC, despite the presence of aromatic residues at reentrant loop E (Fig. 5b and Supplementary Fig. 10a). The structure of AQP7POPC is similar to that of AQP7 in detergent micelles28 (Fig. 5b). The positions of the ar/R filter residues and reentrant loop E are similar between AQP7POPC and AQP3DDM (Supplementary Fig. 11a).

Fig. 5: Structural comparison of AQP7 and GlpF with AQP3.
figure 5

a Aligned sequences of rat AQP3, human AQP7, and E. coli GlpF used in this study. Residues highlighted in gray are ar/R filter residues and NPA motifs. The site of residues corresponding to Tyr212 at reentrant loop E in AQP3 is indicated by an asterisk. Residues highlighted in pink correspond to GlpF Val173 and Pro177 at loop D, respectively. Sequence alignment was achieved by EMBL Clustal W web services, and manually modified. b Superimposition of overall structures of the monomers of AQP7 in POPC nanodiscs and reported AQP7 in the detergent micelles, which forms a dimer of tetramers analysed under D4 symmetry (PDB: 8AMX) from the side and extracellular views. The reentrant loop E, ar/R filter residues, and asparagines at the NPA motifs are highlighted with several residues shown as sticks. For clarity, loop B, loop C, and H6 are partially removed from the side views. c Overall structure of the monomer of GlpFPOPC is shown as in (b). The intracellular view is also shown, and the intracellular loop D is highlighted with Val173 and Pro177 shown as sticks. d, e Superimposition of the model of GlpFPOPC with the cryo-EM map of GlpFDDM from the extracellular (d) and intracellular (e) views. The ar/R filter residues are shown as sticks (d). f Comparison of the intracellular loop D of GlpFPOPC and AQP3POPC. The loop D is highlighted with several residues shown as sticks. The difference of loop D is indicated by a two-way arrow. g Overall structure of the monomer of GlpF-ΔP177POPC is shown as in (c). h Comparison of GlpFPOPC and GlpF-ΔP177POPC from the intracellular view. The loop D and C-terminus are highlighted with several residues shown as sticks. The differences of loop D and C-terminus are indicated by two-way arrows.

We prepared AQP7 reconstituted POPC liposomes. We confirmed the protein contents (Supplementary Fig. 11b, c and Supplementary Table 1), water permeability (Supplementary Fig. 11d, e and Supplementary Table 1), and glycerol permeability (Supplementary Fig. 11f, g and Supplementary Table 1) of reconstituted AQP7.

Structures of GlpF in DDM micelles and POPC nanodiscs

Sequence analysis showed that E. coli GlpF also has an aromatic residue, Phe200, at the same position as Tyr212 in AQP3 (Fig. 5a). Therefore, we purified and performed cryo-EM structural analyses of GlpF in DDM micelles and POPC nanodiscs (termed GlpFDDM and GlpFPOPC, respectively; Supplementary Figs. 1i, j, 3h, i, and 12a). GlpFPOPC did not exhibit the narrowed pore conformation like AQP3POPC, despite the presence of an aromatic residue at reentrant loop E (Fig. 5c and Supplementary Fig. 10b). A structural model of GlpFDDM was not constructed because of the relatively low-resolution map, but the model of the reentrant loop E of GlpFPOPC fits well to the cryo-EM map of GlpFDDM (Fig. 5d), indicating the same conformation of loop E between GlpFDDM and GlpFPOPC. The residues at the ar/R filter of AQP7 and AQP3 are the same, but differ from those of GlpF as follows: Trp48 in GlpF corresponds to Phe63 in AQP3 and Phe74 in AQP7, and Phe200 in GlpF corresponds to Tyr212 in AQP3 and Tyr223 in AQP7 (Fig. 5a). The positions of the ar/R filter residues and reentrant loop E are similar between GlpFPOPC and AQP3DDM (Supplementary Fig. 12b).

Notably, GlpFPOPC exhibits a previously unobserved structure where the pore is covered by loop D on the intracellular side (Fig. 5c). The cryo-EM map of GlpFDDM also shows that the intracellular side of the pore is covered by loop D (Fig. 5e), indicating that this is not dependent on the DDM or POPC nanodisc environment. Loop D of GlpFPOPC mainly faces the pore with its Val173 (Fig. 5c). While the residue corresponding to Val173 in GlpF is also a valine (Val186) in AQP3 (Fig. 5a), loop D does not cap the pore of AQP3POPC (Fig. 5f). Comparison of the structure and sequence suggests that the inserted proline residue found only in GlpF, Pro177, generates the differences (Fig. 5a, f). The GlpF ΔP177 mutant in POPC nanodiscs (termed as GlpF-ΔP177POPC) was purified and subjected to cryo-EM structural analysis (Supplementary Figs. 1k, l, 3j, and 12c). The intracellular loop D of GlpF-ΔP177POPC predictably does not cover the pore (Fig. 5g and Supplementary Fig. 10b). The C-terminus also shows a different conformation between GlpFPOPC and GlpF-ΔP177POPC (Fig. 5h). GlpF-ΔP177POPC is modelled up to Gly256, but GlpFPOPC is modelled up to Cys261.

Loop D does not cap the pore and the C-terminus shows a different conformation in the X-ray crystal structure of GlpF without glycerol21 compared with GlpFPOPC (Supplementary Fig. 10b and 12d). The positions of loop D and the C-terminus of GlpF-ΔP177POPC are more similar to those of the X-ray crystal structure of GlpF (Supplementary Fig. 12e).

The pore closure structures by loop D have been reported in other X-ray crystal structures of two plant AQPs, spinach PIP2;1 (SoPIP2;1)35 and Oryza sativa NIP2;1 (OsNIP2;1)36. We compared the structures of SoPIP2;1 and OsNIP2;1 with GlpFPOPC (Supplementary Fig. 12f, g). The position of loop D in GlpFPOPC is not the same as that in SoPIP2;1 and OsNIP2;1.

MD simulations and permeability assays of GlpF and its mutant

In the same manner, as for AQP3, 500-ns MD simulations were conducted to examine the structural stability and water distribution of GlpF wild-type and GlpF ΔP177 in POPC bilayers. Modelling the tetramers of GlpF based on the cryo-EM structures of GlpFPOPC and GlpF-ΔP177POPC, the initial structures were generated by burying the tetramers in POPC bilayers. The two systems are referred as IGlpF/POPC and IGlpF-ΔP177/POPC hereafter, and were used in the MD simulations (Supplementary Fig. 13a, b and Supplementary Movies 5 and 6). The overall structures of IGlpF/POPC and IGlpF-ΔP177/POPC from Ser5 to Gly256 were not denatured during the MD simulations (Supplementary Fig. 13c). The C-terminus Arg257 to Cys261 of IGlpF/POPC fluctuated quite a bit in the MD simulations (Supplementary Fig. 13c).

The distances Val173–Ala79 and Gly170–Ala79 for each chain of the tetramers of IGlpF/POPC were monitored during the MD trajectory (Fig. 6a–d). The distance Val173–Ala79 moved with an amplitude of ~3 Å (Fig. 6a, b), and the distance Gly170–Ala79 moved more than that of Val173–Ala79 (Fig. 6c, d). The snapshot of loop D of IGlpF/POPC during the MD simulation showed that the position of loop D changed and water passed through near the loop D (Fig. 6e). The distance Val173–Ala79 for each chain of the tetramers in IGlpF-ΔP177/POPC was monitored during the MD trajectory (Fig. 6f, g). The distance Val173–Ala79 moved with an amplitude of ~5 Å (Fig. 6f, g). The loop D of IGlpF-ΔP177/POPC maintained its position away from the pore. The fluctuation of loop D in IGlpF/POPC was larger than that in IGlpF-ΔP177/POPC (Supplementary Fig. 13d). The fluctuations observed in the loop D of GlpF wild-type and GlpF ΔP177 were consistent with the experimental structural flexibilities represented by the B-factor of cryo-EM structures of GlpFPOPC and GlpF-ΔP177POPC (Supplementary Fig. 13e).

Fig. 6: MD simulations and stopped-flow measurements of GlpF and its mutant.
figure 6

ad Initial and monitored distance between Val173(Cα) and Ala79(Cα) (a, b) and between Gly170(Cα) and Ala79(Cα) (c, d) in MD simulations for IGlpF/POPC. In the graphs of the monitored distance, the plots of the four chains of tetramers are shown (b, d). The loop D is represented by magenta (a, c). e Representative water distribution and the moved loop D in a snapshot for IGlpF/POPC during the MD simulation. The loop D and water molecules are represented by magenta and spheres, respectively. Initial (f) and monitored distance (g) between Val173(Cα) and Ala79(Cα) in MD simulations for I GlpF-ΔP177/POPC. In the graphs of the monitored distance, the plots of the four chains of tetramers are shown (g). The loop D is represented by magenta. h Water distribution in pores during MD simulations for two systems. The pore axes are aligned with z-axes of the cryo-EM structures. The origin of the pore axis is set at the z-coordinate of Cα atom of Asn68. i The measured protein/phospholipid weight ratio of GlpF and GlpF ΔP177 proteoliposomes. Averaged values ± SEM and individual data points are plotted (n = 4 independently reconstituted liposome samples from the same purified proteins). Data are statistically analysed using Student’s two-tailed unpaired t test. P value is indicated. j Stopped-flow measurements under NaCl hyperosmotic conditions for GlpF and GlpF ΔP177 proteoliposomes and empty liposomes. Averaged water permeability Pf values ± SEM and individual data points are plotted (n = 4 independently reconstituted liposome samples from the same purified proteins). Data are statistically analysed using multiple comparison Tukey test. P values are indicated. ***P < 0.001. k Stopped-flow measurements under glycerol gradient conditions for GlpF and GlpF ΔP177 proteoliposomes and empty liposomes. Averaged glycerol permeability Pgly values ± SEM and individual data points are plotted (n = 4 independently reconstituted liposome samples from the same purified proteins). Data are statistically analysed using multiple comparison Tukey test. P values are indicated. ***P < 0.001. Source data are provided as a Source Data file.

Water distribution in the GlpF pores was calculated in the same manner as that in AQP3 (Fig. 6h). The relative density of water molecules in IGlpF/POPC was lower than that in IGlpF-ΔP177/POPC within the pore axis range of −15 to −5 Å, where the pore is capped by loop D, but it was not close to zero (Fig. 6h). The total number of passed water molecules in the MD simulations was 1132 for IGlpF/POPC and 2867 for IGlpF-ΔP177/POPC. Water molecule orientation in the pore was also analysed. Water molecules are oriented in opposite directions in the two regions of the pore of GlpF wild-type and GlpF ΔP177 separated by the NPA motif (Supplementary Fig. 13f). This finding is consistent with a previous report showing water molecule orientations in the GlpF pore21.

MD simulations of GlpF wild-type and GlpF ΔP177 in POPC bilayers were conducted similarly for AQP3. The four chains in the tetramer had different initial conformations and velocities of atoms in the production run because of the energy minimisation and equilibration runs before it. The simulation length was sufficient for examining the structural stability and water distribution.

We prepared GlpF wild-type and GlpF ΔP177 reconstituted POPC proteoliposomes, which had similar protein contents (Fig. 6i, Supplementary Fig. 13g, and Supplementary Table 1). Water permeability (Fig. 6j, Supplementary Fig. 13h, and Supplementary Table 1) and glycerol permeability (Fig. 6k, Supplementary Fig. 13i, and Supplementary Table 1) did not significantly differ between reconstituted GlpF wild-type and GlpF ΔP177.

Discussion

This study presents cryo-EM structures of aquaglyceroporins rat AQP3 and E. coli GlpF in DDM micelles and POPC nanodiscs, and human AQP7 in POPC nanodiscs, revealing distinct conformations among them (Supplementary Fig. 14). The narrowed pore structure of AQP3POPC through extracellular loop E with the insertion of Tyr212 into the pore is an AQP conformation not previously observed.

We purified DDM-solubilised AQP3 from insect cells. The cryo-EM structure of AQP3DDM showed that the ar/R filter containing Tyr212 forms a pore with a 2.8-Å diameter (Tyr212-out conformation; Fig. 1e). The stability of the reentrant loop E containing Tyr212 represented by the B-factor of cryo-EM structure of AQP3DDM was inconsistent with fluctuations observed in the Tyr212-out conformations by MD simulations (Supplementary Fig. 8f, g). We observed an unassigned cryo-EM density map at the pore of AQP3DDM (Fig. 1h); the unassigned density might stabilise the Tyr212-out conformation and inhibit the Tyr212-in conformation in the cryo-EM structure of AQP3DDM. The cryo-EM structure of AQP3-Y212TDDM exhibits an apparent DDM density at the pore inserted from the intracellular side (Fig. 3i), suggesting that the unassigned density observed in the pore of AQP3DDM might also correspond to DDM. Due to the unclear map shape, however, it was difficult to assign DDM or lipids in the cryo-EM map of AQP3DDM pore.

AQP3 was reconstituted into POPC nanodiscs by mixing DDM-solubilised AQP3 with MSP proteins and POPC, followed by the application of Bio-Beads to remove the DDM. The cryo-EM structure of AQP3POPC showed that Tyr212 inserts into the pore (Tyr212-in conformation; Fig. 1f). The Tyr212-in conformation is the only conformation reconstructed during the cryo-EM data processing (Supplementary Fig. 2d), although we cannot exclude the possibility that the protein particles that form the Tyr212-out conformation are mixed in the micrographs. The three-dimensional (3D) classification also showed that smaller and larger size of MSP nanodiscs are mixed, but both exhibited the Tyr212-in conformation (Supplementary Fig. 2d), suggesting that the interaction between AQP3 and MSP is not important for the Tyr212-in conformation. This is further supported by cryo-EM analysis of AQP3DMPC because one monomer (Monomer B; Supplementary Fig. 7a, b) that does not seem to interact with the MSP disc forms the Tyr212-in conformation. Structures of AQP3POPC and AQP3DMPC suggest that the Tyr212-in conformation does not depend on the length of the acyl chain in PC lipids.

Cryo-EM structures of AQP3 mutants in POPC nanodiscs suggest that aromatic residue Tyr212 or Y212F must stably insert into the AQP3 pore (Fig. 3a). We speculate that the positive electrostatic field at the NPA motifs, generated by the helical dipole moment of the two short HB and HE helices, attracts the pi-electrons of the aromatic residue to stabilise the AQP3 structure. The aromatic residue itself is not sufficient to insert into the pore in POPC nanodiscs, as the pore narrowed by reentrant loop E is not observed in AQP7 and GlpF, although they share aromatic residues Tyr223 and Phe200 at the position corresponding to Tyr212 in AQP3 (Fig. 5b, c). AQP3 contains unique sequences that are not present in AQP7 and GlpF, but may be critical for forming the Tyr212-in conformation.

The Tyr212-in conformation of AQP3POPC is a pore-occluded state as calculated by the pore detection programme MOLEonline (Supplementary Fig. 4b) and as supported by MD simulations of the water distribution in the AQP3 Tyr212-in conformation within the POPC bilayer system (Fig. 2g). AQP3 is a known aquaglyceroporin, and we confirmed that AQP3 reconstituted in POPC liposomes exhibits both water and glycerol permeability (Fig. 4). Therefore, we speculate that, upon the permeation of water and glycerol, the AQP3 Tyr212-in conformation undergoes conformational rearrangement in the plug loop, transitioning to a water- or glycerol-permeable state, possibly resembling the Tyr212-out conformation.

The water and glycerol permeability of AQP3 Y212F in POPC liposomes is similar to that of AQP3 wild-type (Fig. 4). This is consistent with the cryo-EM structure of AQP3-Y212FPOPC in which Y212F is inserted into the pore like AQP3 wild-type (Fig. 3a). The water and glycerol permeability of AQP3 Y212T is slightly faster than that of AQP3 wild-type and AQP3 Y212F (Fig. 4). This may be consistent with the cryo-EM structure of AQP3-Y212TPOPC, in which Y212T is not inserted into the pore (Fig. 3b), and a wider pore is detected compared with AQP3POPC and AQP3-Y212TPOPC (Supplementary Fig. 4b). The loop E containing Y212T might be sufficiently flexible to allow water and glycerol to pass through in the POPC environment.

We present cryo-EM structural evidence for GlpF with pore capping by intracellular loop D both in DDM micelles and POPC nanodiscs (Fig. 5c, e). Loop D does not cap the pore in the X-ray crystal structure of GlpF, suggesting that crystal packing may account for the differences between the X-ray crystal and cryo-EM structures of GlpF. Intracellular loop D caps the pore in the cryo-EM structure of GlpFPOPC, but the loop D is suggested to be flexible by its B-factor (Supplementary Fig. 13e). MD simulations of GlpF in POPC bilayer support that the intracellular loop D is flexible and that water molecules could pass through near the loop D (Fig. 6a–e, h and Supplementary Fig. 13d). We found that the loop D conformation exhibits an open pore in GlpF-ΔP177POPC (Fig. 5g). MD simulations and stopped-flow measurements using GlpF wild-type and ΔP177 (Fig. 6) suggest that the narrowed pore conformation by intracellular loop D is not largely related to the GlpF permeability. This may be similar to the case of OsNIP2;1 loop D, in which MD simulations showed a rapid opening of the loop D and its physiological significance remains to be elucidated36,37. In contrast, for SoPIP2;1, the low pH keeps the SoPIP2;1 loop D closed, and phosphorylation is proposed to modulate loop D gating38,39.

In addition to the narrowed pore structures of AQPs by loop D capping and AQP3 by pore-inserted loop E, other narrowed AQP structures are reported. The X-ray crystal structure of yeast Aqy1 shows its intracellular N-terminus blocking the pore40. The X-ray crystal structure of AQP10 shows a pH-dependent closed state by intracellular loop B29. The X-ray crystal structure of the freshwater fish climbing perch AQP shows that extracellular loop C narrows the pore41. The electron crystal structure of AQP0 shows that junctions are formed via extracellular loops with a closed conformation42,43. Thus, AQPs appear to have diverse narrowed pore conformations.

In conclusion, we identified distinct conformations of aquaglyceroporins AQP3, AQP7, and GlpF, and revealed AQP3-specific narrowed pore conformation by the insertion of Tyr212 in an extracellular loop. AQP3 is localised in skin keratinocytes, contributing to physiological and pathological skin conditions44,45. AQP3 is also involved in cancer progression46. Our newly uncovered structural information may contribute to the development of chemicals and drugs targeting AQP3.

Methods

DNA constructs

Complementary DNAs of rat AQP3 and human AQP7 proteins (Uniprot: P47862 and O14520, respectively) were cloned from cDNA libraries, while that of E. coli (strain K12) GlpF protein (Uniprot: P0AER0) was commercially synthesised. The full-length genes were subcloned into the pFastBac vector (Invitrogen) using the Gibson assembly system (New England Biolabs). For AQP3 and GlpF, additional residues, including an octa-His tag, a thrombin digestion site, and a linker sequence, were attached at the N-terminus. For AQP7, additional residues, including an octa-His tag, enhanced green fluorescent protein (A207K), a tobacco etch virus protease digestion site, and a linker sequence, were attached at the N-terminus. Site-directed mutagenesis was achieved by overlap extension polymerase chain reaction.

Expression and purification of MSP proteins

Plasmids for MSP1D1 and MSP1E3D1 expression were purchased from Addgene (Addgene plasmid nos. 20,061 and 20,066)47,48. The plasmids were transformed into BL21-Gold(DE3) (Agilent), which was cultured at 37 °C. Expression of MSP proteins was induced by isopropyl β-D-thiogalactopyranoside. The harvested cells were suspended in buffer containing 40 mM Tris (pH 8.0), 300 mM NaCl, 1% (v/v) Triton X, and 1 mM phenylmethylsulfonyl fluoride, and then disrupted by sonication. The sonicated cell suspension was centrifuged at 22,095 × g for 25 min. The collected supernatant was bound to nickel-nitrilotriacetic acid-agarose resin (Ni-NTA, Qiagen) under 1 mM L-His for 5 h. The resin was washed with 50 mM sodium cholate and 20 mM imidazole and then eluted with buffer containing 40 mM Tris (pH 8.0), 300 mM NaCl, and 450 mM imidazole. The eluate was dialysed in buffer containing 20 mM Tris (pH 8.0) and 150 mM NaCl. Tobacco etch virus protease was incubated, and then Ni-NTA was mixed to remove the tag. The pass fractions were collected and dialysed in buffer containing 20 mM Tris (pH 8.0) and 150 mM NaCl.

Expression, purification, and nanodisc reconstitution of AQPs

All constructs were expressed in Sf9 (Novagen, 71104) or Sf+ (Protein Sciences Corporation) insect cells using the Bac-to-Bac Baculovirus Expression System (Invitrogen). Cells were cultured in suspension at 27 °C, harvested after 48 h infection, and stored at −80 °C.

For AQP7, cells were suspended in a phosphate-buffered saline buffer supplemented with 2 mM EDTA and 1 mM phenylmethylsulfonyl fluoride, and then disrupted by sonication. The sonicated cell suspension was centrifuged at 960 × g for 10 min, and the supernatant was centrifuged at 100,000 × g for 45 min. The collected crude membranes were solubilised for 1 h at 4 °C in buffer containing 20 mM Tris (pH 8.0), 300 mM NaCl, and 1% (w/v) DDM. After centrifugation at 100,000 × g for 45 min, the detergent concentration of the supernatant was decreased to 0.2% DDM by dilution and bound to Ni-NTA under 10 mM imidazole overnight. The resin was washed with 50 mM imidazole and then eluted with 300 mM imidazole in buffer containing 20 mM Tris (pH 8.0), 300 mM NaCl, and 0.05% DDM. The eluate was concentrated and loaded onto a Superose 6 Increase 10/300 GL column (Cytiva) equilibrated with buffer containing 20 mM Tris (pH 8.0), 150 mM NaCl, and 0.05% DDM. Peak fractions were pooled as AQP7 in DDM micelles. MSP1E3D1 was used to reconstitute DDM-solubilised AQP7 into nanodiscs. POPC lipids in chloroform were dried using nitrogen gas and rehydrated with 2% DDM. The AQP-monomer:MSP:POPC molar ratio for AQP7 was 1:0.5:30. Mixed samples were incubated with Bio-Beads SM-2 resin (Bio-Rad) overnight at 4 °C with constant rotation. The mixture, except for the Bio-Beads, was mixed with Ni-NTA to remove empty nanodiscs. The eluate was concentrated and loaded onto a Superose 6 Increase 10/300 GL column in buffer containing 20 mM Tris (pH 8.0) and 150 mM NaCl. Peak fractions were pooled as AQP7 in POPC nanodiscs.

For AQP3, GlpF, and their mutants, cells were suspended in buffer containing 20 mM Tris (pH 7.5) and 150 mM NaCl, and then disrupted by sonication. The sonicated cell suspension was centrifuged at 960 × g for 10 min, and the supernatant was centrifuged at 100,000 × g for 30 min. The collected crude membranes were solubilised in buffer containing 20 mM Tris (pH 7.5), 150 mM NaCl, 5% (v/v) glycerol, and 1% DDM for 1 h at 4 °C. After centrifugation at 100,000 × g for 30 min, the DDM concentration of the supernatant was decreased to 0.4% by dilution and bound to Ni-NTA under 1 mM L-His for approximately 3 h. The resin was washed with 5 mM L-His and then eluted with 300 mM L-His in buffer containing 20 mM Tris (pH 7.5), 150 mM NaCl, 5% glycerol, and 0.03% DDM. The eluate was concentrated and loaded onto a Superose 6 Increase 10/300 GL column in buffer containing 20 mM Tris (pH 7.5), 150 mM NaCl, and 0.03% DDM. Peak fractions were pooled as AQP3, GlpF, or their mutants in DDM micelles, and reconstituted into nanodiscs using MSP1D1. The AQP-monomer:MSP:POPC molar ratios were as follows: 1:5:200 for AQP3POPC, AQP3-Y212FPOPC, and AQP3DMPC; and 1:5:50 for AQP3-Y212TPOPC, GlpFPOPC, and GlpF-ΔP177POPC. Mixed samples were incubated with Bio-Beads SM-2 resin overnight at 4 °C with constant rotation, except for AQP3DMPC at room temperature. The mixture, except for the Bio-Beads, was concentrated and loaded onto a Superose 6 Increase 10/300 GL column in buffer containing 20 mM Tris (pH 7.5) and 150 mM NaCl. Two peaks appeared to correspond to AQPs in nanodiscs and empty nanodiscs. The first peak fractions containing AQPs in nanodiscs were mixed and bound to Ni-NTA to further remove the contaminated empty nanodiscs. The eluate was concentrated and loaded onto a Superose 6 Increase 10/300 GL column in buffer containing 20 mM Tris (pH 7.5) and 150 mM NaCl. Peak fractions were pooled as AQP3, GlpF, or their mutants in nanodiscs.

The LabAssay Phospholipid Kit (FUJIFILM Wako) and SpectraMax i3x microplate reader (Molecular Devices) were used to measure the phospholipid concentrations in the purified proteins of AQP3DDM and AQP3POPC diluted to an absorbance at 280 nm of 3. AQP3 monomer concentration in the diluted purified AQP3DDM was calculated to be 2.2 mg/mL from the absorbance and molar extinction coefficient. The purified AQP3POPC contains AQP3 and MSP1D1 proteins. The weight ration of AQP3:MSP1D1 was estimated to be 2:1 by sodium dodecyl sulphate-polyacrylamide gel electrophoresis analysis (Supplementary Fig. 1b). Therefore, AQP3 concentration in the diluted purified AQP3POPC was estimated to be 1.7 mg/mL from the absorbance and molar extinction coefficients of AQP3 and MSP1D1. The detected phospholipid concentrations were represented as the normalised phospholipid concentrations for AQP3 concentration of 1 mg/mL.

Cryo-EM data acquisition

The purified AQPs were concentrated at the appropriate concentrations as follows: 0.7–2.9 mg/mL for AQPs in lipid nanodiscs; 19 mg/mL for AQP3DDM; and 5.0 mg/mL for AQP3-Y212TDDM and GlpFDDM. The concentrated proteins were loaded onto glow-discharged Quantifoil holey carbon grids (R1.2/1.3, Au, 300 mesh), which were blotted for approximately 4.0 s at 4 °C and plunge-frozen in liquid ethane using a Vitrobot Mark IV (Thermo Fisher Scientific). Data collection was performed on a JEM-Z320FHC electron microscope (JEOL, Japan) cooled by liquid nitrogen at 300 kV, equipped with a cold field emission gun and an in-column energy filter using a zero-loss slit width of 20 eV. All images were recorded on a K2 Summit direct electron detector camera (Gatan) operated in electron counting mode. SerialEM49 was used for automatic data acquisition. The dose rate was limited to 8.0 e/pixel/s, and the exposure time was 8.0 s and fractionated into 40 frames without a pre-dose delay. The refined pixel size and the accumulated dose for each sample of AQPs are summarised in Supplementary Table 2.

Cryo-EM data processing

All image processing was performed using RELION50,51 and cryoSPARC52. The stacked frame movies were subjected to motion correction with MotionCor253 in RELION. Contrast transfer function (CTF) estimation was performed with cryoSPARC patch CTF algorithm. Micrographs with an estimated resolution worse than 4 or 5 Å were removed. Initial particles were picked from the yielded micrographs by the blob picker or the template picker using fine two-dimensional (2D) class averages in cryoSPARC. The initial particles were subjected to multiple rounds of 2D classification and selected for heterogeneous refinement in cryoSPARC. After heterogeneous refinement using multiple reference maps generated from ab-initio reconstruction or 3D refinement, the classified particles were subjected to homogeneous refinement and non-uniform (NU) refinement54 in cryoSPARC. These refined particles were then imported to RELION using the PyEM algorithm [https://doi.org/10.5281/zenodo.3576630], followed by 3D refinement, Bayesian polishing55, and 3D classification without alignment in RELION. The classified particles were imported back to cryoSPARC for 2D classification and finally subjected to NU refinement and global CTF refinement. Map resolutions under the refined pixel sizes were estimated according to the gold-standard Fourier shell correlation using the 0.143 criterion56 in cryoSPARC (Supplementary Fig. 3). Local resolution was estimated in cryoSPARC.

For AQP3DDM (Supplementary Fig. 2a, c, e), 3251 movies were collected, yielding 2470 micrographs, 1,948,354 initial particles, and 385,812 selected particles for heterogeneous refinement under C1 symmetry. The selected 172,527 particles after the heterogeneous refinement were processed by Bayesian polishing and 3D classification without alignment in RELION. The selected 161,557 particles were imported back to cryoSPARC for 2D classification and NU refinement under C4 symmetry with global CTF refinement. Finally, a 3.12 Å resolution map was obtained from 153,591 particles.

For AQP3POPC (Supplementary Fig. 2b, d, f), 1557 and 6012 movies from two datasets were collected, yielding 1362 and 5174 micrographs, 1,216,419 and 4,627,389 initial particles, and 198,592 and 788,271 selected particles for heterogeneous refinement under C1 symmetry. The 351,377 selected particles after the heterogeneous refinement were processed by Bayesian polishing and 3D classification without alignment in RELION. The selected 279,337 particles were imported back to cryoSPARC for 2D classification and NU refinement under C4 symmetry with global CTF refinement. Finally, a 2.94 Å resolution map was obtained from 271,900 particles.

For AQP3DMPC, 6021 movies were collected, yielding 5454 micrographs, 4,890,341 initial particles, and 1,081,920 selected particles for heterogeneous refinement under C1 symmetry. The selected 500,069 particles after the heterogeneous refinement were processed by Bayesian polishing and 3D classification without alignment in RELION. The selected 236,611 particles were imported back to cryoSPARC for 2D classification and NU refinement under C1 symmetry with global CTF refinement. Finally, a 3.03 Å resolution map was obtained from 228,009 particles.

For AQP3-Y212FPOPC, 6885 movies were collected, yielding 6320 micrographs, 4,658,285 initial particles, and 810,927 selected particles for heterogeneous refinement under C1 symmetry. The selected 240,685 particles after the heterogeneous refinement were processed by Bayesian polishing and 3D classification without alignment in RELION. The selected 145,026 particles were imported back to cryoSPARC for 2D classification and NU refinement under C4 symmetry with global CTF refinement. Finally, a 2.90 Å resolution map was obtained from 135,650 particles.

For AQP3-Y212TPOPC, 7713 movies were collected, yielding 6781 micrographs, 6,873,017 initial particles, and 1,854,449 selected particles for heterogeneous refinement under C1 symmetry. The selected 756,357 particles after the heterogeneous refinement were processed by Bayesian polishing and 3D classification without alignment in RELION. The selected 308,080 particles were imported back to cryoSPARC for 2D classification and NU refinement under C4 symmetry with global CTF refinement. Finally, a 2.67 Å resolution map was obtained from 295,521 particles.

For AQP3-Y212TDDM, 4887 movies were collected, yielding 4250 micrographs, 1,789,455 initial particles, and 334,358 selected particles for heterogeneous refinement under C4 symmetry. The selected 127,686 particles after the heterogeneous refinement were processed by Bayesian polishing and 3D classification without alignment in RELION. The selected 60,523 particles were imported back to cryoSPARC for 2D classification and NU refinement under C4 symmetry with global CTF refinement. Finally, a 3.39 Å resolution map was obtained from 54,899 particles.

For AQP7POPC, 3636 movies were collected, yielding 3010 micrographs, 3,638,849 initial particles, and 148,023 selected particles from iterative heterogeneous refinement under C4 symmetry. The selected particles were processed by Bayesian polishing in RELION. The particles were imported back to cryoSPARC for NU refinement under C4 symmetry with global CTF refinement. Finally, a 2.49 Å resolution map was obtained.

For GlpFDDM, 4012 and 6264 movies from two datasets were collected, yielding 3762 and 5676 micrographs, 993,334 and 1,380,923 initial particles, and 162,007 and 118,824 selected particles for heterogeneous refinement under C1 symmetry. The selected 139,789 particles after the heterogeneous refinement were processed by Bayesian polishing and 3D classification without alignment in RELION. The selected 112,714 particles were imported back to cryoSPARC for 2D classification and NU refinement under C4 symmetry with global CTF refinement. Finally, a 3.74 Å resolution map was obtained from 110,202 particles.

For GlpFPOPC (Supplementary Fig. 12a), 4824 movies were collected, yielding 4318 micrographs, 4,067,871 initial particles, and 1,171,447 selected particles for heterogeneous refinement under C1 symmetry. The selected 520,152 particles after the heterogeneous refinement were processed by Bayesian polishing and 3D classification without alignment in RELION. The selected 268,667 particles were imported back to cryoSPARC for 2D classification and NU refinement under C4 symmetry with global CTF refinement. Finally, a 2.43 Å resolution map was obtained from 252,983 particles.

For GlpF-ΔP177POPC (Supplementary Fig. 12c), 8100 movies were collected, yielding 6934 micrographs, 8,748,925 initial particles, and 2,231,512 selected particles for heterogeneous refinement under C1 symmetry. The selected 1,062,495 particles after the heterogeneous refinement were processed by Bayesian polishing and 3D classification without alignment in RELION. The selected 475,168 particles were imported back to cryoSPARC for 2D classification and NU refinement under C4 symmetry with global CTF refinement. Finally, a 2.26 Å resolution map was obtained from 462,907 particles.

Model building and refinement

AlphaFold structures57,58 were used as starting models for AQP3DDM, AQP7POPC, and GlpFPOPC. The obtained cryo-EM maps were processed in the Servalcat pipeline59 to trim the maps with the refined pixel sizes. Model refinement using Coot60 and real-space refinement in PHENIX61 were iterated. The refined models were C4 symmetry imposed in the Servalcat pipeline except for AQP3DMPC. The geometry was evaluated using MolProbity62. The final refinement statistics are summarised in Supplementary Table 2. The molecular graphics figures were prepared using UCSF ChimeraX63. Pore radius of AQPs was calculated using MOLEonline server v.2.5 pore mode64.

Reconstitution of AQPs into proteoliposomes

POPC lipids in chloroform were dried using nitrogen gas, freeze-dried under vacuum, and rehydrated with buffer containing 20 mM Tris (pH 7.5) and 150 mM NaCl, followed by sonication. The POPC solution was diluted to 1 mg/mL with 0.1% DDM. The pooled DDM-solubilised AQPs were diluted to 0.1 mg/mL. The AQP-monomer:POPC weight ratios were ~1:800 for mixing. Bio-Beads SM-2 resin was added and incubated overnight at 4 °C with constant rotation. The mixture, except for the Bio-Beads, was subjected to ultracentrifuge at 100,000 × g for 20 min. The collected pellet was suspended and sonicated in buffer containing 20 mM Tris (pH 7.5) and 150 mM NaCl. The empty liposomes were made without AQPs. To measure the concentrations of proteins and phospholipids in the proteoliposomes and empty liposomes, the SYPRO Ruby stain (Thermo Fisher Scientific) of sodium dodecyl sulphate-polyacrylamide gel electrophoresis analysis gels and the LabAssay Phospholipid Kit were used, respectively. The SYPRO Ruby fluorescence was detected using a light-emitting diode transilluminator and analysed using Fiji software. The fluorescence intensity of the protein marker (XL-Ladder; Apro Science) was used to estimate protein concentrations.

Stopped-flow analysis

The 20 μL reconstituted samples diluted up to 1750 μL with buffer containing 20 mM Tris (pH 7.5) and 150 mM NaCl were used for measurements carried out on a SX20 stopped-flow spectrometer (Applied Photophysics). Measurements under hyperosmotic conditions were obtained by mixing diluted samples and the same volume of buffer containing 20 mM Tris (pH 7.5) and 250 mM NaCl, resulting in an outer buffer containing 20 mM Tris (pH 7.5) and 200 mM NaCl, leading to a 50-mM inwardly directed NaCl gradient. Another measurement was obtained by mixing diluted samples and the same volume of buffer containing 20 mM Tris (pH 7.5), 150 mM NaCl, and 200 mM glycerol, resulting in an outer buffer containing 20 mM Tris (pH 7.5), 150 mM NaCl, and 100 mM glycerol, leading to a 100-mM inwardly directed glycerol gradient. The kinetics of vesicle shrinkage or swelling were measured by recording the change in intensity of 90° scattered light at a wavelength of 450 nm at 20 °C over time.

The change in signal intensity under NaCl hyperosmotic conditions was calculated by subtracting the averaged intensity from 0.10 to 0.14 s and normalised by the maximum intensity from 0.10 to 1.5 s. The time course of the intensity change under NaCl hyperosmotic conditions was plotted from 0.10 to 1.5 s and fitted to a single-exponential curve to determine the rate constant k using IGOR Pro (WaveMetrics). The sizes of proteoliposomes and empty liposomes were verified by a particle analyser (ELSZ, Otsuka Electronics) at 20 °C. The water permeability coefficient (Pf) under hyperosmotic conditions was calculated according to the following equation as previously described29: Pf = (k × V0)/(ΔC × Vw × A), where V0 is the volume of the liposome sample, A is the surface area of the liposome sample, ΔC is the osmotic gradient, and Vw is the molar volume of water.

The change in signal intensity under glycerol gradient conditions was calculated by subtracting the minimum intensity from 0.10 to 10 s and normalised by the maximum intensity from 0.10 to 10 s. The time course of the intensity change under glycerol gradient conditions was plotted from 0.10 to 10 s. The glycerol permeability (Pgly) was calculated from the following equation as previously described29,65: Pgly = 1/((S/V) × τ), where S/V is the surface-to-volume ratio and τ is the exponential time constant fitted to the swelling phase of light scattering time course.

Statistics and reproducibility

All data are presented as means along with the standard error of the mean (SEM). The phospholipid concentrations in the purified proteins were measured in triplicate from the same purified proteins. Measurements using proteoliposomes and empty liposomes were performed in triplicate or tetraplicate from independently reconstituted liposome samples. The statistical analyses were performed using Student’s two-tailed unpaired t test in Excel (Microsoft) and multiple comparison Tukey test in IGOR Pro. A value of P < 0.05 was considered significant. The purification of each AQP protein by the detergent was examined at least two times, and it was reliably reproduced.

MD simulations

For MD simulations of AQP3, the initial structures were modelled using the cryo-EM structures exhibiting the Tyr212-out conformation (AQP3DDM) and the Tyr212-in conformation (AQP3POPC). In the modelling, we used the residues from Tyr19 to Glu270 of the AQP3 structures. The AQP3 structures were modelled as a tetramer generating symmetry copies with ChimeraX63. The protonation states were determined based on AQP3POPC structure, using PROPKA66 available in Maestro (Schrödinger, LLC, USA) and assuming an aqueous environment at pH 7.5. Both the Tyr212-out and Tyr212-in conformations of AQP3 were buried in DDM micelles and in a POPC lipid bilayer using Micelle Builder and Bilayer Builder, respectively, of the CHARMM-GUI web server67,68,69,70,71,72,73,74,75. The number of DDM molecules was adjusted to 280 per AQP3 tetramer to prevent gaps and holes in the micelles. The position of the membrane for AQP3 was determined from that for AQP7 (PDB: 6QZI)25 available in OPM database76. Thus, four systems for MD simulations were generated and referred to as ITyr212-out/DDM, ITyr212-out/POPC, ITyr212-in/DDM, and ITyr212-in/POPC (Supplementary Fig. 8a–d). The generated solution systems were neutralised adding Na+ and Cl ions, where the concentration of ions was set at 0.15 mol/L. The number of atoms in the generated systems was 260,379 for ITyr212-out/DDM, 120,262 for ITyr212-out/POPC, 259,012 for ITyr212-in/DDM, and 121,615 for ITyr212-in/POPC.

The MD simulations for the four systems were conducted using GROMACS 202277. The employed force field for protein was CHARMM36m78, and that for DDM and POPC was CHARMM3679,80,81,82,83. The setup of the force fields is the default one of Micelle Builder and Bilayer Builder of CHARMM-GUI67,68,69,70,71,72,73,74,75. In particular, the force fields of protein and POPC are often used for MD simulations in membrane–protein systems. As the water model, TIP3P84 was employed. The bond lengths involving hydrogens were constrained using LINCS85. The electrostatic interactions were calculated using the Particle Mesh Ewald method86. The van der Waals interactions were smoothly truncated over 1.0–1.2 nm. Before the MD simulation production runs, the energy minimisation by the steepest descent method and six equilibration runs, referred to as EQ1–EQ6 hereafter, were conducted sequentially according to the default setup of Micelle Builder and Bilayer Builder of CHARMM-GUI67,68,69,70,71,72,73,74,75. EQ1 and EQ2 were conducted in the NVT ensemble at 300 K, and EQ3–EQ6 were conducted in the NPT ensemble at 300 K and 1 atm. The thermostat in EQ1–EQ6 was the weak-coupling scheme of Berendsen87, and the barostat in EQ3–EQ6 was the isotropic Berendsen algorism87 for ITyr212-out/DDM and ITyr212-in/DDM and semi-isotropic Berendsen87 for ITyr212-out/POPC and ITyr212-in/POPC. The timestep and simulation length were set to 0.001 and 125 ps, respectively, in EQ1–EQ3 and to 0.002 and 500 ps, respectively, in EQ4–EQ6. In the energy minimisation and EQ1–EQ6, structural restraints with reference to the initial structures were applied, the force constants of which were reduced in a stepwise manner from EQ1 to EQ6. The ranges of force constants from EQ1 to EQ6 were 4000–50 kJ/mol/nm2 for backbone atoms of protein, 2000–0 kJ/mol/nm2 for sidechain non-hydrogen atoms, 1000–0 kJ/mol/nm2 for positional restraints of DDM and POPC molecules, and 1000–0 kJ/mol/rad2 for dihedral restraints of DDM and POPC molecules. The force constants in the energy minimisation were the same as those in EQ1. Positional restraints were imposed on carbon atoms connecting the tail and head group of DDM and on the z-coordinate of the phosphorus atoms of POPC. Dihedral restraints were imposed on 12 bonds involving carbon and oxygen atoms forming the two sugar rings in the head group of DDM and on the double bond between carbon atoms in the oleic acids of POPC.

After the equilibration runs, a 500 ns production run (timestep = 0.002 ps) was conducted for each of the four systems. The NPT ensemble at 300 K and 1 atm was employed in all production runs. The thermostat was the Nosé–Hoover method88,89, and the barostat was the isotropic Parrinello–Rahman method90,91 for ITyr212-out/DDM and ITyr212-in/DDM and semi-isotropic Parrinello–Rahman90,91 for ITyr212-out/POPC and ITyr212-in/POPC. Snapshots taken every 100 ps were used for analysis.

In a similar manner, to conduct MD simulations of GlpF, the initial structures were modelled using the cryo-EM structures exhibiting pore capped by loop D (GlpFPOPC) and pore opening (GlpF-ΔP177POPC). In the modelling, we used the residues from Ser5 to Cys261 of the GlpFPOPC structure and the residues from Ser5 to Gly256 of the GlpF-ΔP177POPC structure. The GlpF and GlpF-ΔP177 structures were modelled as a tetramer using ChimeraX63. The protonation states were determined based on the GlpFPOPC structure, using PROPKA66 of Maestro. Both the modelled open and capped pore structures of GlpF were buried in a POPC lipid bilayer using Bilayer Builder of CAHRMM-GUI67,68,69,70,71,72,73,74,75. The position of the membrane for GlpF was determined from that for GlpF (PDB: 1LDF)21 available in OPM database76. The two systems for MD simulations are referred to as IGlpF/POPC and IGlpF-ΔP177/POPC (Supplementary Fig. 13a, b). The generated systems were neutralised by adding Na+ and Cl ions, where the concentration of ions was set at 0.15 mol/L. The number of atoms in the generated systems is 128,094 for IGlpF/POPC and 122,487 for IGlpF-ΔP177/POPC. According to the same setup used for ITyr212-out/POPC and ITyr212-in/POPC of AQP3, the energy minimisation, equilibration runs, and production runs were conducted.

To investigate distributions of water molecules within the AQP3 pores, the pore regions were defined as cylinders with a radius of 8 Å, the central axes of which pass through the intermediate points between Cα atoms of Asn83 and Val195. Aligning AQP3 structures of all MD snapshots with the initial structures, the central axes of the pores were set in parallel to the z-axes. Dividing the defined pore region into small cylinders every 0.1 Å along the central axis, the number of oxygen atoms of water molecules within each small cylinder was counted for all snapshots, which provided the simulated distributions of water molecules within the pores for ITyr212-out/DDM, ITyr212-out/POPC, ITyr212-in/DDM, and ITyr212-in/POPC. In the same manner, the water distribution for GlpF was calculated. The central axis of the defined pore region for GlpF passes through the intermediate point between Cα atoms of Asn68 and Val52. The water molecules that passed through the pores were also counted for all MD trajectories. The number of oxygen atoms in the water molecules that passed was counted, which was the sum of both the passage in the extracellular direction and that in the intracellular direction. The water orientation in the pores was also calculated. The order parameter of water orientation was defined as the averaged cosθ, where θ of a water molecule is the angle between its dipole moment and the pore axis. The MD trajectories were analysed using Ambertools2292. The structural snapshots and movies of MD simulations were visualised by Pymol (Schrödinger, LLC, USA) and VMD93, respectively.

Details of the system setup for AQP3 and GlpF MD simulations are provided in Supplementary Tables 3 and 4.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.