Introduction

A dynamic library1,2,3,4,5,6,7,8,9 is reminiscent of an evolutionary process in that a certain species is selected under environmental change. Molecular self-assembly systems in which the isomers of assemblies are generated10,11,12,13,14,15 are suitable for creating dynamic libraries. Selection in such dynamic libraries16,17,18,19 and structural conversion20,21,22,23,24,25,26,27,28,29 are realized by selective stabilization of one of the assemblies by an external stimulus, such as a template guest molecule and solvent, changing the relative free-energy differences among the isomers. A building block with low symmetry30,31,32,33,34,35,36,37,38 enables the generation of many configurational isomers whose components are spatially arranged in different ways (Fig. 1). When a mixture of configurational isomers is generated in equilibrium, their free-energy differences are small because of their structural similarities. In such cases, an elaborate guest molecular design allows a single isomer to be differentiated from other isomers. This guest-inducing approach has been applied to the self-assembly of meal–organic cages, in which low-symmetric organic ligands are arranged in several defined ways under the requirement of complementarity between the coordination vectors of the metal ion and organic ligand39,40,41,42,43,44,45,46,47. Induced-fit selection of a single configurational isomer from a dynamic library has been achieved, but the amplification of two or more types of species separately by different guest molecules and/or conditions is challenging (Fig. 1). Such multiadaptive behavior endows simple molecular systems with highly responsive properties for amplifying different outputs, enabling signal transduction in response to different inputs.

Fig. 1: Schematic representation of multivariant optimization in a dynamic library composed of configurational isomers.
Fig. 1: Schematic representation of multivariant optimization in a dynamic library composed of configurational isomers.
Full size image

The desymmetrized building blocks (blue) allow the generation of five types of configurational isomers of the assembly (square-shaped tetramers in this case). Because their free-energy differences are small, a mixture of the isomers (dynamic library) is generated. When a certain guest molecule (brown rod) two or three of which are bound in the assemblies, one type of the isomer is preferentially selected (left). In this selection process, (1) the induced configurational isomer (head-to-tail in the left), (2) the number of guest molecules (three in the left), and (3) their spatial arrangement in the cluster (stack in the left) are determined by multivariant optimization. Such an optimization process enables the selection of another configurational isomer (head-to-head) binding two slip-stacked guest molecules when a different guest molecule (green rectangle) is added (right).

In a general dynamic library, a certain structure is induced by a template guest molecule. In other words, the shape of the guest molecule determines the assembled structure. When several guest molecules are accommodated in the cavity of an assembly, the spatial arrangement of the guest molecules in the cluster also contributes to the selection process. In this case, the number of guest molecules and their spatial arrangement are affected by the assembled structure (Fig. 1). Therefore, implementing such multivariant optimization in molecular systems is a significant step toward increasing complexity and would make it possible to precisely control the selection of several types of configurational isomers.

Molecular self-assembly of C2v-symmetric gear-shaped amphiphile (GSA), 12+ (Fig. 2a) provides a single isomer of a cube-shaped hexameric assembly (nanocube) with an S6 symmetry (Fig. 2b)48,49, where the configuration of the six GSAs in each face of the cube is unambiguously determined by previous single-crystal X-ray analysis50 (Fig. 2c). Desymmetrization of the GSA from C2V into CS symmetry (12+22+) generates two options for the arrangement of each GSA in the face of the cube (Fig. 2d), resulting in a dynamic library consisting of 16 configurational isomers of nanocubes (Fig. 2e and f).

Fig. 2: Self-assembly of cube-shaped assembly (nanocube) from gear-shaped amphiphiles (GSAs).
Fig. 2: Self-assembly of cube-shaped assembly (nanocube) from gear-shaped amphiphiles (GSAs).
Full size image

a Chemical structures of C2v- and CS-symmetric GSAs, 12+ and 22+. b Schematic representation of the [16]12+ nanocube in analogy to the Earth. c The geometry net of the [16]12+ nanocube. d Two possible arrangements of a GSA in the face of the cube, E- and P-arrangements. In the E-arrangement, Me (green) is located near the equator (left), whereas Me is positioned around the pole in the P-arrangement (right). E and P stand for Equator and Pole, respectively e Two types of S6-symmetric [26]12+ nanocubes, in which all six GSAs are located in E- or P-arrangement. f The geometry nets of possible configurational isomers except E- and P-[26]12+ shown in panel e. g Energy-minimized structure of [16]12+, in which one of six pairs of cation-π interactions working between blue and yellow aromatic rings is highlighted by a cyan rectangle. The colors of the segments of GSAs correspond to those of chemical structure of GSA 12+ shown in Fig. 2a. h A geometry net of the nanocube assembled from C2v-symmetric 12+ to show the cation-π interactions in the triply stacked pyridinium-phenyl-pyridinium rings (broken lines).

Herein, we report the selection of two types of configurational isomers (E- and P-types in Fig. 2e) from a dynamic library generated via the self-assembly of CS-symmetric GSA 22+ using different guest molecules. The two types of nanocubes differ in the size and shape of their inner cavities and were selected by different guest molecules. Because the two types of the nanocubes were induced by a cluster of guest molecules (dimer or trimer), the best isomer was found in the energy landscape by multivariant optimization of the arrangement of the GSAs in the nanocube and the arrangement of the guest molecules in the cluster. Furthermore, certain guest molecules induced both E- and P-type nanocubes, and their distributions were controlled by the concentration of the guest molecules and the temperature, leading to a different selection system in which the two types of configurational isomers were selected using one kind of guest molecules under different stimuli.

Results

Design of gear-shaped amphiphile

C2v-symmetric GSA 12+ assembled into a single isomer of the nanocube, [16]12+, with S6 symmetry in water (Fig. 2b and c)49. The driving force of this self-assembly is the hydrophobic effect, cation-π interactions between pyridinium and phenyl groups (Fig. 2g and h), and van der Waals interactions among gear-shaped hydrophobic surfaces50,51,52,53. In analogy to the Earth, there are the poles and the equator in the nanocube. The poles are the two vertices through which the S6 axis penetrates, and the equator is the six sides that do not contain the pole (Fig. 2b). In order to maintain six sets of cation-π interactions (pyridinium-phenyl-pyridinium) in the nanocube (yellow and blue parts in Fig. 2g and broken lines in Fig. 2h), the arrangement of each 12+ in the face of the cube is strictly determined; Me in 12+ (Fig. 2a) is placed near the equator (red lines in Fig. 2c), and one of two Me groups is placed around the poles (Fig. 2c).

When GSA is desymmetrized into CS by replacing one of the p-tolyl groups in 12+ with a methyl group to form a new GSA, 22+, there are two possibilities of the arrangement of 22+ in the face of the cube: one with Me near the equator (E-arrangement) or the other with Me near the pole (P-arrangement) (Fig. 2d). Consequently, a new GSA, 22+, generates 16 configurational isomers with E- or P-arrangement of the six GSAs in the nanocube (Fig. 2e and f). Among them, the nanocubes six GSAs of which are all E- or P-arrangement (E-type and P-type nanocubes) possess an S6 symmetry, and all the six GSAs in them are chemically equivalent (Fig. 2e).

Synthesis of GSA and its self-assembly

GSA 2·Cl2 was synthesized according to Jux’s approach54 in 11 steps in total 3.7% yield (Supplementary Figs. 1-10). The 1H NMR spectrum of 2·Cl2 in CD3OD showed the monomer state (Fig. 3 and Supplementary Fig. 11a). When 2·Cl2 was dissolved in D2O, a complicated, broad 1H NMR spectrum was obtained (Fig. 3 and Supplementary Fig. 11b). Some of aromatic and methyl protons were upfield shifted, indicating the aggregation of the GSA by the hydrophobic effect. 1H DOSY spectrum of the D2O solution showed that all the signals have the same diffusion coefficient of 1.1 × 10–10 m2 s–1 (Supplementary Fig. 12), which suggests the formation of 2-nm-sized assemblies, which is consistent of the size of the [16]12+ nanocube. Thus, GSA 22+ generates a dynamic library composed of the configurational isomers of the [26]12+ nanocube. Because of the complicated 1H NMR spectrum and low sensitivity of DOSY measurements for molecules of similar sizes, the formation of oligomers other than hexamers in the library is not excluded.

Fig. 3: Selection of two types of configurational isomers of the nanocubes by guest molecules.
Fig. 3: Selection of two types of configurational isomers of the nanocubes by guest molecules.
Full size image

a 1H NMR spectra (500 MHz, 298 K) of monomer GSA (22+) in CD3OD, a mixture of configurational isomers of the [26]12+ nanocubes, (G1)2E-[26]12+, and (G2)2P-[26]12+ in D2O. The assignment of the 1H NMR signals of (G1)2E-[26]12+, and (G2)2P-[26]12+ are shown in Supplementary Figs. 10c and 20, respectively. b List of guest molecules that induced E-[26]12+, P-[26]12+, and both types and their volumes. The volumes are indicated for a single guest molecule. The numbers of molecules encapsulated in [26]12+, n, are shown when determined.

Selection of two types of configurational isomers

When 1,3,5-tribromobenzene (G1) was added to a solution of the library, the 1H NMR spectrum was dramatically changed to show a very clear 1H NMR pattern (Fig. 3 and Supplementary Fig. 11c). 1H DOSY spectroscopy indicated that all signals had the same diffusion coefficient of 1.2 × 10–10 m2 s–1 (Supplementary Fig. 13), which corresponds to an entity with 1.6 diameters. A significant feature of the 1H NMR spectrum is the appearance of only one set of Me, Me, and Me signals, which indicates the selection of one type of S6-symmetric nanocube (E- or P- type in Fig. 2f). To determine which type of nanocube was selected, 1H-1H NOESY spectrum was measured (Fig. 4a and Supplementary Fig. 14). The NOE between the Me and Me signals was observed. Because Me is fixed near the equator, this NOE correlation indicates that Me is placed near the equator, which corresponds to E-type (E-[26]12+). The number of G1 encapsulated in E-[26]12+ was determined to be two by integrating the 1H signal for bound G1. The assignment of the 1H NMR signals of (G1)2E-[26]12+ is shown in Supplementary Fig. 11c, which was determined by 1H-1H COSY and 1H-1H NOESY (Supplementary Figs. 14 and 15) spectroscopy. Fourier transform ion cyclotron resonance (FT-ICR) mass spectrometry (negative ion mode) of an aqueous solution of (G1)2E-[26]Cl12 with CH3SO3H as an additive detected doubly and triply charged anionic species of [26(Cl)x(CH3SO3)y]12–xy (x = 6–11 and y = 3–9) (Supplementary Fig. 16). Unfortunately, the signals for E-[26]12+ with the bound guest molecules (G1) could not be detected. Other guest molecules that induced E-[26]12+ are listed in Fig. 3b and their 1H NMR spectra are shown in Supplementary Figs. 17 and 18.

Fig. 4: 1H−1H NOE spectra of E- and P-type nanocubes (500 MHz, 298 K, upfield region).
Fig. 4: 1H−1H NOE spectra of E- and P-type nanocubes (500 MHz, 298 K, upfield region).
Full size image

a E-[26]12+ binding two molecules of G1. b P-[26]12+, which binds two molecules of G2.

In the screening of guest molecules, we encountered another type of a simple 1H NMR pattern when 9,10-anthraquinone (G2) was added to the library (Fig. 3 and Supplementary Fig. 19). 1H DOSY spectroscopy indicated that all 1H NMR signals showed the same diffusion coefficient of 1.1 × 10–10 m2 s–1 (Supplementary Fig. 20), which corresponds to an entity with 1.7 diameters. One set of Me, Me, and Me 1H NMR signals were observed, but their chemical shifts are different from those of E-[26]12+ induced by G1. Then, 1H-1H NOESY spectrum was measured to determine which type of S6-symmentric [26]12+ was assembled. A D2O solution of 22+ in the presence of G2 showed NOE between Me and Me signals (Fig. 4b and Supplementary Fig. 21), indicating that Me is placed near the equator, thus Me is placed near the pole, which corresponds to P-type (P-[26]12+). The number of G2 encapsulated in P-[26]12+ was estimated to be two by integrating the 1H NMR signal of the bound G2 (Fig. 3a). Some of the 1H NMR signals of (G2)2P- [26]12+ were assigned by 1H-1H COSY and 1H-1H NOESY spectroscopy (Supplementary Figs. 21 and 22). FT-ICR mass spectrometry of an aqueous solution of (G2)2P-[26]Cl12 was also conducted under the same conditions as E-[26]Cl12. Although the signals of the doubly and triply charged anionic species of the free nanocubes were detected, the signals of the host-guest complex could not be found (Supplementary Fig. 23). Other guest molecules that induced P-[26]12+ are listed in Fig. 3b and their 1H NMR spectra are shown in Supplementary Fig. 24. Consequently, two types of configurational isomers were separately selected from an artificial dynamic library using different guest molecules.

Two types of S6-symmetric nanocubes (E-[26]12+ and P-[26]12+) were selected by different guest molecules (G1 and G2). The difference between E-[26]12+ and P-[26]12+ stems from the position of Me near the pole or equator (Fig. 2f), which would affect the size and shape of the nanocubes and their cavities. In the original [16]12+ nanocube, 12 p-tolyl methyl groups (six Me and six Me) are placed near the equator (Fig. 2c). The same is true in P-[26]12+, whereas six Me and six Me are near the equator in E-[26]12+ (Fig. 2f). Thus, geometric interactions of the GSAs near the equator in P-[26]12+ would be similar to those of [16]12+. Attempts of single crystallization of the two types of nanocubes under various conditions failed, so the difference between E-[26]12+ and P-[26]12+ was discussed based on their optimized structures, comparing with [16]12+ with a spherical cavity (Fig. 5c). E-[26]12+ is a closed structure where the holes around the poles are very small (Fig. 5a), whereas P-[26]12+ has large openings around the poles (Fig. 5b).

Fig. 5: Structures of nanocubes.
Fig. 5: Structures of nanocubes.
Full size image

ac Optimized structures of (a) E-[26]12+, (b) P-[26]12+, and (c) [16]12+. Views from the pole (above) and from the equator (below) are shown. The colors of the segments of GSAs correspond to those of chemical structure of GSA 22+ shown in Fig. 2a. d Definition of the equator length (purple broken line). e Schematic representation of cube-shaped [16]12+ and P-[16]12+ and a distorted cube of E-[26]12+.

The cavity size of E-[26]12+, 513.1 Å3, is smaller than that of [16]12+, 562.0 Å3, whereas that of P-[26]12+, 599.2 Å3, is larger than that of [16]12+ (Table 1). The spatial arrangement of the six GSAs in the nanocubes can be assessed by the equator length which is defined as the sum of six sides of the hexagonal ring created by six Me groups along the equator (Fig. 5d). The equator lengths of E-[26]12+, 60.8 Å, is much shorter than that of [16]12+, 64.6 Å (Table 1), which is consistent with the smaller cavity volume of E-[26]12+ than that of [16]12+. Although the equator length of P-[26]12+ is shorter than that of [16]12+, the cavity size of P-[26]12+ is larger than that of [16]12+, which is probably because of large openings in P-[26]12+.

Table 1 List of inner volume and equator length of nanocubes

Next, the distances between the two poles (dp) in E-[26]12+ and [16]12+ were compared (Table 1). dP is defined as the distance between the two carbon atoms of the Me groups around the poles. dp in E-[26]12+, 20.3 Å, is similar to that in [16]12+, 20.6 Å. Therefore, E-[26]12+ is distorted from the cube so that the equator length becomes shorter maintaining dp (Fig. 5e), which results in an ellipsoidal cavity. The distortion of P-[26]12+ from [16]12+ is smaller than that of E-[26]12+ (Supplementary Figs. 25 and 26). A large distortion of E-[26]12+ from the cube is mainly due to the lack of six p-tolyl groups in the equator. These results indicate that the partial structure located near the equator strongly affects the nanocube structure.

It should be noted that because the GSAs in the nanocubes only mesh with each other without directional bonds, the nanocubes has high adaptability, changing the size and shape of their inner cavities in response to the guest molecules55. Thus, the cavity sizes of the nanocubes are variable in certain ranges. We investigated the relationship between the cavity volumes and total guest volumes based on the energy-minimized structures of the E-and P-nanocubes obtained by molecular mechanics calculations (Supplementary Tables 1 and 2). The cavity volume of E-[26]12+ ranged from 471 to 536 Å3, whereas that of P-[26]12+ ranged from 655 to 731 Å3, exhibiting a rough linear correlation with the total guest volume (Fig. 6a). This result indicates that [26]12+ is highly adaptive, consistent with our previous study on [16]12+. The average occupancies of the guest molecules in the cavities of E- and P-[26]12+ are 52.9 ± 8.3% and 57.9 ± 1.8%, respectively, which are consistent with Rebek’s 55% rule56.

Fig. 6: Relationship between the cavity volume and shape of the E- and P-nanocube and guest molecules.
Fig. 6: Relationship between the cavity volume and shape of the E- and P-nanocube and guest molecules.
Full size image

a Plot of the cavity volume of the E- and P-nanocubes versus the total guest volume of the guest molecules (Supplementary Tables 1 and 2). The cavity volumes were estimated from the energy-minimized structures of (G)2E-[26]12+ and (G)3P-[26]12+ using molecular mechanics calculations. There is a clear boundary between the E- and P-nanocubes in terms of the cavity volume and total guest volume. b The preference of the E- and P-nanocubes for disubstituted benzene derivatives. The ortho- and meta-isomers prefer the E-nanocube, whereas the para-isomer prefers the P-isomer. Functional groups also affect preferences. 1H NMR spectra for these results are shown in Supplementary Figs. 3035.

Interconversion between two types of configurational isomers

We found some of guest molecules induced both E-[26]12+ and P-[26]12+ (Figs. 3b and 7). As an example, when 4 equiv. of G3 was added to the library, E-[26]12+ was mainly induced (Fig. 6b and Supplementary Fig. 27), whereas excess G3 induced P-[26]12+. Considering that the cavity volume of P-[26]12+ is larger than that of E-[26]12+, the numbers of guest molecules encapsulated in E-[26]12+ and P-[26]12+ would be different. In this equilibrium (Fig. 7a and Eq. 1), high concentrations of G3 prefer P-[26]12+.

$$({{{\bf{G}}}}{{{\bf{3}}}})_{2} \, {{\subset }} \, E-{[{{{{\boldsymbol{2}}}}}_{6}]}^{12+}+{{{\bf{G}}}}{{{\bf{3}}}} \, \rightleftarrows \, ({{{\bf{G}}}}{{{\bf{3}}}})_{3} \, {{\subset }} \, P-{[{{{{\boldsymbol{2}}}}}_{6}]}^{12+}$$
(1)
Fig. 7: Interconversion between E- and P-types nanocubes by change in the concentration of guest molecules and the temperature.
Fig. 7: Interconversion between E- and P-types nanocubes by change in the concentration of guest molecules and the temperature.
Full size image

a The equilibrium between E- and P-[26]12+ in the presence of G3. b 1H NMR spectra (500 MHz, D2O, 298 K) of (G3)2E-[26]12+ and (G3)3P-[26]12+, which were induced by the guest concentration change of G3. At low concentration [G3]/[26] = 4, E-type was induced, whereas excess G3 ([G3]/[26] = 48) induced P-type, because the number of G3 encapsulated in P-type nanocube is more than that in E-type. c The interconversion induced by the temperature change. E-type nanocube was preferred at high temperatures. Thermodynamic parameters of the equation in (a) determined by van’t Hoff analysis are shown.

As the total numbers of species on the left and right sides of this equilibrium differ, temperature should affect the equilibrium. When a mixture of E-[26]12+ and P-[26]12+ in about 1:1 ratio prepared at 298 K was heated, the equilibrium shifted toward E-[26]12+ (Fig. 7c and Supplementary Fig. 28). The thermodynamic parameters of the equation were determined by van’t Hoff analysis (ΔH = –36.8 kJ mol–1 and ΔS = –81.5 J mol–1 K–1) (Supplementary Fig. 29), indicating that P-[26]12+ is entropically disfavored compared with E-[26]12+, which is consistent with the idea that the numbers of G3 encapsulated in the two types of nanocubes are different.

The numbers of G3 in the two types of [26]12+ could not be determined by 1H NMR spectroscopy because the 1H NMR signals of bound G3 were not observed (Fig. 7b). This might be due to fast exchange of the guests between in and out of the nanocubes on the NMR time scale. Based on the volume of G3, 152 Å3, two (304 Å3) and three (456 Å3) molecules would be encapsulated in E-[26]12+ and P-[26]12+, respectively. Other molecules that induced both E- and P-type isomers similar to G3 are listed in Fig. 3b (Supplementary Fig. 30). Under the assumption that two and three guest molecules are encapsulated in E- and P-[26]12+, respectively, the cavity volumes of (G)2E-[26]12+and (G)3P-[26]12+ were calculated from the energy-minimized structures. The cavity volumes linearly correlated with the total guest volumes (Fig. 6a), and the occupancy of the cavity by the guest molecules in E- and P-[26]12+ ranged from 53.8 ± 2.6% and 58.1 ± 4.0%, respectively. These results are consistent with Rebek’s 55% rule and support the assumption that two and three guest molecules are bound in E- and P-[26]12+.

Focusing on the guest molecules two or three of which are encapsulated in E-[26]12+ and P-[26]12+ (Fig. 3b), guest molecules whose volumes ranged from 84 to 161 Å3 tend to induce E-[26]12+, whereas large guest molecules with volumes of 187–210 Å3 prefer P-[26]12+, and guest molecules with intermediate volumes (123–160 Å3) induced both E- and P-nanocubes. This result is consistent with the difference in cavity size between them. However, the boundary between the guest molecules that induce E- or P-nanocube and those that induce both E- and P-nanocubes is unclear. This is mainly because the shape of the guest cluster, which is induced by the nanocubes, also plays a key role in the selection of the configurational isomers.

To confirm this idea, ortho-, meta-, and para-isomers of disubstituted benzene derivatives were investigated for the induction of the E- and P-isomers (Fig. 6b). As mentioned above, G3 (methyl para-toluate) induces both E- and P-isomers (Fig. 7). The volumes of ortho- and meta- toluates are very similar to that of para-toluate, but ortho- and meta- toluates induced only the E-nanocube (Supplementary Figs. 3133). This result indicates that not only the total volume of the guest cluster but also its shape affects the selection process and that rod-shaped guest molecules (para-isomers) can induce both E- and P-nanocubes (Fig. 3b). It was found that toluic acid isomers exhibit different preferences (Fig. 6b and Supplementary Figs. 3436). Like methyl ortho-toluate, ortho-toluic acid induced only the E-nanocube. meta-Tolic acid mainly induced the E-nanocube, but its induction ability was weaker than that of ortho-toluic acid. para-Toluic acid induced mainly the P-nanocube and did not show switching of the induction of the E- and P-nanocubes depending on the guest concentration. These results indicate that functional groups (carboxylic acid and ester) also affect the selection of the two types of nanocubes and that such a complicated multivariant optimization process realizes the selection of two types of configurational isomers.

Discussion

In conclusion, a dynamic combinatorial library of configurational isomers for molecular self-assembly was created by the rational design of a low-symmetric gear-shaped amphiphile, 22+. Without an external stimulus, a complicated mixture of 16 configurational isomers was generated. Upon adding G1 to the library, an E-type nanocube (E-[26]12+) was selected exclusively. The cavity of E-[26]12+ is smaller than that of the original [16]12+, and its shape is ellipsoid because of the distortion of E-[26]12+. In contrast, G2 selectively induced P-[26]12+, whose structure is similar to [16]12. The structural difference between E-[26]12+ and P-[26]12+ originates from the difference in meshing among the six GSAs near the equator, which resulted in differences in the size and shape of their cavities. The selection process in this system is complicated because the energy minimum is determined by the mutual optimization of the spatial arrangements of both the GSAs in the nanocube and the guest molecules in the guest cluster. This differs from induced-fit selection systems where a host structure is induced by a guest molecule with a defined shape. Such multivariate optimization processes are key to creating complicated phenomena in living systems.

Methods

General information

1H, 13C, and 2D NMR spectra were recorded using a Bruker AV-500 (500 MHz) spectrometer. Chemical data are reported in parts per million (ppm, δ scale) downfield from tetramethylsilane (δ 0.00) and are referenced to proton resonance of a residual peak, CD3OD (δ 3.31 ppm) and D2O (δ 4.79 ppm). The data are presented as follows: chemical shifts, multiplicity (s = singlet, d = doublet, t = triplet, q = quartet and/or multiple resonances), coupling constants in Hertz (Hz), and integration. High-resolution mass spectra (HRMS) were obtained using a Water Xevo G2-S Tof mass spectrometer. CSI FT-ICR Mass spectra were obtained using Bruker scimaX. Column chromatography was carried out with Merck Silica gel 60 (0.063–0.200 nm) otherwise noted. Gel permeation liquid chromatography (GPC) was performed using a Japan Analytical Industry LC-9204 with JAIGEL 2HR + 2.5HR columns using chloroform as a solvent. Molecular mechanics calculations were performed using Materials Studio software (BIOVIA) with COMASS III force field.

Materials

Unless otherwise noted, all solvents and reagents were obtained from commercial suppliers (TCI Co., Ltd., WAKO Pure Chemical Industries Ltd., KANTO Chemical Co., Inc. and Sigma-Aldrich Co.) and were used as received. IRA 400 J Cl© was purchased from Organo Co. and washed with hexane, CH2Cl2, CH3CN, iso-propanol, MeOH, water, and brine twice before use.

Synthesis of gear-shaped amphiphile 2·Cl2

GSA 2·Cl2 was synthesized according to Supplementary Fig. 1.

Synthesis of compound 3

A solution of Br2 (2.8 mL, 55 mmol) in CH3COOH (30 mL) was added dropwise into a suspension of 2-methyl-4-nitroaniline (7.6 g, 50 mmol) in CH3COOH (100 mL). After the reaction mixture was stirred at room temperature of 12 h, then the mixture was poured into the ice water to give a yellow precipitate. The precipitate was collected by filtration, washed with H2O, and dried in a desiccator. Compound 3 was obtained (10.6 g, 45.9 mmol, 92%) as a yellow solid. 1H NMR (500 MHz, CDCl3, 298 K): δ 8.28 (d, J = 2.5 Hz, 1H), 7.96 (d, J = 2.0 Hz, 1H), 4.77 (s, 2H), 2.28 (s, 3H). The spectral data were in good agreement with the previous literature57.

Synthesis of compound 4

A suspension of compound 3 (5.1 g, 22 mmol), 4-methylphenylboronic acid (3.3 g, 24 mmol), Pd(PPh3)4 (380 mg, 0.33 mmol), Na2CO3 (4.7 g, 44 mmol) in mix solvent of toluene (100 mL), ethanol (20 mL), and water (30 mL) was degassed and heated at 90 °C for 12 h. After the reaction, the mixture was cooled to room temperature and quenched with water. The mixture was extracted with chloroform three times. The organic phase was washed with brine and dried over anhydrous MgSO4. After filtered off to remove MgSO4 and insoluble materials, the solvent was removed in vacuo. The crude product was purified by silica gel column chromatography (hexane/EtOAc = 4:1) to give product 4 (5.2 g, 21.5 mmol, 98 %) as a yellow solid. 1H NMR (500 MHz, CDCl3, 298 K): δ 8.00 (d, J = 2.5 Hz, 1H), 7.94 (d, J = 2.5 Hz, 1H), 7.30 (s, 4H), 4.42 (s, 2H), 2.42 (s, 3H), 2.26 (s, 3H). 13C NMR (125 MHz, CDCl3, 298 K): δ 148.5, 138.7, 138.3, 134.5, 130.1, 129.0, 126.3, 125.6, 124.9, 121.4, 21.4, 17.9. HR-ESI-TOF-MS (m/z): [M + H]+ calcd. for C14H15N2O2, 243.1129; found, 243.1144.

Synthesis of compound 5

Isoamyl nitrite (4.5 mL, 33 mmol) was added dropwise to a suspension of compound 4 (5.0 g, 20.6 mmol) and CuBr2 (6 g, 27 mmol) in CH3CN (50 mL). The reaction mixture was stirred at 70 °C for 12 h. After the mixture was cooled to room temperature, quenched with 10% ammonia solution, and extracted with EtOAc, the organic phase was washed with brine and dried over anhydrous MgSO4. After filtration, the solvent was removed in vacuo to give product 5 (5.7 g 18.6 mmol 90%) as a pale yellow solid. 1H NMR (500 MHz, CDCl3, 298 K): δ 8.08 (d, J = 2.5 Hz, 1H), 7.99 (d, J = 2.5 Hz, 1H), 7.27 (s, 4H), 2.59 (s, 3H), 2.43 (s, 3H). 13C NMR (125 MHz, CDCl3, 298 K): δ 146.5, 144.9, 140.9, 138.5, 137.4, 133.1, 129.2, 129.1, 123.6, 123.2, 24.7, 21.5. HR-ESI-TOF-MS (m/z): [M]+ calcd. for C14H1281BrNO2, 307.0031; found, 307.0008.

Synthesis of compound 6

A suspension of compound 5 (5.5 g, 18 mmol), 4-biphenylboronic acid (4.3 g, 22 mmol), Pd(PPh3)4 (312 mg, 0.27 mmol), Na2CO3 (4.3 g, 40 mmol) in mix solvent of toluene (100 mL), ethanol (20 mL), and water (30 mL) was degassed and heated at 90 °C for 12 h. After the mixture was cooled to room temperature and quenched with water, the reaction mixture was extracted with CH2Cl2. The organic phase was washed with brine and dried over anhydrous MgSO4. After filtration, the solvent was removed in vacuo. The crude product was purified by silica gel column chromatography (hexane/CHCl3 = 4:1) to afford product 6 (6.2 g, 16.3 mmol, 91 %) as a colorless solid. 1H NMR (500 MHz, CDCl3, 298 K): δ 8.13 (t, J = 3.3 Hz, 2H), 7.61 (d, J = 7.5 Hz, 2H), 7.53 (d, J = 8.0 Hz, 2H), 7.44 (t, J = 7.5 Hz, 2H), 7.35 (t, J = 7.5 Hz, 1H), 7.09 (d, J = 8.0 Hz, 2H), 6.98 (s, 4H), 2.30 (s, 3H), 2.27 (s, 3H). 13C NMR (125 MHz, CDCl3, 298 K): δ 147.0, 146.9, 143.2, 140.5, 140.0, 138.9, 137.7, 137.0, 130.2, 129.6, 129.0, 128.8, 127.6, 127.1, 126.8, 123.5, 122.8, 21.7, 21.2. HR-ESI-TOF-MS (m/z): [M]+ calcd. for C26H21NO2, 379.1572; found, 379. 1572.

Synthesis of compound 7

A solution of compound 6 (4.6 g 12.1 mmol) and SnCl2·2H2O (13.5 g 60 mmol) in EtOH (50 mL) and EtOAc (50 mL) was stirred at 80 °C for 12 h. After the mixture was cooled to room temperature and quenched with ice water. The pH was adjusted to slightly alkaline conditions by the addition of 10% NaOH aqueous solution. The layers were separated, and the aqueous phase was extracted with EtOAc. The combined organic phase was washed with brine and dried over anhydrous MgSO4. After filtration, the solvent was removed in vacuo to give product 7 (4.0 g, 11.4 mmol, 94 %) as a colorless solid. 1H NMR (500 MHz, CDCl3, 298 K): δ 7.60 (d, J = 7.2 Hz, 2H), 7.44 (d, J = 8.2 Hz, 2H), 7.41 (t, J = 7.7 Hz, 2H), 7.31 (t, J = 7.4 Hz, 1H), 7.08 (d, J = 8.2 Hz, 2H), 6.97 (d, J = 8.1 Hz, 2H), 6.92 (d, J = 8.0 Hz, 2H), 6.65 (d, J = 2.2 Hz, 1H), 6.61 (d, J = 2.3 Hz, 1H), 3.69 (s, 2H), 2.24 (s, 3H), 2.14 (s, 3H). 13C NMR (125 MHz, CDCl3, 298 K): δ 142.8, 141.1, 139.8, 137.9, 137.2, 131.66, 131.63, 131.0, 129.7, 128.8, 128.3, 127.16, 126.99, 126.3, 115.8, 114.7, 100.30, 100.14, 21.5, 21.2. HR-ESI-TOF-MS (m/z): [M + H]+ calcd. for C26H24N, 350.1904; found, 350. 1875.

Synthesis of compound 8

A solution of Br2 (1.9 mL, 37 mmol) in CHCl3 (30 mL) was added dropwise into a suspension of compound 7 (3.9 g, 11.1 mmol) in CH3COOH (60 mL) and CHCl3 (30 mL). After the reaction mixture was stirred at room temperature for 12 h, the mixture was quenched with aqueous Na2SO3 solution. The organic layer was separated, and the aqueous layer was extracted with CHCl3. The combined organic phase was washed with brine and dried over anhydrous MgSO4. After filtration, the solvent was removed in vacuo to give the product 8 (5.4 g, 10.6 mmol, 96%) as a yellow solid. 1H NMR (500 MHz, CDCl3, 298 K): δ 7.54 (d, J = 7.0 Hz, 2H), 7.37-7.41 (m 4H), 7.30 (t, J = 7.5 Hz, 1H), 6.95 (t, J = 8.0 Hz, 4H), 6.86 (d, J = 8.0 Hz, 2H), 4.82 (s, 2H), 2.23 (s, 3H), 2.19 (s, 3H). 13C NMR (125 MHz,CDCl3, 298 K): δ 141.9, 141.4, 140.8, 139.9, 138.7, 138.2, 136.4, 136.0, 133.0, 131.1, 129.8, 128.8, 128.3, 127.3, 127.0, 126.2, 111.4, 108.6, 22.6, 21.4. HR-ESI-TOF-MS (m/z): [M + H]+ calcd. for C26H22N79Br81Br, 507.0020; found, 507.0014.

Synthesis of compound 9

A suspension of compound 8 (2.2 g, 4.33 mmol), 4-(3-pyridyl)phenylboronic acid (2 g, 10 mmol), Pd(dppf)Cl2·CH2Cl2 (220 mg, 0.27 mmol), and K3PO4 (3.6 g, 17 mmol) in toluene (40 mL) and water (4 mL) was degassed and heated at 100 °C for 12 h in a sealed tube. After the mixture was cooled to room temperature and quenched with water, the reaction mixture was extracted with CH2Cl2. The organic phase was washed with brine and dried over anhydrous MgSO4. After filtration, the solvent was removed in vacuo. The crude product was purified by silica gel column chromatography (n-hexane/EtOAc = 1:1) and GPC to give product 9 (1.64 g, 2.5 mmol, 58%) as a yellow solid. 1H NMR (500 MHz, CDCl3, 298 K): δ 8.95 (d, J = 1.5 Hz, 1H), 8.80 (d, J = 1.5 Hz, 1H), 8.62 (d, J = 3.5 Hz, 1H), 8.55 (d, J = 3.5 Hz, 1H), 7.96 (d, J = 8.0 Hz, 1H), 7.83 (d, J = 8.0, 1H), 7.76 (d, J = 8.0 Hz, 2H), 7.57 (t, J = 7.5 Hz, 4H), 7.45 (d, J = 8.0 Hz, 2H), 7.38–7.41 (m, 5H), 7.33–7.28 (m, 5H), 7.12 (d, J = 8.0 Hz, 2H), 6.74 (d, J = 8.0 Hz, 2H), 6.66 (d, J = 8.0 Hz, 2H), 3.49 (s, 2H), 2.07 (s, 3H), 1.91 (s, 3H). 13C NMR (125 MHz, CDCl3, 298 K): δ 148.8, 148.49, 148.45, 148.31, 141.0, 140.6, 139.1, 138.7, 138.1, 137.4, 137.0, 136.35, 136.30, 135.8, 134.75, 134.57, 134.43, 134.2, 132.1, 131.9, 131.6, 131.16, 131.00, 128.8, 128.2, 127.6, 127.12, 127.04, 126.94, 126.5, 126.1, 124.5, 123.76, 123.62, 21.2, 19.9. HR-ESI-TOF-MS (m/z): [M + H]+ calcd. for C48H38N3, 656.3061; found, 656.3069.

Synthesis of compound 10

Isoamyl nitrite (760 μL, 5.6 mmol) was added dropwise to a suspension of compound 9 (1.42 g, 2.16 mmol) and CuBr2 (1.2 g, 5.4 mmol) in CH3CN (40 mL). The reaction mixture was stirred at 70 °C for 12 h. After the mixture was cooled to room temperature and quenched with 10% NH3 aqueous solution, the reaction mixture was extracted with EtOAc. The organic phase was washed with brine and dried over anhydrous MgSO4. After filtration, the solvent was removed in vacuo. The crude product was purified by silica gel column chromatography (n-hexane/EtOAc = 1:1) to give the product 10 (833 mg, 1.16 mmol, 54%) as a colorless solid. 1H NMR (500 MHz, CDCl3, 298 K): δ 8.97 (d, J = 1.9 Hz, 1H), 8.81 (d, J = 1.9 Hz, 1H), 8.61 (dd, J = 4.8, 1.5 Hz, 1H), 8.54 (dd, J = 4.8, 1.5 Hz, 1H), 7.99 (dt, J = 8.0, 1.9 Hz, 1H), 7.84 (dt, J = 8.0, 1.9 Hz, 1H), 7.74 (d, J = 8.2 Hz, 2H), 7.57-7.55 (m, 2H), 7.48 (d, J = 8.2 Hz, 3H), 7.44-7.38 (m, 8H), 7.32-7.30 (m, 3H), 7.22 (d, J = 8.3 Hz, 2H), 7.10 (d, J = 8.2 Hz, 2H), 6.71 (d, J = 8.0 Hz, 2H), 6.67 (d, J = 8.0 Hz, 2H), 2.06 (s, 3H), 1.95 (s, 3H). 13C NMR (125 MHz, CDCl3, 298 K): δ 148.7, 148.5, 148.4, 148.3, 142.5, 142.3, 141.9, 141.8, 141.4, 140.7, 139.9, 139.5, 138.8, 136.81, 136.8, 136.4, 136.3, 136.1, 135.7, 135.3, 134.5, 134.3, 131.5, 131.4, 130.8, 130.7, 130.6, 130.3, 128.9, 128.9, 127.7, 127.4, 127.4, 127.0, 126.4, 126.1, 124.9, 123.7, 123.6, 21.2, 20.9. HR-ESI-TOF-MS (m/z): [M + H]+ calcd. for C48H3679BrN2, 719.2057; found, 719.2085.

Synthesis of compound 11

A suspension of compound 10 (1050 mg, 1.46 mmol), 4-methylphenylboronic acid (992 mg, 7.3 mmol), Pd(dppf)Cl2·CH2Cl2 (120 mg, 0.15 mmol), and K3PO4 (1.43 g, 6.7 mmol) in toluene (20 mL) and water (2 mL) was degassed and heated at 100 °C for 12 h in a sealed tube. After the mixture was cooled to room temperature and quenched with water, the reaction mixture was extracted with CH2Cl2. The organic phase was washed with brine and dried over anhydrous MgSO4. After filtration, the solvent was removed in vacuo. The crude product was purified by silica gel column chromatography (n-hexane/EtOAc = 2:1) and GPC to give product 11 (233 mg, 0.32 mmol, 22%) as a yellow solid. 1H NMR (500 MHz, CDCl3, 298 K): δ 8.82 (d, J = 2.0 Hz, 1H), 8.67 (d, J = 2.0 Hz, 1H), 8.54 (dd, J = 4.8, 1.5 Hz, 1H), 8.47 (dd, J = 4.8, 1.5 Hz, 1H), 7.85 (dt, J = 8.0, 1.9 Hz, 1H), 7.70 (dt, J = 8.0, 1.8 Hz, 1H), 7.58 (d, J = 7.2 Hz, 2H), 7.46-7.39 (m, 7H), 7.34–7.29 (m, 2H), 7.24 (d, J = 8.3 Hz, 3H), 7.19 (d, J = 8.2 Hz, 2H), 7.10 (d, J = 8.3 Hz, 2H), 6.92 (d, J = 8.3 Hz, 2H), 6.76 (d, J = 7.4 Hz, 4H), 6.66 (dd, J = 7.7, 5.0 Hz, 4H), 2.06 (s, 6H), 1.99 (s, 3H). 13C NMR (125 MHz, CDCl3, 298 K): δ 148.40, 148.37, 148.23, 148.10, 141.6, 141.14, 141.02, 140.92, 140.64, 140.60, 140.55, 140.40, 138.50, 138.43, 137.5, 136.4, 135.2, 134.79, 134.74, 134.18, 134.01, 133.95, 132.3, 131.27, 131.14, 130.9, 128.8, 127.61, 127.58, 127.23, 127.08, 126.97, 126.37, 126.28, 126.25, 125.2, 123.58, 123.43, 21.2, 20.1. HR-ESI-TOF-MS (m/z): [M + H]+ calcd. for C55H43N2, 731.3421, found, 731.3443.

Synthesis of compound 2·I2

MeI (852 μL, 13.7 mmol) was added to a solution of compound 11 (100 mg 0.137 mmol) in CH2Cl2 (5 mL). The reaction mixture was stirred at room temperature for 12 h under dark. Then the solvent was removed in vacuo to obtain the 2·I2 (137 mg, 99%), which was used without further purification.

Synthesis of compound 2·Cl2

A suspension of 2·I2 (137 mg, 1.35 mmol) and IRA 400 J Cl© (850 mg) in water (10 mL) were stirred at room temperature for overnight. Then the suspension was filtered, and the filtrate was concentrated in vacuo. The crude material was washed by n-hexane, CHCl3, diethyl ether, and EtOAc twice, respectively to give the product (100 mg, 0.12 mmol, 88 % in 2 steps) as a pale yellow solid. 1H NMR (500 MHz, CD3OD, 298 K): δ 9.24 (s, 1H), 9.08 (s, 1H), 8.82 (d, J = 5.9 Hz, 1H), 8.78 (d, J = 7.8 Hz, 1H), 8.73 (d, J = 5.9 Hz, 1H), 8.60 (s, 1H), 8.08 (t, J = 7.1 Hz, 1H), 7.98 (t, J = 6.6 Hz, 1H), 7.72 (d, J = 8.2 Hz, 2H), 7.53 (s, 2H), 7.47 (s, 2H), 7.39 (t, J = 7.8 Hz, 6H), 7.29 (t, J = 7.2 Hz, 1H), 7.19 (d, J = 7.8 Hz, 2H), 7.14 (d, J = 8.3 Hz, 2H), 6.87–6.83 (m, 4H), 6.68 (t, J = 9.1 Hz, 4H), 4.45 (s, 3H), 4.37 (s, 3H), 2.01 (s, 3H), 2.00 (s, 3H), 1.92 (s, 3H). 13C NMR (125 MHz, CD3OD, 298 K): δ 145.3, 144.97, 144.81, 144.55, 144.50, 144.2, 143.6, 143.2, 142.6, 142.13, 141.97, 141.91, 141.51, 141.44, 141.24, 140.2, 139.3, 138.75, 138.72, 136.2, 135.0, 134.1, 133.0, 132.42, 132.30, 132.25, 131.9, 131.2, 129.8, 129.01, 128.85, 128.61, 128.56, 128.3, 127.83, 127.76, 127.2, 126.5, 21.0, 20.1. HR-ESI-TOF-MS (m/z): [M]2+ calcd. for C57H48N2, 380.1903; found, 380.1879.

General procedure of host-guest complexation between nanocube and guest molecules

For the encapsulation of liquid guest molecules that are insoluble in water, 1 μL of guest molecule was added to a D2O solution of a mixture of configurational isomers of nanocube, [26]12+ ([22+]0 = 1.0 mM), in an NMR tube then sonicated for 10 min. For the encapsulation of solid guest molecules, about 1 mg of the guest molecule was added to a D2O solution of [26]12+ in an NMR tube. The suspension was heated at 70 °C for 2 h. For gaseous molecules, guest molecules were added through bubbling to a D2O solution of [26]12+ in an NMR tube for 1 min. The nanocubes were characterized by various 1H NMR measurements and CSI-FT-ICR mass spectrometry, which are shown in the Supplementary Information.

Computational details

Molecular mechanics calculations (force field: COMPASS III) and calculations of molecular volumes of guest molecules and void spaces of the nanocubes were performed using BIOVIA Materials Studio (Dassault Systèmes). The initial structures of the nanocubes (E-[26]12+ and P-[26]12+) were created from the crystal structure of the nanocube previously reported54 by proper change of the substituents.