Introduction

Of the common architectures in biological systems, helices (e.g., RNA, DNA) are of fundamental importance (e.g., relay chemical information)1,2. Ribonucleic acid (RNA) and deoxyribonucleic acid (DNA) are single- and double-stranded helices, respectively, with the components assembling through hydrogen bonds. Mathematically, the three-dimensional (3D) structure of a helix is derived from the 2D structure of a zig-zag3. The structures of zig-zags and helices are, more generally, examples of skew apeirogons, which are part of a class of infinite 2-polytopes with vertices that are not all colinear (Scheme 1). Progression of the structure of a zig-zag to a helix requires addition of an axis of rotation wherein the vertices are located to lie on the surface of a cylinder3.

Scheme 1
scheme 1

Infinite skew apeirogons. a zig-zag skew and b helical polygon.

In this report, we describe a binary cocrystal with components that self-assemble to progress at the structural level from a zig-zag to a double helix and to a quadruple helix (Scheme 2). The formation of the 1D structures, which are related as supramolecular isomers4, is manifested as three crystal polymorphs (Forms I-III), wherein the components of each trimorph self-assemble through hydrogen bonds. We show the generation of the polymorphs to be influenced by water, with solutions of higher water content promoting the formation of the quadruple helix. We also show mechanochemistry and heat to promote the formation of the double helix and quadruple helix, respectively. While the development of self-assembled zig-zags and helices being ongoing challenges in synthetic chemistry (e.g., catalysis) and materials sciences (e.g., optoelectronics)5,6, we are unaware of a case wherein the components of a purely organic multi-component system, such as a cocrystal, self-assemble to generate zig-zag and helical structures. Related solids composed of metal-organic components, which are sustained by coordination bonds, are reported to form zig-zags and helices in limited cases6.

Scheme 2
scheme 2

Supramolecular isomers as crystal polymorphs. a zig-zag, b double helix, and, c quadruple helix. Rotation axes for Forms II and III as black horizontal lines.

Results and Discussion

The binary cocrystal (f-cat)·(bpe) (where: f-cat = 3-fluorocatechol; bpe = trans-1,2-bis(4-pyridyl)ethylene) is trimorphic7. Each trimorph is obtained through solution-mediated phase transformation8. Specifically, each form is generated from a slightly opened screw-top sample vial involving an equimolar solution of f-cat (42 mg) and bpe (60 mg) in methanol/water (4 mL, 1:1). Form I (yellow plates) is generated in a period of 5 min upon precipitation from a resulting orange solution. Form II (orange-green dichroic prisms) is generated in the same solvent system after a period of 10 h from a green solution. Prolonged suspension of Form II for 20 h results in the generation of Form III (blue needles) from a blue solution.

As chemical building blocks, f-cat and bpe can function as double hydrogen-bond donors and acceptors, respectively. The diol f-cat can furthermore adopt one of nine likely conformations to support a self-assembly process with bpe (Scheme 3).

Scheme 3
scheme 3

Conformations of f-cat (present study highlighted in blue).

Self-assembly of trimorphs

Single-crystal X-ray diffraction (SCXRD) analysis reveals the components of Form I to crystallize in the chiral orthorhombic space group P212121 with two molecules each of f-cat and bpe in the asymmetric unit (Fig. 1). The components form a 1D zig-zag chain along a 21-screw with the f-cat molecules adopting a gauche-syn conformation (Fig. 1a,b) (Table 1). The chains involve two unique (f-cat)·(bpe) units within the zig-zag chain based on a repeat distance slightly under 2.0 nm (19.6 Å). The components are sustained by O-H···N hydrogen bonds [O···N distances (Å): O(1)···N(1) 2.701(9)/O(3)···N(2) 2.698(9) and O(4)···N(3) 2.762(9)/O(2)···N(4) 2.823(10)]. The chains interdigitate to exhibit a tongue-in-groove fit within the crystallographic bc-plane (Fig. 1c).

Fig. 1: X-ray structure Form I.
figure 1

a zig-zag chain, b space-filling of single chain, and c packing of zig-zags.

Table 1 Structural parameters for 1D chains of polymorphs

In principle, the hydrogen bonding of the zig-zag chain can be manifested as two supramolecular isomers7 wherein the F-atoms point either in the same (parallel) or opposite (antiparallel) directions within an individual chain. For Form I, the f-cat components point in opposite directions, interacting with two crystallographically distinct bpe molecules. The shortest hydrogen-bond distances involve the more acidic hydroxyl group adjacent to the F-atom, with the components of the chain twisted (twist angles: 6.7°, 25.6°, 13.9°, 12.3°) about each hydrogen bond.

In contrast to Form I, the components of Form II self-assemble to form a double helix (Fig. 2). The components crystallize in the monoclinic centrosymmetric space group C2/c with one molecule each of f-cat and bpe in the asymmetric unit. As with Form I, the diol and bipyridine assemble via O-H···N hydrogen bonds [O···N distances (Å): O(2)···N(1) 2.7623(2), O(1)···N(2) 2.7067(1)]. The self-assembly generates chains exhibit a longer overall repeat distance on the order of 3.0 nm (31.4 Å) compared to Form I (Fig. 2a, b), with f-cat adopting a gauche-anti conformation (hydroxyl group twist -130°).

Fig. 2: X-ray structure of Form II.
figure 2

a double helix, b space-filling of double helix, and c packing of neighboring helices.

The F-atoms of Form II, in contrast to Form I, point in the same direction (Fig. 2a) along each helix chain, with the helixes running along a crystallographic b-glide plane in a head-to-head arrangement. The components of each strand are more twisted (twist angles: 80.3° and 73.9°) about each hydrogen bond compared to Form I. Offset infinite and face-to-face π-π stacking of bpe [centroid···centroid (Å): 3.867(5)] results in adjacent double helices related by a crystallographic center of inversion (Fig. 2c). The close packing is also mediated by edge-to-face π-interactions of the pyridyl groups and f-cat.

Whereas the components of Form I and Form II form a zig-zag and double helix, respectively, the components of Form III self-assemble to form a quadruple helix. The components crystallize in the trigonal space group R \(\bar{3}\) with one molecule of f-cat and bpe in the asymmetric unit. As with Form I and II, the components generate 1D chains sustained by O-H···N hydrogen bonds [O···N distances (Å): O(1)···N(1) 2.692(1), O(2)···N(2) 2.708(1)] (Fig. 3). The chains participate, in contrast to Form II, as a quadruple helix along the crystallographic c-axis wherein the f-cat molecules adopt an anti-anti conformation. As with Form II, the F-atoms of Form III point in the same direction within a chain (Fig. 3a), with the helices oriented in a head-to-head arrangement. The helical chains tessellate within the ab-plane with packing based on triangles with neighboring helices being of opposite handedness. The F-atoms of the helices converge with F···F interactions at a core separated by 3.820(2) Å. Each helical chain exhibits an overall repeat distance larger than Forms I and II, being on the order of 5.0 nm (48.2 Å) and with the components also twisted (36.8° and 13.3°) from coplanarity.

Fig. 3: X-ray structure Form III.
figure 3

a quadruple helix, b space-filling of showing intertwined hydrogen-bonded chains, c hexagonal packing of neighboring quadruple helices with highlight of F-atoms, and d isolated cluster of F-atoms (inset, space-filling).

Role of solvent

To further elucidate the role of solvent in supporting the formation of the polymorphs, we have determined that when equimolar f-cat and bpe are dissolved in either methanol or water, the zig-zag Form I forms as a precipitate as single crystals and a powder. Prolonged suspension for a period of either 2 or 4 h results in a phase transformation to generate either the double helix Form II or quadruple helix Form III, respectively. Form II predominated in mixed solutions with at least 30% methanol (v:v) while Form III predominated in more aqueous solutions (Scheme 4). The dependence of Form III on the high-water content likely can be ascribed to a hydrophobic packing9 effect involving the F-atoms that converge between the helices in the solid (Fig. 3d). A similar phenomenon has been described in materials composed of fluorinated proteins10.

Scheme 4
scheme 4

Synthetic routes to each polymorph of (f-cat)·(bpe) cocrystal. Neat and liquid assisted grinding (50 μL of methanol) were subjected using a ball mill for up to 30 min.

Form II was also determined to form by neat grinding, as well as liquid assisted grinding (LAG) with methanol11. When powder samples of either Form I or Form III were subjected to the grinding conditions, each solid converted to Form II. Grinding of a sample of Form II alone also did not result in a change in phase. A Hirshfeld analysis of each trimorph revealed C···C contacts to constitute the highest percentage in Form II (9.4%), which is consistent with the presence of the face-to-face π-stacks. Face-to-face stacking is significant in stabilizing the structures of proteins and other biomolecules9. The stability of Form II can be attributed to the presence of the face-to-face π-stacks. The prevalence of the stacking is corroborated with energy framework calculations at the B3LYP/6-311 G(d,p) level using CrystalExplorer where the packing of Form II involves dominant dispersion energies (Supplementary Fig. 5). We note that heating of a separate sample of Form II at 130 °C for 2 h resulted in a phase transformation to Form III (Supplementary Fig. 3).

Photoreactivity of π-stacks of double helix

The proximity of the C=C bonds (3.675(4) Å) within the stacks and between the double helices of Form II renders the solid photoactive12,13,14. Specifically, when Form II was subjected to UV-radiation (72 h), the C=C bonds of bpe underwent an intermolecular [2 + 2] photodimerization to afford rctt-tetrakis(4-pyridyl)cyclobutane (tpcb) stereospecifically and in near quantitative yield. The generation of tpcb was evidenced by the complete disappearance of the alkene peak at 7.65 ppm and appearance of the cyclobutane peak at 4.70 ppm on the 1H NMR spectrum (Supplementary Figs. 7 and 8).

When the resulting photoreacted solid was dissolved in ethanol, single crystals as colorless plates of composition 2(f-cat)·(tpcb) formed. The asymmetric unit consists of one full molecule of tpcb and two molecules of f-cat, which crystallize in the monoclinic space group P21/c (Fig. 4). The components generate a O-H···N hydrogen-bonded corrugated network within the crystallography ab-plane [O···N distances (Å): O(1)···N(3) 2.682(2), O(2)···N(4) 2.694(2), O(3 A)···N(2) 2.661(2), O(4 A)···N(1) 2.592(2)] (Fig. 4a). The cyclobutanes of 2(f-cat)·(tpcb) are bridged by f-cat in anti-gauche (-175.75°) and gauche-anti conformations (177.7°) (Fig. 4b, c). The network can be simplified as a sql topology (point symbol 44.62) with the nodes as the centroids of the cyclobutane rings of tpcb (Fig. 4d)15. Thus, the photodimerization supports a change from a double helix to a sql network. While [2 + 2] photodimerizations in the solid state have been reported between 3D network structures (i.e., diamondoid networks)16, we are unaware of an example wherein an intermolecular photodimerization occurs between helical structures.

Fig. 4: X-ray structure 2(f-cat)·(tpcb).
figure 4

a asymmetric unit, b hydrogen bonds, c sql topology, and d unit cell representation.

Conclusions

In this report, we have described trimorphism involving a hydrogen-bonded binary cocrystal. With f-cat being unsymmetrical and conformationally flexible, supramolecular isomerism was expressed in Form I as a planar zig-zag and Form II and Form III as double and quadruple helices, respectively. Components of the double helices of Form II were also demonstrated to be photoactive in the solid state. We are now studying the scope of the polymorphism and extension to additional unsymmetrical coformers with the idea of determining the breadth of self-assembled frameworks that existence within the polymorphism domain. With the structures of zig-zags also being present in biochemistry (e.g., Z-DNA), it is intriguing to consider whether zig-zags of biological origin have a capacity to spontaneously evolve and self-assemble into helical structures.

Materials and Methods

Materials

3-fluorocatechol, acetonitrile, chloroform, ethyl acetate, and methanol were purchased from Sigma-Aldrich Chemicals. Trans-1,2-bis(4-pyridyl)ethylene was purchased from TCI Chemicals. All chemicals were used as received and without further purification.

Photodimerization experiments

A powder sample of Form II was subjected to UV-radiation (up to 72 h) using an ACE photocabinet equipped with an ACE quartz, 450 W, broadband, medium pressure, Hg-vapor lamp.

Nuclear magnetic resonance

Proton nuclear magnetic resonance (¹H-NMR) spectra were measured on a Bruker AVANCE 400 MHz spectrometer using DMSO-d₆ as the solvent.

Mechanochemistry

Liquid-assisted grinding (LAG) experiments were conducted in an FTS-1000 shaker mill using 5.0 mL PTFE jars and two stainless-steel balls (5.0 mm diameter). The reactions were performed at 20 Hz with 50 µL of solvent.

Single-crystal X-ray diffraction

Single-crystal X-ray diffraction (SCXRD) data were collected on a Bruker Nonius Kappa CCD single-crystal X-ray diffractometer using MoKα radiation (λ = 0.71073 Å). Reflections were harvested from ϕ and ω scans with an intensity threshold of I > 2σ (I). Data collection, reduction, and cell refinement were accomplished using the Bruker Apex2 software suite. Using Olex217, structure solution and refinement were accomplished using SHELXT18 and SHELXL19, respectively. Non-hydrogen atoms were refined anisotropically. Hydrogen atoms of carbon atoms were refined in geometrically constrained positions.

Powder X-ray diffraction

Powder X-ray diffraction (PXRD) data were collected on a Bruker D8 Advance X-ray diffractometer using CuKα1 radiation (λ = 1.5418 Å) in the range 5–45° (scan type: coupled Two Theta/Theta; scan mode: continuous PSD fast; step size: 0.019°) (40 kV and 30 mA).

Computational analysis

CrystalExplorer 21.5 program20 was utilized to evaluate the interaction energies at B3LYP/6311 G(d,p) level. Interaction energies (Eele, Emol, Edis, Erep, Etot) of Forms I-III are shown in Supplementary Table 4 and energy frameworks are illustrated in Supplementary Figs. 46.