Introduction

The most recent common ancestor (MRCA) of cyanobacteria emerged approximately 3–2 billion years (Ga) ago1,2,3,4,5,6 and evolved into the current ecosystem through the Great Oxidation event (GOE) 2.3 Ga ago7, at least two Snowball Earth events (SEEs) 2.4 and 0.7 Ga ago8,9,10, and the Neoproterozoic Oxygenation event (NOE) 0.8–0.6 Ga ago11,12. Studies of fossils and molecular evolution models suggest that the MRCA of cyanobacteria already possessed primitive oxygenic photosynthetic systems1,2,4. Because the efficiency of photosynthesis is strongly influenced by light–dark environmental cycles, determining whether primitive cyanobacteria possessed a timekeeping system when photosynthesis became active in the GOE period is important to understanding the physiological origin of circadian clock systems13. Primitive biological clocks are also of interest from the perspective of geoscience, and the evolution of the Earth’s rotation period from ancient times to the present remains an issue of debate14.

Circadian clocks with autonomous 24-h oscillatory, temperature compensatory, and synchronous properties have been identified in various extant organisms15, including bacteria16, fungi17, plants18, and mammals19. The circadian clock of cyanobacteria has been studied intensively using the Synechococcus elongatus PCC7942 (Se7942) strain as a model system20,21. The clock oscillator can be reconstructed in a test tube by incubating the core protein KaiC from the Se7942 strain (KaiCSe) in the presence of KaiASe, KaiBSe, and adenosine triphosphate (ATP)22. KaiCSe exhibits rhythms in phosphorylation23,24 and interactions with KaiASe and KaiBSe25,26,27,28; however, the frequency of these rhythms is highly correlated with the ATPase activity of KaiCSe itself29,30. For example, when the ATPase activity of KaiCSe doubles as a result of amino acid substitutions, the frequencies of both the in vitro oscillator and the intracellular rhythm also double. This process mediated by KaiCSe is known as cross-scale causality and is also observed in temperature compensation31.

One hypothesis worth testing is whether primitive cyanobacteria utilized a Kai-protein oscillator. However, the concept of a protein-based oscillator is derived almost exclusively from experiments and observations using one extant species, the Se7942 strain. Therefore, whether Kai proteins functioned as circadian oscillators in ancestral cyanobacteria, or even in extant species other than the Se7942 strain, remains unknown.

In this work, we study the function and structure of extant and ancestral Kai-proteins to present a scenario for the evolution of the self-sustained Kai-protein oscillators. According to our estimates, the earliest Kai-protein oscillator emerges in the MRCA of cyanobacteria at approximately 2.2 Ga ago and is able to synchronize with temperature cycles of 18 h, which is shorter than the current rotation period of Earth. The ancient Kai-protein oscillator has been inherited through a long evolutionary process by most extant freshwater cyanobacteria and some marine cyanobacteria living under 24-h environmental cycles.

Results

Feasibility study of reconstructing Kai-protein oscillators in extant species

Duplication of the predecessor of the kaiC gene and their subsequent fusion into a double-domain predecessor occurred between 3.8 and 3.5 Ga ago32. We used the result of large-scale genome analyses of cyanobacteria and other species33 to construct a phylogenetic tree of extant double-domain KaiC homologs (Fig. 1a and Supplementary Fig. 1). The KaiC tree included four groups: group I was composed mainly of KaiCs from freshwater cyanobacteria, including Se7942 (KaiCSe, ID: 001); group II consisted of marine cyanobacteria; group III included other bacteria; and group IV was formed by other bacteria including archaea. Most freshwater cyanobacteria have only one kaiC of group I, but others have additional group III or IV kaiC originated from secondary copies or lateral transfer. Thus, these group III and IV KaiCs from freshwater cyanobacteria (gray entries in Fig. 1a) in the protein-based tree can be excluded when identifying groups I–IV and nodes based on species and growing habitant (see Supplementary Discussion).

Fig. 1: Feasibility studies of in vitro oscillators for extant and ancestral Kai proteins.
figure 1

a Phylogenetic tree of extant double-domain KaiC homologs. Cyanobacteria in freshwater, marine cyanobacteria, proteobacteria, bacteria, and euryarchaeota/crenarchaeota are colored green, blue, cyan, yellow, and red, respectively. Group III and IV KaiCs from freshwater cyanobacteria (gray entries) are originated from secondary copies of group I kaiC gene or lateral gene transfer. The three numbers preceding the name of each entry indicate the ID number, the total number of amino acids, and the sequence identity with KaiCSe, respectively. A*B*C*: sets of species-specific KaiA*, KaiB*, and KaiC*. ASeB*C*: sets of KaiASe, KaiB*, and KaiC*. ASeBSeC*: sets of KaiASe, KaiBSe, and KaiC*. Rhythmic sets are indicated by filled circles, whereas arhythmic sets are labeled with crosses (see each result in Supplementary Fig. 3). Prep. C* and Prep. A*B* indicate whether species-specific Kai proteins were adequately expressed and purified (open circles), prepared at a level that could be analyzed (triangles), or not prepared at all (crosses). Ex. A*B* indicates the classification of the existence of kaiA* and kaiB* genes; e.g., kaiA* and kaiB* coexist (ab), only kaiA* exists (a–), only kaiB* exists (–b), and neither exists (–). Symbols α–η represent evolutionarily important branching points for which the amino acid sequence of ancestral KaiC was restored (Fig. 2a and Supplementary Fig. 4a). In vitro rhythm assays of A*B*C* (circles) or ASeB*C* (diamonds) at 30 °C (blue) and 40 °C (red) as functions of incubation time (IT) for b Cyanothece sp. PCC7822 (KaiCCP, ID: 007) in group I, c Oscillatoriales cyanobacterium JSC 12 (KaiCOs, ID: 002) in group I, d Synechococcus sp. RCC307 (KaiCSR, ID: 046) in group II, e Synechococcus sp. WH 7803 (KaiCWH, ID: 056) in group II, f Rhodospirillum rubrum F11 (KaiCRr, ID: 104) in group III, g Cyanothece sp. ATCC 51142 (KaiCCy2, ID: 085) in group III, and h Flavobacterium johnsoniae UW101 (KaiCFj, ID: 212) in group IV. Shown are the representative examples of the fraction of phosphorylated KaiC (P-KaiC) from multiple experiments summarized in Supplementary Fig. 3. AR indicates that the period could not be determined (arrhythmic). In vitro rhythm assays for i AγBα/βCα (circles) and ASeBSeCα (diamonds), j AγBα/βCβ (circles) and ASeBSeCβ (diamonds), k AγBγCγ (circles) and ASeBSeCγ (diamonds), l AζBγCζ (circles) and ASeBSeCζ (diamonds), m AδBδCδ (circles) and ASeBSeCδ (diamonds), n AγBεCε (circles) and ASeBSeCε (diamonds), and o AγBηCη (circles) and ASeBSeCη (diamonds). Plots are shown as means ± s.d. of three replicates (see Table 1 for details on the number of replicates and their sources). Blue circles and gray diamonds correspond to the data taken at 30 °C, and red circles correspond to the data taken at 40 °C.

Rhodobacter sphaeroides KaiC (KaiCRs, ID: 194), which functions as an hourglass-like timekeeping system34, and Rhodopseudomonas palustris TIE 1 KaiC (KaiCRp, ID: 184), which is not self-sustained but enhances fitness in the rhythmic environment35, were classified into group IV in the present KaiC tree. The overall topology and grouping of the protein-based tree of double-domain KaiCs are consistent with those of the 16S rRNA-based phylogenic tree (Supplementary Fig. 2, Supplementary Discussion) as well as with a previous report32.

We used the in vitro rhythm assay22 to test the feasibility of establishing the in vitro oscillator for 41 representative extant species including the Se7942 strain at pH 8.0 and 30 °C (Supplementary Fig. 3). In most KaiCs from extant species belonging to group I, circadian rhythms were observed in the presence of the corresponding species-specific KaiA and KaiB. For example, Cyanothece sp. PCC7822 KaiC (KaiCCP, ID: 007) cycled between phosphorylated and de-phosphorylated forms with a period of 27.2 h (Fig. 1b) in the presence of KaiACP and KaiBCP, as observed in the Se7942 system (Supplementary Fig. 3a). The same rhythm was observed in Oscillatoriales cyanobacterium JSC 12 KaiC (KaiCOs, ID: 002) (Fig. 1c). Cases in which an obvious rhythm was detected in a set (A*B*C*) corresponding to species-specific KaiA (KaiA*), KaiB*, and KaiC* are shown in filled circles in Fig. 1a. Regardless of the behaviors of the A*B*C* set, cases in which oscillation was detected when KaiA* and KaiB* were replaced by KaiASe and KaiBSe, respectively (ASeBSeC*), are also marked with filled circles in a different column (Fig. 1a). In approximately 80% of group I A*B*C* trials, the rhythm showed a period of 25.2 ± 4.9 h at 30 °C, whereas in approximately 70% of the examined group I ASeBSeC* sets, the period was 23.4 ± 4.5 h (Supplementary Fig. 3ac).

In species with KaiCs classified into groups II–IV that did not have a homologous gene for kaiA, KaiASe was hypothetically used for the in vitro rhythm assay (ASeB*C*). Robust circadian rhythms were observed for most of the examined A*B*C* sets in the first half of the group II entries (approximately 70%, Supplementary Fig. 3d), such as Synechococcus sp. RCC307 (KaiCSR, ID: 046) (Fig. 1d) and Synechococcus sp. WH 7803 (KaiCWH, ID: 056) (Fig. 1e). In the other half of group II, which consisted of Prochlorococcus strains lacking species-specific KaiA, most KaiCs could not be expressed or purified. We were able to purify KaiC from Prochlorococcus marinus str. MIT 9211 (ID: 073); however, it showed no rhythm, even in the corresponding ASeBSeC* set (Fig. 1a and Supplementary Fig. 3d).

Entries in Group III are KaiCs from proteobacteria (cyan entries in Fig. 1a) and bacteria (yellow entries in Fig. 1a), and KaiCs as secondary copies of group I kaiC gene (gray entries in Fig. 1a). Proteobacterial KaiC from Rhodospirillum rubrum F11 (KaiCRr, ID: 104) in group III (Fig. 1f) failed to show self-sustained oscillation in their ASeB*C* and ASeBSeC* sets. Two gray entries in Group III, KaiCs from Cyanothece sp. ATCC 51142 (KaiCCy2, ID: 085) and Synechocystis sp. PCC 6803 (KaiCSy3, ID: 093), exhibited self-sustained oscillation in the presence of species-specific KaiB and chimeric KaiA homolog often called KaiA336 (Fig. 1g and Supplementary Fig. 3e).

The rhythmic fraction of the group IV entries was relatively low (Fig. 1a): 0% for both the ASeB*C* and ASeBSeC* sets of group IV (Supplementary Fig. 3f). For example, KaiC from Flavobacterium johnsoniae UW101 (KaiCFj, ID: 212) in group IV (Fig. 1h) did not show self-sustained oscillation in either the ASeB*C* and ASeBSeC* sets.

These results suggest that not all extant KaiC homologs with a double-domain architecture have a circadian clock function, which raises the question of how and when KaiC acquired its function as a self-sustained circadian clock.

In vitro reconstruction of ancestral Kai protein oscillators

We restored the possible amino acid sequences of ancestral KaiCs corresponding to seven evolutionary branching points (α–η) (Fig. 1a and Supplementary Fig. 1, Supplementary Discussion). Ancestral KaiCs consist of 509–560 amino acids and contain an N-terminal CI domain and a C-terminal CII domain (Fig. 2a and Supplementary Fig. 4a). Similar to KaiCSe, all ancestral KaiCs retain catalytic glutamates essential for ATPase activity in the CI domain and kinase/phosphatase activity in the CII domain, as well as a potential dual phosphorylation site (S431 and T432 in KaiCSe) in the CII domain (Fig. 2a). A phylogenetic tree of extant KaiB homologs was constructed (Supplementary Fig. 5) and compared with that of the double-domain KaiC homologs (Fig. 1a). According to the similarities in the taxonomic patterns of the belonging species (see coloring by growing habitat), the branches corresponding to the γ, δ, ε, and η points were assigned to the KaiB tree (Supplementary Figs. 5 and 4b), whereas α and β were identified as a point (α/β) with no clear distinction between them. Consistent with a previous study showing that ancestral KaiA appeared at a later time than ancestral KaiB and KaiC32,37, we were only able to assign three later branches for the γ, δ, and ζ points in our tree of double-domain KaiA homologs (Supplementary Figs. 6 and 4c). Because ancient seawater was neutral (pH ~ 7)38, whereas the current seawater is slightly basic (pH ~ 8), and because some cyanobacteria undergo a change in intracellular pH in response to the pH of their environment39, these ancestral forms of Kai proteins were expressed and purified for an in vitro rhythm assay at pH 7.0 and a temperature of 30 °C.

Fig. 2: Domain architecture and biochemical properties of ancestral KaiCs.
figure 2

a Key residues and motifs mapped to the domain compositions of KaiCSe and ancestral KaiCs. b Temperature dependence of the steady-state ATPase activity of ancestral KaiCs at pH 7.0 and KaiCSe at pH 8.0. Open circles correspond to raw data shown in Supplementary Table. 1. Bar graphs show means ± s.d.; For KaiCSe (30 °C and 40 °C), three independent purifications; For KaiCα, KaiCβ, and KaiCδ (30 °C and 40 °C), three replicates from a single purification; For KaiCγ, three replicates from a single purification (30 °C) and three replicates from two independent purifications (40 °C); For KaiCε and KaiCη (30 and 40 °C), six replicates from two independent purifications; For KaiCζ, six replicates from a single purification (30 °C) and six replicates from two independent purifications (40 °C). c Pre–steady-state analysis of the ATPase activity. Dark-colored circles correspond to the mean of separate experiments; For KaiCα, KaiCδ, KaiCε, and KaiCζ, three independent purifications (open squares, triangles, and diamonds); For KaiCβ, two independent purifications (open squares and triangles); For KaiCγ, four independent purifications (open squares, triangles, inverted triangles, and diamonds). The data of KaiCSe were taken from the previous report29 and presented as mean ± s.d. of separate experiments using three independent purifications. Bioluminescent rhythm assays for Se7942 reporter strains carrying d kaiASe: kaiBSe: kaiCγ and e kaiASe: kaiBSe: kaiCβ, but without kaiCSe, were performed under continuous-light illumination condition at 30 °C. Thin and thick lines represent individual (n = 12 and 7 for (d) and (e), respectively) and average traces, respectively. The period length (32.9 ± 0.7 h) of the strain with kaiASe: kaiBSe: kaiCγ was consistent with that of the in vitro AγBγCγ set at pH 7.6–7.8 at 30 °C in (f). f pH-dependence of the period length for the in vitro AγBγCγ, AδBδCδ, and ASeBSeCSe sets at 30 °C and 40 °C. The data for the ASeBSeCγ set at 30 °C at pH 7.0 (three replicates from a single purification) and pH 8.0 (three independent purifications) were presented by brown circles as means ± s.d. Each plot other than the ASeBSeCγ set corresponds to a single experiment using a single purification.

A hypothetical oldest set (AγBα/βCα) consisting of KaiAγ, KaiBα/β, and KaiCα did not show any signs of oscillation, and this was also observed in the ASeBSeCα set (Fig. 1i). Similar results were obtained for the second oldest AγBα/βCβ and ASeBSeCβ sets (Fig. 1j). By contrast, the AγBγCγ and ASeBSeCγ sets exhibited robust oscillations with periods of 23.4 ± 1.5 and 25.8 ± 0.3 h (Fig. 1k), respectively. Similarly, both the ζ sets (AζBγCζ: 28.4 ± 1.8 h, ASeBSeCζ: 28.3 ± 0.1 h, Fig. 1l) and the δ sets (AδBδCδ: 36.8 ± 2.7 h, ASeBSeCδ: 22.2 ± 0.4 h, Fig. 1m) showed rhythmic oscillation. Neither the ε sets nor the η sets (AγBεCε, ASeBSeCε, AγBηCη, nor ASeBSeCη) exhibited any rhythmicity (Fig. 1n, o).

These results indicate that the oldest KaiCα (62% identity to KaiCSe, Fig. 2a) and the subsequent KaiCβ (74% identity) lacked oscillatory ability and that this ability was acquired during the transition from KaiCβ to KaiCγ (89% identity). Consistent with this interpretation, the ATPase activities of KaiCα and KaiCβ were temperature-dependent (Q10: 2.1–2.2), whereas those of KaiCγ and KaiCδ were temperature-compensated (Q10: 0.9–1.2) (Fig. 2b and Supplementary Table 1). Pre–steady-state analysis of the ATPase activity supported this interpretation: KaiCSe exhibits damped oscillation of ATPase activity even in the absence KaiASe and KaiBSe29, and similar characteristics were observed in KaiCγ, KaiCδ, and KaiCζ (Fig. 2c). In addition, an Se7942 reporter strain in which kaiCSe was replaced by kaiCγ showed a bioluminescent rhythm under constant-light illumination conditions at 30 °C (Fig. 2d, e), and its period length (32.9 ± 0.7 h) was consistent with that of the in vitro AγBγCγ set at pH 7.6–7.8 and 30 °C (Fig. 2f). The acquisition of oscillatory abilities that occurred in the transformation from KaiCβ to KaiCγ is not a wild hypothesis confined to the test tube but a possible evolutionary event that could have occurred in primitive cyanobacteria.

We found that the period length of the AγBγCγ set was sensitive to ambient pH. The cycle length, which was 39 h at pH 8.0 and 30 °C, became shorter under acidic conditions, finally stabilizing at 23.4 ± 1.5 h at pH 7.0 and 30 °C (Fig. 2f). The ASeBSeCSe and AδBδCδ sets showed a similar tendency, although they were less pH-dependent than the AγBγCγ set. The results for the ASeBSeCγ set (Figs. 1k and 2f) suggest that the pronounced pH sensitivity of the period length is derived from KaiCγ itself. Taking into account that the intracellular pH of cyanobacteria changes in response to the external pH39, that the pH of ancient seawater was ~738, and that the period length is insensitive to ionic strength (Supplementary Fig. 7), the period length at pH 7.0 and 30 °C is reasonably concluded to be a better estimate for ancestral types than that at pH 8.0 and 30 °C. In fact, the AγBγCγ set was entrained by external temperature cycles of 18 to 24 h at pH 7 (Fig. 3a). It worth noting that successful entrainment by the short 18-h cycle was confirmed only for the γ set at pH 7 (Fig. 3a), but not for the ASeBSeCSe set at pH 8 (Fig. 3b)40.

Fig. 3: In vitro temperature entrainment of the Kai-protein oscillators.
figure 3

a Fraction of phosphorylated KaiC (P-KaiC) of the AγBγCγ set at pH 7.0. b P-KaiC of the ASeBSeCSe set at pH 8.0. Each set was reconstituted at 0 h (red circles), 6 h (blue circles), and 12 h (gray circles) intervals and incubated at 30 °C for 36, 30, and 24 h, respectively. After incubation, the pre-incubated samples were exposed to temperature cycles (red lines) of 9 h at 40 °C and 9 h at 30 °C (9H9L: top), 10 h at 40 °C and 10 h at 30 °C (10H10L: middle), or 12 h at 40 °C and 12 h at 30 °C (12H12L: bottom).

Structural evolution of ancestral KaiC

To explore the structural origin of the autonomous oscillatory properties acquired in the transition from β to γ, we solved the X-ray crystal structures for KaiCβ, KaiCγ, and KaiCδ (Supplementary Table 2). The overall structures of KaiCγ and KaiCδ were similar to that of KaiCSe (Fig. 4a), and their main chains could be superimposed with a root-mean-squared deviation (RMSD) of 1.1 Å. The similarity between KaiCγ and KaiCSe was confirmed by the superimposition of other parts using one particular CII domain (star in Fig. 4b). By contrast, the structure of KaiCβ differed markedly from that of KaiCγ (main-chain RMSD of 2.0 Å). When KaiCβ and KaiCγ were superimposed using one of the CII domains (star in Fig. 4c), the other CI and CII domains did not overlap (main-chain RMSD of 5.3 Å). These findings suggest that ancestral KaiCs acquired the basic architecture for establishing the autonomous oscillator found in KaiCSe during the transition from KaiCβ to KaiCγ.

Fig. 4: Crystal structures of ancestral KaiCs.
figure 4

a Similarity of the overall structures of KaiCγ (pink), KaiCδ (blue), and KaiCSe (gray, PDB ID: 7DXQ). Black thick arrows indicate the viewing directions for each panel. b Superimposition of KaiCγ (pink) and KaiCSe (gray) using one of the CII domains (magenta star). The main-chain root-mean-squared deviation (RMSD) values of the fitted and other parts were 0.7 and 1.2 Å, respectively. For clarity, only the two most distantly located subunits within the hexamer are shown. c Superimposition of KaiCγ (pink) and KaiCβ (yellow) using one of the CII domains (magenta star). Main-chain RMSD values of the fitted and other parts were 0.7 and 5.3 Å, respectively. d Zoomed-in views of the pre-hydrolysis CI–CI interfaces (left) for KaiCγ (pink/orange), KaiCδ (cyan/blue), and KaiCSe (white/gray), and the post-hydrolysis CI–CI interface (right) for KaiCβ (yellow/brown). Dotted magenta lines represent hydrogen bonds. e Zoomed-in views of the nucleotide enclosure loop for KaiCγ (top) and KaiCβ (bottom). In bottom panel, KaiCγ was superimposed on KaiCβ using one CI protomer, and then only the AMP–PNP molecule (green stick model) bound to KaiCγ is shown for comparison with the ADP molecule (magenta stick model) of KaiCβ. f Exchange with AMP–PNP was barely detected in KaiCβ-bound ADP, in contrast to KaiCSe- and KaiCγ-bound ADP. Ancestral KaiCs and KaiCSe were incubated at pH 7.0 and 8.0, respectively, at 30 °C in the presence of 1 mM AMP–PNP; the fractions of nucleotides bound by KaiCs were quantified. Bar graphs show mean ± s.d. of three replicates from a single purification. g CII domains of KaiCγ (left) and KaiCβ (right) viewed from the inner-diameter side of the hexamers. A-loop and PSw correspond to the C-terminal tail and the phosphor-switch, respectively, in KaiC. The magenta meshes represent the FobsFcalc omit maps contoured at 3σ. Dotted red lines represent potentially flexible parts that could not be determined because of poor electron density.

Structural differences between KaiCβ and KaiCγ were found in both the CI and CII domains. Three major differences were observed in the CI domain with the ATPase activity responsible for period-length determination29,30. The first is the type of bound nucleotide. KaiCγ and KaiCδ were crystallized in the pre-hydrolysis state with a non-hydrolyzable ATP analog (AMP-PNP) bound to the CI domain, as described for KaiCSe29,41 (left in Fig. 4d and Supplementary Fig. 8a); by contrast, KaiCβ was crystallized in the post-hydrolysis state with an adenosine diphosphate (ADP) bound to its CI active site even in the presence of 5 mM AMP–PNP (right in Fig. 4d and Supplementary Fig. 8b).

The second difference is the interface between two adjacent CI domains. In the pre-hydrolysis states of KaiCγ, KaiCδ, and KaiCSe (left in Fig. 4d and Supplementary Fig. 8c), each CI–CI interface is stabilized by the hydrogen bond between the Ser residue side chain (S46γ, S44δ, and S48Se: Fig. 2a) in one CI domain and the carboxyl oxygen atom of the Glu residue (E196γ, E194δ, and E198Se) in the adjacent domain. However, in KaiCβ, the corresponding Ser residue is replaced by Pro (P42β: Fig. 2a) and the packing of the CI–CI interface becomes loose because of the cleavage of the inter-domain hydrogen bond (right in Fig. 4d), as observed for the post-hydrolysis CI ring of KaiCSe27,29. In addition, the replacement of the amino acid corresponding to S199 in KaiCγ (S199γ) by A195 in KaiCβ (A195β) leads to further selective stabilization of the post-hydrolysis CI conformation of KaiCβ. In the pre-hydrolysis state of KaiCγ (left in Fig. 4d), the α8 helix is stabilized by the hydrogen bond between S199γ and E195γ (between S197δ and E193δ in KaiCδ, and between S201Se and E197Se in KaiCSe). In KaiCβ, however, cleavage of this hydrogen bond by A195β enables the partial unfolding of the α8 helix (right in Fig. 4d), as confirmed by the loose CI–CI interface of the post-hydrolysis CI of KaiCSe27,29. These observations support the preferential binding of ADP to the loose CI–CI interface of KaiCβ.

The third difference is the insertion of a residue into the loop enclosing the nucleotide bound in the active site of the CI domain. Compared with KaiCSe and KaiCγ (top in Fig. 4e), KaiCβ had a proline residue inserted at the 235th position (P235β: Fig. 2a) and its structure showed that the residue was located in the turn of the nucleotide enclosure loop (bottom of Fig. 4e). The distorted structure of the ADP bound nearby suggests that, in KaiCβ, P235β pushes its adenine ring toward the center of the CI ring and inhibits the release of bound ADP. In rhythmic KaiCs such as KaiCγ, KaiCδ, and KaiCSe, no such inhibitory structures for releasing nucleotides exist. Consistent with this structural observation, biochemical experiments showed that ADP bound to KaiCβ was highly stabilized and, in contrast to ADP bound to KaiCSe and KaiCγ, was not easily exchanged by AMP–PNP (Fig. 4f).

These observations suggest that the loose CI–CI interface results in the preferential binding of ADP (right in Fig. 4d) and that the elongation of the nucleotide enclosure loop (bottom in Fig. 4e) further prevents the exchange of generated or bound ADP (Fig. 4f). These factors might have caused the consistent occupation of the CI site by ADP in KaiCβ, resulting in a reduction of the ATPase activity (1.8 ± 0.2 ATP d−1 at 30 °C, Fig. 2b) and the loss of the damped oscillatory relaxation in KaiCβ (Fig. 2c). KaiC might have acquired its oscillatory ability by evolving its CI structure to enable nucleotides bound to the CI domain to be exchanged on a reasonable circadian time scale.

The structural differences between KaiCβ and KaiCγ were not only observed in the CI–CI interface but also in the CII domain and the CI–CII interface. In rhythmic KaiCγ, a region upstream of the C-terminal tail to which KaiAγ binds42 was clearly resolved as a loop structure (A-loop) with a stabilizing hydrogen bond between Q439γ and I487γ (left in Fig. 4g), as observed between Q441Se and I489Se in KaiCSe41,43. In KaiCβ, however, the A-loop was destabilized by R436β, which breaks this hydrogen bond (right in Fig. 4g). The electron density of the phosphor-switch, which changes conformation upon phosphoryl modification of the dual phosphorylation site (S431Se and T432Se) in rhythmic KaiCSe41, was clearly identified in KaiCγ but not in KaiCβ (right in Fig. 4g). In addition, the 389th residue located in the CI–CII interface is Glu in KaiCβ (E389β), whereas the corresponding residue is Gln in the rhythmic KaiCγ (Q392γ) and KaiCSe (Q394Se). In KaiCSe, Q394Se is critical for establishing the rhythmic coupling between the CI and CII domains by switching the interaction with E214Se and/or R217Se (Fig. 2a and left in Fig. 4g), as evidenced by the arrhythmicity of the Q394E variant of KaiCSe41.

Discussion

Conservation of autonomous Kai-protein oscillators in group I

Since the successful in vitro reconstruction of the ASeBSeCSe oscillator in 200522, the Se7942 strain has become the major target of research on the circadian clock system in cyanobacteria20,21. So far, there have been no reports of comprehensive feasibility study of establishing in vitro oscillators for bacteria other than the Se7942 strain. For this reason, it was possible to argue that many cyanobacteria actually require transcription-translation feedback44 to maintain their rhythm, and that Se7942 is an exceptional case. The present study clearly demonstrates that this is not the case, and that the self-sustained Kai-protein oscillators are conserved in approximately 80% of group I cyanobacteria (Fig. 1a).

At the same time, careful interpretation may be needed for the other entries of group I, which cannot be tested due to unavailability of the Kai-protein samples or failed to show the self-sustained circadian oscillation in vitro (Fig. 1 and Supplementary Fig. 3ac). For example, it is known that Nostoc (Anabaena) sp. PCC 7120 exhibits in vivo bioluminescence rhythm45,46, but in this study, we could not conduct the in vitro test because its species-specific KaiCNo (ID: 024 in Fig. 1a) was not expressed even by 12 different trials using different expression hosts, fusion tags, and culturing conditions47. Since KaiC from Anabaena sp. 90 (KaiCAn, ID: 035 in Fig. 1a) was rhythmic in vitro in the presence of KaiAAn and KaiBAn (Supplementary Fig. 3b), it is possible that the ANoBNoCNo set will also be judged as rhythmic if KaiCNo can be prepared. As another example, KaiC from Fischerella sp. NIES 3754 (KaiCFi, ID: 060 in Fig. 1a) showed circadian rhythm in the presence of KaiASe and KaiBSe, but was determined to be arrhythmic in the presence of KaiAFi and KaiBFi (Supplementary Fig. 3c). One possible explanation for these examples is that a part of the A*B*C* sets in group I becomes stabilized and self-sustained only in special intracellular circumstances that cannot always be fully reproduced in vitro. If we were allowed to take such an extended view of self-sustained protein-based oscillation, the proportion of group I entries with the autonomous Kai-protein oscillators would be even higher than 80%. In this context, although this study does not deny the self-sustainment in the extended sense for the α and β sets in ancient bacteria, but it is clear from our experiments that they are not self-sustained in vitro in the strict sense.

γ node as the time when self-sustained Kai protein oscillators were established

It is important to verify obtained results from the perspective of uncertainties associated with the sequence estimation. The uncertainty in sequence prediction may come from the arbitrary nature of how much gap is allowed when aligning sequences of existing species. We thus performed the same in vitro rhythm assay on another set of ancestral sequences, which was estimated from the alignment using different gap-opening penalty (Supplementary Fig. 9a–c). While both the α and β nodes were arrhythmic, the γ nodes were found to be rhythmic regardless of the way of the sequence prediction (Supplementary Fig. 9d–f). Furthermore, the ancestral lineage of the autonomous oscillations originating from the γ node provides a reasonable explanation for the distribution pattern of the Kai-protein autonomous oscillators in the extant species (groups I–IV), excluding entries derived from secondary copies or lateral transfer (gray entries in Fig. 1a). These evidences suggest the limited effects of the arbitrariness of the sequence estimation on the present view that the self-sustained Kai-protein oscillator appeared at the γ node.

On the other hand, the effects of the uncertainty on the estimation of the cycle length may not be negligible for the following three reasons. First, the length of the cycle is a characteristic of the system composed of three types of ancestral Kai proteins, so the uncertainty is at least potentially additive. Second, the period length of the ASeBSeCSe set can be tuned in the range of 60% to 660% even by one amino acid substitution in a very critical region of KaiCSe48. Third, the cycle length of the AγBγCγ set was more pH-sensitive than that of the ASeBSeCSe set (Fig. 2f). Taken together, it would be reasonable to conclude that there is uncertainty between the period length of the reconstructed ancestral systems and the true period length of ancient bacteria.

A possible scenario for the evolution of the Kai protein oscillator

The 16S rRNA-based phylogenic tree (Supplementary Fig. 2) shows the presence of a node (γ point) diverging into freshwater and marine cyanobacteria, which can be interpreted as corresponding to the MRCA of cyanobacteria. KaiCγ is the topologically identical node that branches into the extant KaiCs of freshwater cyanobacteria (group I) and marine cyanobacteria (group II) (Fig. 1a and Supplementary Fig. 1), and the same applies to KaiBγ (Supplementary Fig. 5) and KaiAγ (Supplementary Fig. 6). These observations clearly indicate that the AγBγCγ set functioned as the self-sustained circadian oscillator in the MRCA of cyanobacteria capable of oxygenic photosynthesis.

There are many estimates as to when the cyanobacterial MRCA first appeared2, which range from 3.5 Ga to 1.5 Ga ago1,3,4,5,6. Our estimate obtained using 16S rRNA sequences dated an ancestral bacterium at an α/β point at 3.1 (3.5–2.8: upper–lower confidence intervals) Ga ago and the MRCA of cyanobacteria at the γ point at 2.2 (2.8–1.7) Ga ago (Fig. 5 and Supplementary Table 3). An ancestral form of the double-domain KaiA, which is evolutionarily related to KaiAγ, might have appeared approximately 1.0 Ga ago32; however, its appearance was subsequently updated to an earlier period, as late as 1.5 Ga ago37.

Fig. 5: Evolutionary origins and diversification of self-sustained Kai protein circadian oscillators.
figure 5

Divergence times were estimated on the basis of TimeTree53 analysis using 16S rRNA sequences. Filled circles and horizontal bars indicate estimated divergence times (1 Ga = 1 billion years) and confidence intervals, respectively (Supplementary Table 3). The approximate times of the emergence of the most recent common ancestor (MRCA) of cyanobacteria1,2,3,4,5,6, Great Oxidation Event (GOE)7, first and second Snowball Earth events (SEEs)8,9,10, the post-GOE period known as the “boring billion years” of ever-stable global conditions64, and the Neoproterozoic Oxygenation Event (NOE)11,12 are schematically shown for reference.

Under our divergence time estimate, the MRCA of cyanobacteria would have emerged around the GOE and first SEE period (Fig. 5). This period is characterized by the enhancement of oxygenic photosynthesis due to changes in the properties of ancient cyanobacteria1,2,3 or other ancient bacteria4,5,6, which destroyed the methane greenhouse and triggered global glaciation49. The resonance between the endogenous clock and environmental light–dark cycles is the basis for more efficient growth and survival of organisms, including cyanobacteria50. Thus, it is reasonable to speculate that the acquisition of the autonomous γ set allowed the MRCA of cyanobacteria to expect the timing of the next sunrise and prepare in advance to initiate oxygenic photosynthesis accordingly. The Earth’s rotation period 2 Ga ago has been suggested to be approximately 4 h shorter than that at present14; however, depending on the pH of the growing environment, the γ set in the cyanobacterial MRCA might have been sufficiently functional as a circadian oscillator (Figs. 1k and 3).

Regardless of the accuracy of the divergence time estimation, another ancestral bacterium corresponding to the ε/η point appeared at approximately the same time as the MRCA of cyanobacteria (Fig. 5). The ε/η point is assumed to possess not–self-sustained (Fig. 1o) and temperature-dependent (Fig. 2b) KaiCη, an ancestral form of the hourglass-like KaiCRs34 and not–self-sustained KaiCRp35. The self-sustained protein-based circadian oscillator established in the γ set was subsequently inherited by KaiCζ and KaiCSe and, consequently, by approximately 70% of group I cyanobacteria in freshwater (Fig. 1a and Supplementary Fig. 1). The inheritance from KaiCγ applied not only to KaiCζ but also to KaiCδ, and the autonomous oscillatory property was eventually passed on to marine cyanobacteria located downstream of KaiCδ. However, our results indicate that it was not inherited by the Prochlorococcus group located downstream of KaiCδ or that the inherited traits have changed by today. Given that the molecular evolution of oxygenic photosynthesis pre-dates the cyanobacterial MRCA by >1 Ga2,4, the unique evolution of the ancestral Kai proteins during the emergence of the cyanobacterial MRCA provided the autotrophic basis for the time-optimal acquisition and consumption of energy, later leading to their massive proliferation across various habitats on Earth.

Methods

Construction of protein-based phylogenic trees

The amino acid sequences of KaiC homologs were obtained from a dataset described previously33. From a total of 1194 entries, duplicate entries that had identical sequences but were from different species/strains were removed, resulting in 151 entries of 480–570 amino acid residues with >30% sequence identity to KaiCSe. These were extracted to form a dataset for analyzing KaiC homologs with double-domain structures (Supplementary Fig. 1).

The MEGA X software (version 10.1.7)51 was used for phylogenic analyses. The 151 sequences were arranged using pairwise/multiple alignments with a gap-opening penalty of 7.5. A preliminary phylogenetic tree was constructed using the maximum likelihood method, and 100 bootstrap replications were generated. To confirm the stability of the resultant phylogenetic pattern, different alignment results were analyzed using gap-opening penalties of 2.5, 5.0, 10.0, 15.0, and 20.0. In the preliminary phylogenetic trees, double-domain homologs of KaiC were classified into four groups as follows: cyanobacteria in freshwater (group I: 58 entries), cyanobacteria in seawater (group II: 26 entries), other bacteria (group III: 23 entries), other bacteria including archaea (group IV: 42 entries), and outgroup (2 entries). The final phylogenetic tree of double-domain KaiC homologs (Fig. 1a) was obtained using a gap-opening penalty of 7.5 through 500 bootstrap replications. A detailed description of the entries in each group is provided in Supplementary Discussion and Supplementary Fig. 1.

Phylogenetic analyses of KaiA and KaiB homologs were performed as described for KaiC using a dataset from a previous study33. For KaiB, we extracted 193 entries of 80–120 amino acid residues with >30% sequence identity to KaiBSe (Supplementary Fig. 5). For KaiA, extraction was performed with 50–350 amino acid residues with >30% sequence identity to KaiASe, resulting in 151 entries of double-domain homologs of KaiA (Supplementary Fig. 6). The species used to construct the phylogenetic trees for KaiA and KaiB were not necessarily identical to those used for the KaiC tree. The final phylogenetic trees for KaiB (Supplementary Fig. 5) and KaiA (Supplementary Fig. 6) were obtained using a gap-opening penalty of 7.5 through 500 bootstrap replications. See Supplementary Information for detailed descriptions of the entries in each group.

Construction of 16S rRNA-based phylogenic trees

The 16S rRNA sequences of extant species with double-domain KaiC homologs were obtained from KEGG52. The 16S rRNA-based tree shown in Supplementary Fig. 2 was constructed using MEGA X software (version 10.1.7) with 108 entries of the 151 KaiC homologs used for construction of the protein-based tree (Fig. 1a).

Restoration of amino acid sequences of ancestral KaiCs

The amino acid sequences corresponding to seven nodes on the phylogenic tree of KaiC (Fig. 1a) were inferred on the basis of the maximum likelihood estimation using the amino acid substitution model of LG with Freqs. (+F) and gamma-distributed substitution rates. The oldest ancestor (KaiCα) was detected directly below a node branching into groups I–IV, Pyrococcus horikoshii OT3 KaiC (KaiCHori), and Pyrodictium delaneyi KaiC (KaiCDela).

There were two branches emerging from KaiCα: one ended in groups I–III, and the other ended in group IV. A node connecting to groups I–III (KaiCβ) showed high bootstrap values (98%–99%) irrespective of the gap-opening penalties (7.5–15). The KaiCβ node branched into groups I–II and group III. A node branching into groups I and II (KaiCγ) also emerged with high bootstrap values (99%) irrespective of the gap-opening penalties used. Although the common ancestor of group II (KaiCδ) showed a high bootstrap value (31%–97%), those of group I (KaiCζ) and group III (KaiCε) showed relatively lower values (20%–51% and 53%–63%, respectively). The common ancestor of group IV (KaiCη) was detected with the bootstrap vale (98%) as high as those of KaiCβ and KaiCγ.

Restoration of amino acid sequences of ancestral KaiAs and KaiBs

The amino acid sequences corresponding to the nodes on the phylogenic trees of KaiA and KaiB were estimated as described for ancestral KaiCs. In the KaiB tree constructed with 193 entries (Supplementary Fig. 5), we assigned one node for each, KaiBγ and KaiBδ, on the basis of similarities in the taxonomic patterns of the corresponding entries (see coloring by growth habitat in Supplementary Figs. 1 and 5). The KaiBδ node was located downstream of KaiBγ. Because a node upstream of KaiBγ was a common ancestor for all entries, it was termed KaiBα/β. A node that did not contain downstream archaeal entries was termed KaiBε.

Only three later branching points (γ, δ, and ζ) could be assigned in the double-domain KaiA tree consisting of 151 entries (Supplementary Fig. 6), which is consistent with the later appearance of ancestral KaiA than ancestral KaiB37. KaiAγ was reasonably allocated to a node corresponding to a common ancestor of the double-domain KaiA from both freshwater cyanobacteria and seawater cyanobacteria. Similarly, KaiAζ and KaiAδ were assigned to nodes corresponding to ancestors of cyanobacteria in freshwater and seawater, respectively.

Divergence time estimation

The divergence times of the α/β, γ, δ, ε/η and ζ points in the 16S rRNA-based tree (Supplementary Fig. 2) were estimated using MEGA X software (version 10.1.7)51. According to the TimeTree53 estimate of the divergence time between Synechococcus and Flavobacterium, a calibration point was established at the α/β point as normal distribution with a median at 3.134 Ga and a deviation of 0.5 Ga.

Cloning of extant and ancestral kaiA, kaiB, and kaiC

The DNA sequences of extant and ancestral Kai-proteins were synthesized by Eurofins genomics with codon optimization for expression in Escherichia coli (E. coli). Unless otherwise stated, the synthesized genes were cloned into the pGEX6P-1 vector (Cytiva) with BamHI and NotI cleaving sites to overexpress GST-tagged Kai proteins. For ancestral KaiCα, KaiCβ, KaiCε, and KaiCη, the synthesized genes were cloned into the pET-3a vector (Novagen) with NdeI and BamHI cleaving sites using nucleotides encoding hexagonal histidine (His) at the 3′ end. For extant KaiCs from Flavobacterium johnsoniae UW101 and Rhodopseudomonas palustris TIE-1 strains, genes were cloned into the pASK-IBA5plus vector (IBA GmbH) with the BsaI cleaving site to overexpress Strep-tagged KaiCs because of the poor yields associated with the GST-tagged form.

Expression and purification of extant and ancestral Kai-proteins

Extant and ancestral KaiCs were expressed in E. coli BL21 cells and then purified using affinity chromatography, ion exchange chromatography, and size exclusion chromatography47 according to the type of fusion tags used. Irrespective of the extant or ancestral types, KaiA and KaiB were purified following the methods used for KaiASe and KaiBSe, respectively54.

In vitro rhythm assay

For the in vitro rhythm assay of the extant species, KaiA, KaiB, and KaiC were mixed in Tris-buffer (20 mM Tris, 150 mM NaCl, 5 mM MgCl2, 1 mM DTT, and 0.5 mM EDTA, pH 8.0) containing 1 mM ATP, and the time-evolution of the phosphorylation of KaiC was monitored. The potential oscillation of the respective KaiCs was assessed in the presence of various concentrations of KaiA (0.01, 0.02, 0.04, and 0.08 mg/mL) and a fixed concentration of KaiB (0.04 mg/mL) according to previous observations that the amount of KaiA in the system markedly affects the amplitude or stability of the rhythm31,48,55. KaiCs that showed any slight sign of oscillation under either condition exhibited a stable rhythm in the presence of moderate concentrations of KaiA (0.02–0.04 mg/mL) (Supplementary Fig. 3). For the assay of KaiCSy3 from Synechocystis sp. PCC 6803 (ID: 093 in Fig. 1a), a chimeric KaiA homolog (protein ID: WP_010873803.1, ~22% identity to KaiASe) that is not listed in the present KaiA tree but has recently been identified was used together with KaiBSy3 (ID: 179 in Supplementary Fig. 5d)36. Similarly, for the assay of KaiCCy2 from Cyanothece sp. ATCC 51142 (ID: 085 in Fig. 1a), a KaiA homolog (protein ID: WP_009543244.1, ~17% identity to KaiASe) and KaiBCy2 (ID: 086 in Supplementary Fig. 5d) were used36.

For the in vitro rhythm assay of the ancestral forms, KaiA, KaiB, and KaiC were mixed in Tris-buffer (20 mM Tris, 150 mM NaCl, 5 mM MgCl2, 1 mM DTT, and 0.5 mM EDTA, pH 7.0) containing 1 mM ATP at final Kai-protein concentrations listed in Table 1.

Table 1 Kai-protein concentrations for in vitro rhythm assays

ATPase measurements

ATPase measurements were performed by monitoring the accumulation of ADP47 using an ultrahigh-performance liquid chromatography (UHPLC) system (Chromaster, Hitachi). Ancestral KaiCs and KaiCSe were incubated in a Tris-buffer containing 1 mM ATP at pH 7.0 and pH 8.0, respectively. The ATPase activity of KaiC was defined as the number of hydrolyzed ATP molecules into ADP molecules per KaiC monomer per unit time.

Pre-steady-state ATPase assay

For the pre-steady-state ATPase assay29, an apo-KaiC monomer was first prepared using a phospho-buffer (50 mM NaH2PO4, 150 mM NaCl, 50 mM l-arginine, 50 mM l-glutamic acid, 5 mM MgCl2, and 1 mM DTT, pH 7.8)56. Reassembly of KaiC hexamers was induced by mixing the apo-KaiC monomer with the phospho-buffer containing ATP, and the resultant mixture containing 1 mM ATP was immediately subjected to UHPLC analysis for ATPase measurements.

Entrainment by external temperature cycles

Kai-protein oscillators were exposed to temperature cycles40 of one half of the period length at 40 °C and the other half at 30 °C using an automated sampling device57. For each of the AγBγCγ (pH 7.0) and ASeBSeCSe (pH 8.0) sets, the Kai-protein oscillators were reconstituted at 0, 6, and 12 h intervals and incubated at 30 °C for 36, 30, and 24 h, respectively. The pre-incubated samples were subjected to five temperature cycles of 18 h (9 h at 40 °C and 9 h at 30 °C: 9H9L), 20 h (10 h at 40 °C and 10 h at 30 °C: 10H10L), or 24 h (12 h at 40 °C and 12 h at 30 °C: 12H12L) to evaluate the entrainability.

pH and Q 10 values

The pH of the buffers listed in the main text and in Supplementary Information was measured at 25 °C during buffer preparation. The pKa value of Tris is slightly temperature-dependent58. We confirmed that Tris buffers with pH 7.0 when prepared at 25 °C change to pH 6.8 at 30 °C and pH 6.6 at 40 °C, and Tris buffers with pH 8.0 at 25 °C change to pH 7.8 at 30 °C and pH 7.6 at 40 °C, as actual measured values at the corresponding temperatures. Therefore, Q10 values calculated from data obtained at 30 °C and 40 °C using the corresponding Tris buffers could include additional pH-dependent effects, if any, of approximately –0.2 pH units.

In vivo rhythm assays

Synechococcus elongatus PCC 7942 was used as the background strain for this assay. To conduct the in vivo bioluminescence assay and analysis16, cyanobacterial cells carrying the kaiBC-reporter cassette were cultured at 30 °C for 3 days on BG-11 solid medium under constant light (LL) at 100 μmol m−2 s−1. After one or three 17-h light/17-h dark treatments, the cells were cultured at 30 °C under LL (45 μmol m−2 s−1) at an LED daylight lamp, and bioluminescence profiles were monitored using a photomultiplier tube detector.

X-ray structure analyses of KaiCβ, KaiCγ, and KaiCδ

Crystals of KaiCβ, KaiCγ, and KaiCδ were obtained using the sitting-drop vapor diffusion method with 5 mM AMP-PNP. KaiCβ (3 mg/mL) was mixed with a solution of 18 mM acetic acid and 65 mM sodium malonate (pH 5.7). The cryo-protectant treatment was performed using 30% (w/v) PEG4K. KaiCγ (3 mg/mL) was crystallized in a solution of 0.94 M sodium formate and 1.2 M sodium malonate (pH 8.5), which are both cryo-protectants. KaiCδ (3 mg/mL) was added to 0.1 M Tris-HCl (pH 7.0), 1 M KCl, and 1.14 M sodium malonate.

KaiCβ, KaiCγ, and KaiCδ crystals were frozen using liquid nitrogen and placed under the cryo-stream of the SPring-8 BL44XU beamline (Harima, Japan). The crystals were exposed to X-ray irradiation at a wavelength of 0.9 Å, and diffraction images were collected by using an EIGER X 16 M detector (Dectris) and the semi-automatic data reduction software, KAMO59. Diffraction intensity data were processed using XDS (version Jun 30, 2023)60. The initial models for KaiCβ, KaiCγ, and KaiCδ were obtained using the molecular replacement method with MOLREP (version 11.9.02)61 and the coordinates of the KaiCSe hexamer (PDB ID: 2GBL) as a template. Refinement was achieved using REFMAC5 (version 5.8.0415)62. The model was constructed by using COOT (version 0.9.8.7)63. PyMOL (Schrödinger) (version 2.5.2) was used for the graphic presentation of the refined models.

The KaiCβ, KaiCγ, and KaiCδ crystals had lattices of P21 (a = 93 Å, b = 386 Å, c = 108 Å, β = 113°; two hexamers in an asymmetric unit), P1 (a = 91 Å, b = 110 Å, c = 167 Å, α = 78°, β = 87°, γ = 82°; single hexamer in an asymmetric unit), and P21 (a = 92 Å, b = 175 Å, c = 95 Å, β = 97°; single hexamer in an asymmetric unit), respectively.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.