Introduction

Synovial joints are complex organs consisting of articular cartilage that minimizes joint friction, synovium that secretes lubricating fluid, and ligaments that stabilize the articulating joint. Injury or lifelong wear of joint tissues can lead to osteoarthritis, a degenerative joint disease that is a leading cause of disability1. Damaged articular cartilage and ligaments exhibit poor regenerative capacity in mammals2, and regeneration of functional hyaline cartilage with lubricating properties has remained an elusive target. Current disease-modifying strategies, such as surgical joint replacement and autologous connective tissue grafting, have limitations such as prosthetic joint loosening, high donor site morbidity, graft failure, and repair with fibrous tissue that has inferior biomechanical properties3,4,5. There is an urgent clinical need to develop more effective treatments for joint injury; however, the fundamental mechanisms driving ligament and articular cartilage cell fate specification and integration into functional tissues are still largely unknown.

Success in activating regenerative programs in mammalian joints has been largely limited to neonates6,7, and even the highly regenerative axolotl fails to repair a critical size joint defect8. However, the highly regenerative zebrafish possesses unique regenerative properties distinct from the mammalian fibrotic response, and the study of these differences has led to new tools for improving regenerative outcomes in mammalian cardiac injury models9. Zebrafish have over 70% genetic conservation with humans, including important similarities in the composition of their synovial joints, such as the presence of synovium and multilayered articular cartilage that secrete lubricating proteins10,11.

The pathways governing the development of human and zebrafish synovial joints are also highly conserved12. Joints develop from a pre-cartilaginous condensation of cells expressing sox9a, a transcription factor required for chondrogenesis13. Within the chondrogenic condensation, multiple signaling pathways synergize to inhibit chondrocyte maturation and form an interzone region which later cavitates and differentiates into mature joint tissues such as articular cartilage, synovium, and ligaments. The BMP ligand Gdf5 is highly but transiently expressed within the interzone14. Other genes that modulate zebrafish synovial joint development include trps1, a zinc finger transcriptional repressor that regulates chondrocyte development15, and nkx3-2, a transcription factor which specifies the jaw joint14 (reviewed in Smeeton et al., 201712). Outside of the interzone, cells within the remaining chondrogenic condensation mature into pre-hypertrophic chondrocytes (col2a1 + ;matn1 + ) and subsequently hypertrophic chondrocytes (col10a1a + ), which are eventually replaced by bone.

While adult zebrafish retain high capacity to regenerate individual skeletal tissues16,17,18,19, currently, there are no published adult models for the regeneration of a complex synovial joint with lubricating articular cartilage. Existing surgical injury models for articular cartilage regeneration in adult zebrafish only indirectly damage the cartilage, a major limitation as less than half of injured fish undergo subsequent joint degeneration17. Thus, we devised a novel whole-joint injury model to probe the zebrafish’s ability to rebuild multiple skeletal cell and tissue types de novo and reintegrate them into a functional joint after catastrophic injury.

Here we establish adult zebrafish whole-joint resection injury as a new model for understanding the coordinated regeneration of an integrated synovial joint organ. This surgery completely removes all pre-existing joint tissues, including articular cartilage, synovium, all intra- and extra-synovial ligaments, and some subchondral bone. Through histological analysis and 3D fluorescence imaging, we find that proliferation of endogenous mesenchymal cells leads to functional synovial joint regeneration by 2 months post-injury. Strikingly, we observe that regenerated articular cartilage expresses prg4b, the hallmark of superficial hyaline articular cartilage11,20. From single-cell RNA sequencing (scRNAseq) analysis of the first 70 days after resection injury, we describe the timeline and transcriptional signatures of five phases of joint regeneration. Through pseudotime analysis and lineage tracing, we identify a multipotent neural crest-derived population as a cell source for regenerated joint tissues. Taken together, our findings establish adult zebrafish whole-joint resection as a new model for de novo regeneration of a functional synovial joint with lubricating articular cartilage. This reductionist model can be used to uncover cellular and genetic targets for enhancing the regenerative abilities of human joints.

Results

Adult joint resection injury and functional regeneration

To test whether zebrafish have the capacity to regenerate multiple synovial joint cell types after removal of pre-existing mature joint tissues, we devised a new joint resection injury to the jaw joint. The resection surgery consists of three cuts to the anguloarticular, interopercular, and quadrate bones (Fig. 1a). Gross imaging immediately after surgery shows complete removal of the jaw joint, including the articular cartilage, synovial membrane, and interopercular-mandibular (IOM) ligament, from one side of the zebrafish’s jaw (Fig. 1b). Morphometric analysis at 0 days post-joint resection (dpjr) shows an initial surgical resection length of 1 mm generated by three independent surgeons, demonstrating the consistency of the surgery (Fig. 1c). Repeated gross measurements of individual fish until 56 dpjr showed no change in weight of resected vs uninjured (Supplementary Fig. 1a,b). Micro-computed tomography (microCT) visualization of the craniofacial skeleton confirmed that joint resection fully removes the articulation present in the uninjured jaw (Fig. 1d). By 28 dpjr, we observed bone growth in the form of protrusions from the cut sites (green arrowhead) and islands of bone (green asterisk). At 1 year post-joint resection (ypjr), the gap was largely bridged by new bone, and a structure resembling an articulating joint surface was present (green arrow). Histological analysis via Hematoxylin & Eosin staining (H&E) showed the regeneration of a joint structure capped with tissue resembling multilayered articular cartilage, in both morphology and location, by 56 dpjr (Fig. 1e). Both the uninjured and regenerated joints contained cavities lined with articular chondrocytes, including cells with the rounded morphology of deeper articular chondrocytes (black arrowhead) and cells with the flatter morphology of superficial articular chondrocytes (black arrow). Alcian Blue staining showed that the regenerated joint contained a robust articular cartilage matrix (Fig. 1f).

Fig. 1: Zebrafish whole-joint resection injury and regeneration.
figure 1

a, Schematic of the articulating bones at the zebrafish jaw joint that are cut to surgically remove all jaw joint tissues including the articular cartilage (blue) and the interopercular-mandibular (IOM) ligament (yellow). b, Reflected light images of the zebrafish jaw in ventral view show the zebrafish jaw joint (blue dashed line) and IOM ligament (yellow dashed line) completely removed after joint resection (cut sites indicated by white dashed line). c, Quantification of length of bone removed by 3 independent surgeons in millimeters (n = 4 per surgeon), one-way ANOVA, error bar represents SEM. d, Micro-computed tomography (microCT)-based skeletal reconstructions of the lateral view of the uninjured zebrafish jaw joint (n = 2) as well as the bones in the injury site at 0 dpjr (n = 13), 28 dpjr (n = 4), and 1 ypjr (n = 7). Blue insets show the articulating bones of the uninjured jaw joint, bone growth at 28 dpjr (green arrow and asterisk), and a regenerated joint-like structure (green arrow) at 1 ypjr. Magenta dashed lines indicate cut sites. Representative images are from different samples. e, H&E staining of an uninjured joint (n = 5) and a regenerated joint in ventral sections at 56 dpjr (n = 9). Insets show deep (black arrowhead) and superficial (black arrow) articular chondrocyte layers. f, Alcian Blue and Nuclear Fast Red staining of an uninjured joint (n = 3) and a regenerated joint in sagittal sections at 56 dpjr (n = 4). g, Quantification of premaxillary function over the time course of joint regeneration. Premaxillary function in each fish was scored as normal, nonfunctional, or abnormal. h, Immunofluorescence staining for MF20 (green) and Acetylated Tubulin (magenta) in sagittal sections of uninjured joint tissue (n = 4) and at 1 dpjr (n = 3) and 56 dpjr (n = 4). Blue dashed lines denote cut sites. White arrowhead indicates innervation of muscle. Scale bars, 200 µm (b), 2.5 mm (d), 200 µm (d insets), 100 µm (e), 25 µm (e insets), 50 µm (f), 300 µm (h). AA, anguloarticular bone; IOP, interopercular bone; Q, quadrate bone; RA, retroarticular bone. Source data are provided as a Source Data file.

Next, we sought to determine whether joint function was restored after resection. To do so, we quantified footage of jaw activity during swimming and feeding. Slow-motion analysis of jaw movements in individual live fish did not show any major disruption in lower jaw rotational movement following injury (n = 21), likely due to the intact jaw joint on the contralateral side driving lower jaw movement. However, qualitative scoring of jaw movement recordings showed that immediately post-surgery and out to 21 dpjr, premaxillary protrusion was disrupted (abnormal or nonfunctional) (n = 17/17) (Supplementary Fig. 1c). We observed restored craniofacial coordination by 56 dpjr in the form of normal premaxillary protrusion (n = 14/17) (Fig. 1g). Additionally, premaxilla retraction was disrupted following injury, resulting in larger intervals of time between consecutive suction feeding motions (n = 6; p = 0.0056), and was restored by 56 dpjr (Supplementary Fig. 1d).

To determine whether restored motion was active or passive, we characterized the recovery of muscles and nerves after resection. We performed immunofluorescence staining for sarcomeric myosin heavy chain (MF20) and Acetylated Tubulin on adjacent sections to visualize muscles and nerves, respectively (Fig. 1h). In uninjured fish, we observed muscle inserting into the quadrate bone and innervation of tissues surrounding the joint and ligament. At 1 dpjr, muscle and nerves were damaged and did not extend into the injured area. By 56 dpjr, muscle inserted into the new tissue and was densely innervated, as were other regenerated tissues. These data suggest that premaxillary motion recovered by 56 dpjr was likely physiologic. Overall, these observations point to the regeneration of a functional joint-like structure covered by multilayered articular cartilage following complete resection.

Epithelia and proliferating mesenchyme bridge the injury gap

After observing the regeneration of an articulating surface, we assessed the early injury response to whole-joint resection. Histological analysis showed that at 0 dpjr, the resected bone stubs were exposed (Fig. 2a). By 1 dpjr, the resection site was enclosed by epithelia directly covering the cut bones. At 3 dpjr, bands of mesenchymal cells were present under the cap of epithelial tissue at the anterior, posterior, and dorsal cut sites. By 7 dpjr, epithelial and mesenchymal tissues formed a bridge spanning the regenerating zone. Morphological hallmarks of cellular lineage commitment were visible by 14 dpjr, with chondrocytes and bony islands arising within the mesenchymal bridge (Fig. 2a’).

Fig. 2: Early epithelial and mesenchymal cell response to joint resection.
figure 2

a, H&E staining of early injury response in ventral sections at 0 dpjr (n = 5), 1 dpjr (n = 4), 3 dpjr (n = 4), 7 dpjr (n = 6), and 14 dpjr (n = 8). Black arrowhead denotes epithelium and black arrow denotes mesenchyme. a’, Inset shows regenerating chondrocytes (arrow) and bone (asterisk) at 14 dpjr. Black dashed lines indicate cut sites. b, Repeated live imaging of Sox10:Cre;actb2:loxP-BFP-STOP-loxP-DsRed showing dynamics of BFP+ epithelia at the anterior cut site at 1, 10, 20, 30, 40, and 50 minutes post-joint-resection (n = 3). White dashed lines indicate cut sites. White arrows indicate areas of epithelial migration. c, Representative images of PCNA immunofluorescence in sagittal sections of uninjured joint tissue and at 1 dpjr, 3 dpjr, and 7 dpjr. Region of interest for quantification is outlined in magenta dashed lines and resected bone stubs are outlined in solid blue lines. d, Quantification of percentage of PCNA-positive nuclei showing significant increase at 3 dpjr (n = 4 for Uninjured, n = 5 for 1 dpjr, 3 dpjr, and 7 dpjr), p = 0.0017 for Uninjured vs. 3 dpjr, p = 0.0024 for 1 dpjr vs. 3 dpjr, one-way ANOVA with Tukey’s multiple comparison test. e, Representative images of RNAscope smFISH for hmga1a (yellow) and tnfaip6 (magenta) in sagittal sections of uninjured joint tissue and at 3 dpjr. Resected bone stubs are outlined in solid green lines. Region of interest for quantification at 3 dpjr is outlined with light grey dashed line. f, Quantification of smFISH hmga1a and tnfaip6 transcript expression from uninjured and 3 dpjr (n = 3 per time point), p = 0.0386 for hmga1a and p = 0.0487 for tnfaip6, two-tailed unpaired t-test. g-j, Representative image of RNAscope smFISH for hmga1a (yellow), tnfaip6 (magenta), and mki67 (cyan) in sagittal sections of the anterior cut site at 1 dpjr (n = 3) (g), with insets illustrating expression in the periosteal region (h), stromal region (i), and bone-adjacent mesenchyme (j). d, f, Error bars represent SEM. Scale bars, 100 µm (a), 25 µm (a’), 250 µm (b), 300 µm (c, e, g), 20 µm (e inset, h-j). AA, anguloarticular bone; IOP, interopercular bone; Q, quadrate bone; RA, retroarticular bone. Source data are provided as a Source Data file.

Since we observed epithelia capping the resection site by 1 dpjr, we sought to characterize the dynamics of wound closure. We performed joint resections on Sox10:Cre;actb2:loxP-BFP-STOP-loxP-DsRed fish, in which blue fluorescence (BFP) is converted to red fluorescence (DsRed) in all neural crest lineage-derived cells21,22,23, allowing us to visualize epithelium in BFP. Live repeated imaging of intubated fish every 1-2 minutes showed that within an hour post-resection, the cut sites were enclosed by BFP+ epithelia (Fig. 2b, Supplementary Fig. 2a). Closure appeared to occur at least in part through the migration of epithelial cells, with epithelium continuing to migrate into the wound site after initial capping of the bone stub (Supplementary Movie 1).

To characterize the proliferative response of the injured and regenerating tissues, we performed immunofluorescence staining for Proliferating Cell Nuclear Antigen (PCNA). While we observed low levels of proliferation prior to injury or at 1 dpjr, there was a significant increase in PCNA+ cells by 3 dpjr (Fig. 2c,d). Proliferating cells were found surrounding the resected bone stubs (up to 650 µm from each resection site) and throughout the mesenchyme. By 7 dpjr, the proliferative response significantly decreased and was largely limited to epithelial cells and chondrocytes. Proliferating chondrocytes (PCNA + , Sox9a + ) were observed surrounding the resected bone stubs and in chondrocyte islands within the bridging mesenchyme (Supplementary Fig. 2b). Compared to articular chondrocytes in the uninjured joint, proliferation was significantly increased in regenerating chondrocytes at 7 dpjr (Supplementary Fig. 2c). Proliferation in the skin at 3 dpjr was mainly localized to injury-adjacent epithelium (IAE) surrounding the resected bone stubs, with few proliferating cells within the injury epithelium (IE) bridging the wound site (Supplementary Fig. 2d). By 7 dpjr, the IE had significantly higher PCNA+ cells compared to the 3 dpjr IE (Supplementary Fig. 2e), indicating that the cell cycle dynamics of the IE are distinct from that of the IAE. Overall, these data reveal that the identity of proliferating cell populations in the regenerating joint undergoes major shifts over time.

Next, we sought to further characterize the identity of proliferating mesenchymal cells. We performed single-molecule fluorescent in situ hybridization (smFISH) for hmga1a, which maintains adult stem cell pools and is required for zebrafish cardiac regeneration24,25, and tnfaip6, which labels a mesenchymal progenitor population in zebrafish fin regeneration26. While we found little to no expression in the uninjured joint, there was strong induction of hmga1a and tnfaip6 (Fig. 2e, f) within the highly proliferative mesenchymal domain (Fig. 2c,d) at 3 dpjr. At 1 dpjr (Fig. 2g), hmga1a + /tnfaip6+ cells were present within the periosteal (Fig. 2h), stromal (Fig. 2i), and bone-adjacent mesenchyme (Fig. 2j) compartments. These regions also had co-expression of mki67, which labels proliferating cells. These data suggest that proliferating hmga1a + /tnfaip6+ mesenchymal cells at 3 dpjr may arise from stromal and/or periosteal populations around the resected bone.

Mature joint tissue types regenerate following resection

To describe the timing and pattern of bone regrowth, we conducted repeated live imaging of the resection site using early osteoblast reporter line sp7:eGFP (Fig. 3a). Immediately after joint resection, there was a gap in GFP fluorescence consistent with the area of the resected bone. By 14 dpjr, we saw the re-emergence of GFP fluorescence in the form of both bony protrusions from the resected stubs and bony islands within the injury mesenchyme. The formation of bony islands independent of resected bony stumps indicate that whole-joint regeneration is a mosaic process anatomically. Bone regeneration occurred at least in part through endochondral ossification, evidenced by the presence of sox9a:GFP+ chondrocytes both in islands within the injury site and associated with the resected bone stubs (Supplementary Fig. 3a,b). Histological analyses at 14 dpjr showed mineralized tissue building on a cartilage template, further supporting the formation of new bone through endochondral ossification (Supplementary Fig. 3c,d).

Fig. 3: Regeneration of all mature joint tissues after joint resection.
figure 3

a, Repeated live imaging of the ventral view of sp7:eGFP expression before injury (n = 28), at 0 dpjr (n = 25), and at 14 dpjr (n = 19). Bony protrusions denoted by white arrowhead and bony islands indicated by white arrow. b-f, Confocal microscopy imaging of tissue-cleared thbs4a_p1:eGFP and scxa:mCherry expression in the uninjured joint (b) and uninjured ligament (c) (n = 4), at 0 dpjr (n = 3) (d), and in a regenerated joint (e) and regenerated ligament (f) at 56 dpjr (n = 7). Bone visualized with calcein blue staining. White asterisks mark autofluorescent blood vessels. g, Repeated live imaging of the ventral view of scxa:mCherry and sp7:eGFP expression before injury (n = 28), at 0 dpjr (n = 25), and at 57 dpjr (n = 8). h, prg4b (yellow) RNAscope smFISH in ventral sections of the uninjured jaw joint and at 8 dpjr, 14 dpjr, and 21 dpjr (n = 3 per time point). i, RNAscope smFISH for prg4b (yellow) and sox9a (magenta) in sagittal sections of uninjured articular cartilage and at 56 dpjr (n = 4 per timepoint). a,d,g, White dashed lines indicate cut sites. Scale bars, 1 mm (a,g), 200 µm (b-f), 100 µm (h), 20 µm (i). AA, anguloarticular bone; IOP, interopercular bone; Q, quadrate bone; RA, retroarticular bone.

We then sought to determine whether mature joint cell types regenerate via assessment of gene expression as well as cell and tissue morphology. Imaging uninjured jaw joints of thbs4a_p1:eGFP;scxa:mCherry transgenic fish with calcein-stained bones showed that articular chondrocytes lining the joint were thbs4a_p1:eGFP + ;scxa:mCherry- and show a rounded cell morphology (Fig. 3b), whereas ligamentocytes were thbs4a_p1:eGFP + ;scxa:mCherry+ with an elongated cell morphology (Fig. 3c). Whole-joint resection completely removed the articular cartilage and IOM ligament (Fig. 3d). At 56 dpjr, regenerated articular chondrocytes were thbs4a_p1:eGFP + ;scxa:mCherry- and lined a cavity between two regenerated bones (Fig. 3e). Regenerated ligamentocytes were thbs4a_p1:eGFP + ;scxa:mCherry + , had an elongated morphology, and bridged two regenerated bones (Fig. 3f). Live imaging of scxa:mCherry;sp7:eGFP double-transgenics confirmed new ligament tissue bundles connecting newly formed joint-associated jaw bones by 57 dpjr (Fig. 3g). Further imaging during earlier phases of regeneration indicated that thbs4a_p1:eGFP + ;scxa:mCherry+ ligaments and ligament entheses begin forming by 14 dpjr (Supplementary Fig. 3e). These data show that after removal of pre-existing mature joint tissues, zebrafish can re-establish ligamentocyte and articular chondrocyte cell fates.

To determine whether regenerated joints are true lubricated synovial joints, we performed smFISH for prg4b, which encodes for a lubricating glycoprotein secreted by articular chondrocytes and synoviocytes. Uninjured animals had robust prg4b expression in cells lining the joint cavity (Fig. 3h). At 8 dpjr, we observed minimal prg4b expression within the mesenchymal bridge. At 14 dpjr, prg4b was expressed by cells at the edges of a regenerating joint cavity. Interestingly, prg4b was robustly expressed along both surfaces of a new articulating joint structure by 21 dpjr. At 14 and 21 dpjr, at least one of the prg4b+ articular surfaces appeared to be lining a chondrocyte island that was not completely mineralized, suggesting that the regeneration of an articular cartilage surface and the ossification of the bones it covers may occur in parallel.

Analysis via microCT and subsequent combinatorial smFISH for prg4b and sox9a showed that similar to uninjured joints, articular cartilage lining the regenerated joint at 56 dpjr is multilayered, with sox9a + /prg4b+ superficial chondrocytes found at the surface of the joint and sox9a + /prg4b- chondrocytes located in the deeper zones (Fig. 3i, Supplementary Fig. 3f,g). Next, we performed smFISH for prg4b and sox9a on samples that scored abnormally on the premaxillary function test at 56 dpjr (Fig. 1g). Despite poor premaxillary function scoring relative to other fish, a sox9a + /prg4b+ joint cavity was observed in all tested samples (Supplementary Fig. 3h). This suggested that all injured fish exhibit some degree of anatomical regeneration and that cell fate restoration may precede functional recovery.

To further probe the dynamics of synovial cavity formation, we performed smFISH for prg4b in combination with sox9a at 14 dpjr (Supplementary Fig. 3i-i’). We observed that prg4b expression lined the interface of two sox9a+ chondrocyte islands, indicating that the synovial cavity forms between adjacent chondrocyte islands. The emergence of discontinuous sox9a:GFP+ chondrocyte islands during regeneration (Supplementary Fig. 3b) contrasts with development, where the jaw joint forms within a continuous field of chondroprogenitors11. Altogether, these data show that after complete removal of the joint, regenerated articular chondrocytes and synoviocytes express genes encoding lubricating proteins necessary for healthy joint function and movement.

ECM changes and activation of skeletal development programs

To define the cellular and transcriptional mechanisms driving whole-joint regeneration, we performed scRNAseq on live-sorted cells from uninjured joints and regenerated joint tissues at 6 timepoints post-resection (1, 3, 7, 14, 28, and 70 dpjr) (Fig. 4a). Unbiased clustering at a low resolution yielded 10 broad superclusters of cells (Fig. 4b). Skeletal cells were marked by either mesenchymal marker prrx1b (Skeletal 1) or osteoblast marker bglap (Skeletal 2) (Supplementary Fig. 4a, b). The Immune 1 supercluster included neutrophils (mpx + ) and macrophages (mpeg1.1 + ), while the Immune 2 supercluster primarily included lymphocytes, such as B cells and T cells (ikzf1 + ). Elevated numbers of neutrophils and macrophages at 1 dpjr indicated an influx of immune cells into the injury site as part of the wound healing response (Supplementary Fig. 5). Other superclusters included epithelial cells (epcam + ), endothelial cells (pecam1 + ), perivascular cells (myh11a + ), muscle cells (myod1 + ), neural cells (mpz + ), and odontoblasts (scpp1 + ).

Fig. 4: Transcriptional dynamics of skeletal cells throughout regeneration.
figure 4

a, Schematic detailing the generation of single-cell RNA sequencing (scRNAseq) dataset. Jaw joint tissue was collected from uninjured joints and regenerated joints at 1 dpjr, 3 dpjr, 7 dpjr, 14 dpjr, 28 dpjr, and 70 dpjr (n = 2-3 per time point). Samples were dissociated and sorted for all live cells prior to scRNAseq library generation. b, UMAP of low-resolution clusters (r = 0.02) for integrated samples. Superclusters include 2 skeletal clusters, 2 immune clusters, and individual epithelial, endothelial, perivascular, muscular, neural, and odontoblast clusters. c,d, UMAP of skeletal cells clustered at a high resolution (r = 0.95) and colored by cluster (c) or timepoint (d). e, Single-cell trajectory analysis of the osteoblasts, chondrocytes, chondrogenic fibroblasts, and regenerative mesenchyme clusters using Monocle3 colored by pseudotime (top) or cluster (bottom). f, Differential gene expression analysis plotted along the single-cell trajectory over time. g, HCR smFISH for fn1a (cyan), sox9a (magenta), and nkx3-2 (yellow) in sagittal sections of uninjured joints, at 3 dpjr, and at 7 dpjr. 3 dpjr inset displays sox9a expression along the edges of the resected quadrate bone, and 7 dpjr inset shows sox9a and nkx3-2 co-expression in regenerating chondrocyte islands. h, Quantification of smFISH fn1a, sox9a, and nkx3-2 transcript expression from uninjured, 3 dpjr, and 7 dpjr (n = 3 per timepoint for nkx3-2, n = 4 per timepoint for sox9a and fn1a); p = 0.0032 for fn1a Uninjured vs. 3 dpjr, p = 0.0002 for fn1a Uninjured vs. 7 dpjr, p = 0.0027 for sox9a Uninjured vs. 3 dpjr, p = 0.0003 for sox9a Uninjured vs. 7 dpjr, p = 0.0005 for nkx3-2 Uninjured vs. 3 dpjr, p < 0.0001 for nkx3-2 Uninjured vs. 7 dpjr, one-way ANOVA with Tukey’s multiple comparison test. i-o, RNAscope smFISH for gdf5 (green) and sox9a (magenta) transcripts in sagittal sections of the uninjured joint (i), uninjured Meckel’s cartilage (j), and at 7 dpjr (k) (n = 3 per time point). Quantification of gdf5 transcript expression from uninjured and 7 dpjr, p = 0.0412, one-way ANOVA with Tukey’s multiple comparison test. (l). Insets illustrate increased gdf5 transcript expression in the perichondrium surrounding Meckel’s cartilage (m), the regenerative mesenchyme (n), and in regenerating chondrocyte islands (o) at 7 dpjr. p, Schematic of the major events along the whole-joint regeneration time course. g,i-k, Regions of interest for quantification are outlined in light gray dashed lines. h,l, Error bars represent SEM. g,k, Resected bone stubs are outlined in solid white lines. Scale bars, 200 µm (g,i-k), 50 µm (g insets, m-o). AA, anguloarticular bone; IOP, interopercular bone; Q, quadrate bone; RA, retroarticular bone; MC, Meckel’s cartilage. Source data are provided as a Source Data file.

To elucidate the transcriptional dynamics of regenerating skeletal cells, we subsetted the Skeletal 1, Skeletal 2, and Perivascular clusters and re-clustered at a higher resolution to yield 21 clusters (Fig. 4c, Supplementary Figs. 4b, c, 6a, b, Supplementary Data 1). Within the skeletal population, we captured the major cell types present in uninjured joints: osteoblasts (bglap + ), chondrocytes (col2a1a + ), synoviocytes (prg4b + , col2a1a-), and ligamentocytes/tenocytes (scxa + , thbs4a + , tnmd + ) (Supplementary Data 2). In addition to the perivascular (myh11a + ) cluster, we identified a periosteal (mmel1 + ) cluster. Clustering at this resolution revealed 8 clusters of dermal fibroblasts (DF; pcdb1 + , pah + , qdpra + ) and 5 clusters of stromal cells (col5a3a + , coch + , and/or cxcl12a + ).

Additionally, we identified a cluster specific to early regeneration, comprised mainly of cells from 1 dpjr and 3 dpjr (82% of cells in this cluster) (Fig. 4d, Supplementary Fig. 6c). The cluster is defined by expression of proliferation (pcna + , mki67 + ) and injury-responsive mesenchyme (fn1a + , mdka + , hmga1a + ) markers (Supplementary Fig. 4b,c). We also noted a population of chondrogenic fibroblasts (71% from 14 dpjr and 28 dpjr) that co-expressed markers of fibroblasts (col1a1a + , col5a1 + ) and joint development programs (sox9a + , nkx3-2 + , trps1 + ) (Supplementary Fig. 4c). We then used Monocle3 to measure transcriptional changes in “pseudotime” in the early regenerative fibroblasts, chondrogenic fibroblasts, chondrocytes, and osteoblasts. Monocle3 identified regenerative mesenchyme as the earliest developmental stage and showed that the regenerative mesenchyme cells diverged into two differentiation trajectories, terminating in the osteoblast and chondrocyte clusters (Fig. 4e). Visualization of the trajectories showed the chondrogenic fibroblasts at an intermediate differentiation state between the regenerative mesenchyme and chondrocytes. Regenerative mesenchyme genes (fn1a, mdka, col12a1a) were highly expressed both at early and intermediate phases along the differentiation trajectories, while joint development genes (sox9a, nkx3-2, gdf5) and osteoblast markers (col10a1a, sp7, bglap) were most highly expressed in the chondrocyte and osteoblast differentiation trajectories, respectively (Fig. 4f, Supplementary Fig. 4c).

We then used Monocle3 to group genes with similar expression patterns along the trajectory into 70 modules (Supplementary Fig. 7). Modules specific to the osteoblast cluster included Module 21 (enriched in pre-osteoblast genes runx2a and spp1), Module 26, (enriched in osteoblast differentiation gene sp7), and Module 51 (enriched in mature osteoblast genes bglap and ifitm5). (Supplementary Fig. 8). Modules specific to the chondrocyte cluster included Module 8 (enriched in cartilage development genes sox9a and gdf5) and Module 32 (enriched in mature cartilage genes matn1 and col2a1a). Modules specific to the chondrogenic fibroblast cluster include Module 3 (enriched in joint cell fate markers trps1 and clu) and Module 1 (enriched in fibroblast markers col12a1a and col5a1). Among the modules specific to the regenerative mesenchyme cluster, we identified 3 modules (19, 29, and 38) associated with cell cycle re-entry and two modules (18 and 20) enriched for pro-regenerative fibroblast markers. Together, these gene module analyses revealed groups of co-regulated genes that were unique to various regenerative cell states and whose expression patterns shifted as mesenchymal cells differentiated into mature skeletal cell lineages.

To further probe the potential role of chondrogenic fibroblasts in whole-joint regeneration, we subsetted chondrogenic fibroblasts that plotted along the pseudotime differentiation trajectory from regenerative mesenchyme to chondrocytes and performed branch trajectory analysis (Supplementary Fig. 9a,b). We found that mesenchymal markers (prrx1b, mdka, col1a1a, col1a2) were more highly expressed at the beginning of the chondrogenic fibroblast trajectory, with expression waning as pseudotime progressed (Supplementary Fig. 9c). Joint development markers (trps1, nkx3-2, sox9a) and articular cartilage markers (clu, f13a1b, prg4b) were upregulated midway through the trajectory and had increased expression with later pseudotime. Markers of mature chondrocytes (ucmab, col2a1a, cnmd, acanb) had minimal expression in the first half of the trajectory but were significantly upregulated as pseudotime progressed. We then grouped differentially expressed genes along this path into modules, which revealed other co-regulated genes at distinct points of the trajectory (Supplementary Fig. 9d, Supplementary Data 3). Multiple growth factor signaling components grouped in modules specific to later pseudotime (tgfb1a in Module 3; fgfr2 and smad2 in Module 4; tgfbi in Module 7; fgfr1b and bmpr1bb in Module 8), indicating a potential role of TGF-Beta and FGF signaling in stimulating a shift to chondrogenesis in this fibroblast population. We also found taz associated with Module 5, which was specific to chondrogenic fibroblasts in early pseudotime, suggesting a potential role of YAP/TAZ signaling in regulating the developmental trajectory of this population. Overall, these differential gene expression analyses further indicate that the chondrogenic fibroblasts may represent a transitional progenitor population that have greater lineage commitment to the chondrogenic lineage compared to regenerative mesenchyme.

Given the lineage relationships we inferred between the fn1a+ early mesenchyme and sox9a + ;nkx3-2+ regenerating chondrocytes, next we assessed the spatiotemporal expression of these markers in vivo. At 3 dpjr, smFISH analyses show broad expression of fn1a throughout the regenerative mesenchyme, while the majority of sox9a and nkx3-2 transcripts were observed lining resected bone stubs (Fig. 4g). At 7 dpjr, fn1a + , sox9a + , nkx3-2+ mesenchymal cells were found within the regenerating zone, while chondrocyte islands were fn1a- and sox9a + ;nkx3-2 + . These data confirm the presence of injury-induced regenerative cell states transcriptionally similar to the chondrogenic fibroblasts found in the scRNAseq. Expression of fn1a, sox9a, and nkx3-2 was significantly increased at 3 and 7 dpjr compared to the uninjured joint (Fig. 4h). While fn1a was broadly expressed throughout the regenerative mesenchyme at 3 dpjr, its expression at 7 dpjr was maintained in mesenchymal cells but largely excluded from nascent chondrocyte islands. This transcriptional shift indicates a change in cell populations in the regenerating zone by 7 dpjr as injury-responsive mesenchymal cells begin differentiating into mature skeletal cell types.

Next, we assessed the expression of gdf5, a TGF-Beta superfamily member which we found to be associated with the cartilage trajectory in pseudotime analysis. While it is expressed in low levels in the adult joint (Fig. 4i) and in perichondrium around Meckel’s cartilage (Fig. 4j), gdf5 was significantly upregulated in the regenerating zone at 7 dpjr (Fig. 4k,l). Increased expression of gdf5 was observed in three populations of regenerating cells: in perichondrium around Meckel’s cartilage (Fig. 4m), at low levels in subsets of the regenerative mesenchyme (Fig. 4n), and at high levels in a subpopulation of sox9a+ chondrocytes (Fig. 4o). These data suggest that after whole-joint resection, re-establishment of mature joint tissues is preceded by injury-responsive mesenchymal cells undergoing a phase of broad ECM deposition that becomes spatially restricted and coincides with a redeployment of skeletal developmental programs by 7 dpjr. Taken together, our analyses demonstrate that whole-joint regeneration progresses through 5 main stages: wound closure, mesenchymal cell proliferation, cell lineage differentiation, tissue morphogenesis, and tissue remodeling (Fig. 4p).

Multipotent neural crest-lineage cells rebuild joint tissues

During development, zebrafish craniofacial skeletal structures are derived from the cranial neural crest. To test whether new joint tissues similarly arise from a neural crest-derived cell source during regeneration, we performed joint resections on Sox10:Cre;actb2:loxP-BFP-STOP-loxP-DsRed fish, in which neural crest lineage-derived cells are DsRed + . Prior to injury, the ligament and jaw joint cartilage were DsRed+ (Fig. 5a). Immediately after joint resection, no DsRed fluorescence was detected in the injury site. At 56 dpjr, DsRed expression was observed in regenerated bone, ligament, and cartilage. These data show that similar to development, joint tissues are derived from the neural crest lineage during regeneration.

Fig. 5: Neural crest-derived multipotent cells contribute to regenerated joint tissues.
figure 5

a, Lineage trace of DsRed-labeled neural crest lineage-derived cells using Sox10:Cre;actb2:loxP-BFP-STOP-loxP-DsRed (n = 3-5 per timepoint); bone visualized with calcein green staining. White arrow denotes regenerated joint surface and white arrowhead indicates regenerating chondrocytes. White dashed lines indicate cut sites. b, Simplex plots of archetype analysis colored by cluster and split by time point. c, UMAP of skeletal cells colored by most similar archetypes. d, Distribution of pseudotime of skeletal cell clusters across timepoints. The center of each box plot is marked by a horizontal line and represents the median. The bounds of the box are specified by the lower quartile (25th percentile) and the upper quartile (75th percentile). The whiskers extend from the box and denote the data points that fall within 1.5 times the interquartile range. e, Cell type composition of cells at 1 dpjr that have pseudotime between 0 and 2. f, Simplex plots of archetype analysis with osteoblasts, chondrocytes, chondrogenic fibroblasts, and regenerative mesenchyme colored by pseudotime. g, Single-clone lineage trace of regenerated Zebrabow ligament from Sample 1 (n = 5). Orange arrow denotes green ligamentocyte circled in the corresponding polar graph. h, Single-clone lineage trace of regenerated Zebrabow joint from Sample 1 (n = 5). Orange arrowhead denotes green chondrocyte circled in the corresponding polar graph. i, Polar graph of HS color distribution of regenerated cells from Sample 1. Orange circle indicates a green ligamentocyte and chondrocyte clone pair fewer than 10 units apart in color space. j, Histogram displaying the frequency distribution of distances between regenerated cells from Sample 1 in HS color space. Approximately 10% of cell pairs in Sample 1 are predicted clones. k, Single-clone lineage trace of regenerated joint tissues from Sample 2 (n = 5). Orange arrowhead denotes cyan chondrocyte circled in the corresponding polar graph. Orange arrow denotes cyan adipocyte circled in the corresponding polar graph. l, Polar graph of HS color distribution of regenerated cells from Sample 2. Orange circle indicates a cyan adipocyte and chondrocyte clone pair fewer than 10 units apart in color space. m, Histogram displaying the frequency distribution of distances between regenerated cells from Sample 2 in HS color space. Approximately 17.5% of cell pairs in Sample 2 are predicted clones. Scale bars, 250 µm (a), 125 µm (g,h,k). AA, anguloarticular bone; IOP, interopercular bone; Q, quadrate bone; RA, retroarticular bone.

Since our lineage tracing data revealed DsRed+ regenerated joint tissues of distinct cell types, we sought to delineate the degree of cellular plasticity in regenerating skeletal cells. To model plasticity, we applied archetype analysis (AA) to the scRNAseq dataset of zebrafish whole-joint regeneration. AA maps individual cells onto points within a simplex, offering insights into how cells balance multiple functions. The vertices of the simplex correspond to archetypes, which represent extreme gene expression profiles optimized for a functional task27,28,29,30,31. Where each cluster falls within this simplex is indicative of how specifically it fits a single task (specialists close to archetypes) or how generalized it is to complete several tasks (generalists near the center of the simplex or along a continuum between two or more vertices). In the context of regeneration, we assume “generalist cells” correspond to multipotent progenitor-like states, while “specialists” reflect cells committed to particular lineages or functions. We calculated the variance explained by varying numbers of k vertices (k = 1-30) (Supplementary Fig. 10a). As there was little biological variance explained by including additional archetypes, we chose a 6-vertex simplex to represent joint regeneration phenotypic space.

Next, we mapped the skeletal clusters from the scRNAseq dataset onto the simplex and performed Gene Ontology (GO) analysis to identify the genes and biological processes enriched at each vertex (Supplementary Figs. 10b,c, 11, 12, Supplementary Data 4). Throughout regeneration, mature skeletal cell populations were optimized for a single task (e.g., osteoblasts: ARCH3 (ossification and osteoblast differentiation) and chondrocytes: ARCH6 (cartilage development and positive regulation of developmental processes)), indicating a specialized function for each of these populations within the joint (Fig. 5b). Stromal and dermal fibroblast populations were also found among the specialists (i.e., DF 1: ARCH4 (L-phenylalanine metabolism); Stroma 3: ARCH2 (extracellular matrix organization)). In contrast, the regenerative mesenchyme mapped near the center of the simplex at all time points. Chondrogenic fibroblasts optimized a single task (ARCH5: cartilage development and negative regulation of developmental processes) in the uninjured and 70 dpjr samples. However, at 14 dpjr, chondrogenic fibroblasts plotted along a continuum between ARCH 5/ARCH 6 and between ARCH 5/ARCH 2, suggesting a degree of cell plasticity enabling them to shift between archetypes during joint regeneration. Highlighting cells classified as specialists on a UMAP of skeletal clusters further confirmed that none of the archetypes align with the regenerative mesenchyme cluster (Fig. 5c).

Additionally, we identified the cell type composition of cells close to each archetype and generated box plots of similarity scores for each cluster to each archetype (Supplementary Figs. 13 & 14). While the majority of mature skeletal clusters had a high similarity score corresponding to one archetype (e.g., chondrocytes with a similarity score of >0.87 to ARCH6), regenerative mesenchyme did not have a similarity score above 0.40 when compared to any archetype. The large distance of the regenerative mesenchyme cluster from archetype vertices suggested that these cells are a generalist population with the ability to perform several functional tasks.

We then assessed the pseudotime distribution of all skeletal cells and found that the population with the earliest pseudotime (0 to 2) in the 1 dpjr sample largely consisted of regenerative mesenchyme cells, signifying their undifferentiated, progenitor-like state (Fig. 5d,e). Plotting osteoblast and chondrocyte differentiation trajectories onto simplex maps showed that mature clusters with the latest pseudotime (osteoblasts and chondrocytes) were optimized at archetype vertices, while the cell population with the earliest pseudotime (regenerative mesenchyme) was found near the center of the simplex (Fig. 5f). These data shed light on cell plasticity within the regenerating joint and suggest that neural crest-derived regenerative mesenchyme may be a multipotent population with the capacity to differentiate into multiple mature joint cell lineages with specialized functions.

Next, we sought to identify gene signatures and growth factor pathways that may stimulate regenerative mesenchyme cells to enter distinct developmental paths. Thus, we subsetted cells that plotted along the pseudotime trajectory from the root to the branchpoint in regenerative mesenchyme diverging into two separate paths towards the osteoblast and chondrocyte lineages (Supplementary Fig. 15a,b). We wanted to identify genes that were differentially expressed in regenerative mesenchyme cells on the differentiation trajectories to chondrocytes vs. osteoblasts. Grouping these genes into modules revealed groups of co-regulated genes that may influence the cell fate decisions of regenerative mesenchyme (Supplementary Fig. 15c, Supplementary Data 5). Modules found prior to the branchpoint (i.e., Modules 2, 11, 26) were enriched in genes related to macromolecule biosynthesis (mrpl52, mrpl21, polr2d), cell cycle (pcna, mcm2, mcm5), and DNA repair (chek2, hpf1). Modules specific to the chondrogenic regenerative mesenchyme (i.e., Modules 3, 8, 39) were enriched in genes related to ECM organization (hapln1a, aqp3a, dcn, nid1b), and modules specific to the osteogenic regenerative mesenchyme (i.e., Modules 4 and 18) were enriched in genes related to ossification (runx2a, spp1, timp2b, enpp1).

We then focused on expression patterns of growth factor signaling components. Plotting individual genes suggested differential regulation of multiple growth factor pathways, such as TGF-Beta, FGF, and Wnt, between the chondrogenic and osteogenic branches of regenerative mesenchyme (Supplementary Fig. 15d). TGF-Beta superfamily members upregulated in osteogenic regenerative mesenchyme included bmp2a, bmp8a, and bmper, while bmp5, tgfbi, and BMP co-receptor scube3 were upregulated in chondrogenic regenerative mesenchyme. BMP antagonists aspn, grem2b, and chrdl2 were expressed in the chondrogenic trajectory. Another BMP antagonist, chrd, was expressed prior to and just after the branchpoint. Next, we assessed the Wnt signaling pathway. We found that wnt9a and fzd8a were expressed along the osteogenic arm of regenerative mesenchyme, while wnt5a was upregulated in chondrogenic regenerative mesenchyme. Wnt inhibitors frzb, dkk3b, sfrp1a, and sfrp1b were more highly expressed along the chondrogenic trajectory than the osteogenic trajectory. Analysis of the FGF pathway showed that fgfr4 was more highly expressed in osteogenic regenerative mesenchyme, while fgfr2 expression was higher at the branchpoint and in chondrogenic regenerative mesenchyme. Overall, these data show differential expression of TGF-Beta, FGF, and Wnt signaling pathway components in regenerative mesenchyme cells diverging towards chondrogenic or osteogenic lineages, suggesting that these growth factor pathways may play a role in stimulating regenerative mesenchyme to enter different developmental paths towards distinct skeletal cell fates.

To confirm in vivo whether neural crest-derived cells have multipotent potential during adult joint regeneration, we used inducible sox10:CreERT232,33 in combination with ubi:Zebrabow34. In the Zebrabow transgenic line, recombination results in permanent expression of either dTomato, mCerulean, or eYFP, enabling the tracking of both converted cells and their progeny. We treated sox10:CreERT2;ubi:Zebrabow fish with one dose of 5 μM 4-OHT from 24 to 36 hours post-fertilization and performed clonal analysis in adulthood. In each joint, we identified chondrocytes, ligamentocytes, and adipocytes based on cell location and morphology. To determine distance in hue-saturation (HS) color space between cells in each sample, we developed a custom application in MATLAB 2024a (Supplementary Fig. 16a-f). We assigned each cell a cell-type code (1: ligamentocyte, 2: adipocyte, 3: chondrocyte) (Supplementary Fig. 17a-c); then, mean hue (H) and saturation (S) for each cell were calculated in the app, and these HS values were plotted on a polar graph. In uninjured controls, all 3 cell types were found dispersed across the RGB spectrum (n = 5), (Supplementary Fig. 18a-j). Previously, it has been shown that the mean distance between clonally derived cells is <40 units, with the frequency distribution peaking at ~20 units apart in color space35,36. We employed a conservative binning approach of designating cell pairs between 0-10 color space units apart as predicted clones. Within this first bin of the histogram for each control sample, we identified cells of distinct lineages, highlighting the multipotent potential of the neural crest during development.

We then sought to discern whether these neural crest-derived tissues have multipotent potential during regeneration in adulthood. Single-clone lineage tracing analysis was performed on 56 dpjr regenerated samples (n = 5) (Fig. 5g-m, Supplementary Fig. 19a-f). Similarly to control samples, we found that regenerated joints had a high proportion of cells 0-10 units apart in color space. Next, we assessed whether predicted clone pairs in regenerated samples included cells of different lineages. We found that regenerated samples contained predicted clone pairs of distinct lineages that were fewer than 10 units apart (i.e., green ligamentocyte and chondrocyte, Fig. 5g-i). We then focused on samples with rare recombinant types, such as in the cyan color range. Within samples that contain cyan clones, we identified cyan clone pairs of different lineages (i.e., cyan chondrocyte and adipocyte, Fig. 5k,l). Additionally, we observed cells of the same lineage far apart in color space (i.e., >180 units), suggesting that multiple distinct clones contribute to each regenerated joint tissue type. As all pre-existing joint tissues had been removed by joint resection, these single-clone lineage tracing data suggest that neural crest-derived cells responding to whole-joint injury in the adult zebrafish are not lineage restricted. The multipotent potential of the regenerative mesenchyme suggests that it may consist of either a resident progenitor pool maintained in the adult joint or a population of neural crest-derived cells that have lost their lineage restrictions via dedifferentiation. Taken together, our computational and in vivo single-cell analyses indicate that a multipotent, neural crest-derived cell population in the zebrafish jaw joint has the ability to rebuild mature joint tissues in adulthood.

Discussion

Here we establish a model of adult de novo synovial joint regeneration. While previous studies have explored mechanisms driving regeneration of individual joint tissues16,17,18,19, single-tissue injuries do not provide a model for rebuilding joint tissues without a pre-existing source of the same cell type. Moreover, in the adult whole-limb regeneration models published to date, an ability to regenerate lubricating hyaline cartilage has not been described. While previous studies have induced the formation of a functional synovial joint via growth factor treatment, this success was constrained to neonates6. In this study, we show that after complete removal of the synovial jaw joint in the adult zebrafish, all native joint tissues regenerate. The regenerated articular cartilage expresses prg4b, which encodes a protein responsible for lubricating the joint. While other mammalian tissues such as skin, bone, and liver retain latent regenerative capacity, this is the first model to have shown resident cells in an adult synovial joint to have endogenous ability to repair prg4b+ tissue. While tissue patterning of the uninjured joint is not wholly recapitulated after complete resection, we used microCT reconstructions, testing of feeding coordination, visualization of muscle and nerve recovery, and imaging of compatible transgenics to show that regenerated joint tissues assemble into an integrated 3D structure that generates functional movement.

Whether regenerating tissues arise from a multipotent progenitor pool or via cellular dedifferentiation has long been a topic of debate. Many published models, such as zebrafish fin, ligament, and heart regeneration, as well as newt muscle regeneration, rely primarily on mechanisms of dedifferentiation18,37,38,39. Our resection surgery differs from these injury models in that we fully remove the articular cartilage, ligaments, and synovium that could serve as a source for rebuilding these tissues. Some dedifferentiation of osteoblasts in the remaining bone stubs may occur, as we observe bony protrusions as early as 3 dpjr. However, we also see new bone growth in the form of bony islands within the mesenchyme, indicating that regenerating osteoblasts may be derived from multiple sources. The de novo regeneration of joint tissues after complete resection suggests either the presence of a resident progenitor pool in the uninjured joint or the occurrence of non-lineage-restricted cell dedifferentiation. This differs from existing zebrafish regeneration models where dedifferentiation does not result in the acquisition of multipotency (e.g., fin regeneration)37. The archetype and clonality analyses we performed on regenerated cells in our model, however, support a multipotent neural crest-derived population giving rise to distinct synovial joint cell lineages. This observation parallels published work which found that axolotl limbs regenerate through dedifferentiation of connective tissue cells into a multipotent progenitor36. Since regenerated axolotl limbs are mesoderm-derived and regenerated zebrafish jaw joints are neural crest-derived, activation of the multipotent state observed in both models is likely independent of cells’ embryonic lineage. Though zebrafish synovial jaw joints and human synovial joints in the limbs are derived from different embryonic germ layers, they share many similarities in structure, cell type composition, and genetic mechanisms driving their development10,11,12. Thus, further characterizing the mechanisms driving the regenerative response in our zebrafish model may provide insight into what cells and genetic pathways should be stimulated to awaken a similar regenerative program in human joints.

An emerging theme in the field of skeletal biology is the importance of tissue cross-talk in maintaining joint homeostasis40,41. Our model enables the investigation of how cross-talk between different joint tissues (i.e., bone and articular cartilage) may facilitate the regeneration of the joint as an integrated organ. Our scRNAseq data also captured non-skeletal populations such as immune cells, neural cells, and endothelial cells. Immune cells in particular show major population shifts during early regeneration (Supplementary Fig. 5), and future work will probe the functional role of such non-skeletal cells in rebuilding the joint. We also showed that wound closure appears to occur at least in part through migration of epithelial cells over the resected bone stubs (Fig. 2b, Supplementary Movie 1). These data parallel previous findings showing that, similarly to adult mammalian wounds, closure of adult zebrafish wounds at least 1 mm large occurred via keratinocyte migration42. This is in contrast to the purse-string driven concentric closure promoting wound healing in mammalian embryos, zebrafish embryos, and smaller adult zebrafish43,44,45. The average resection length of 1 mm induced by our surgery (Fig. 1c) further supports epithelial cell migration contributing to wound closure in our model. We also observed that the domains of proliferating epithelial cells were dynamic (Supplementary Fig. 2d,e). Given the wound epithelium’s role in recruiting fibroblasts to form a blastema46, our model provides a new paradigm to investigate how epithelial-mesenchymal cell cross-talk may contribute to zebrafish whole-joint regeneration.

The regenerative program induced after our surgery differs from the fracture healing response of other zebrafish craniofacial skeletal regeneration models16. While lower jawbone healing in zebrafish relies on hybrid bone-cartilage cells derived from periosteum, we did not identify any clusters co-expressing osteoblast and chondrocyte markers in our scRNAseq data. This indicates that after joint resection, zebrafish skeletal tissues undergo a mechanism of healing unique from the response to lower jawbone resection or fracture healing. In mouse models of osteoarthritis, Sox9-expressing progenitors contribute to osteophyte formation rather than joint regeneration47. Hence, although mammalian joint cells respond to acute injury, they fail to correctly re-deploy developmental pathways. The chick embryo has been shown to recapitulate gene expression patterns characteristic of developing joints after injury48, but adult chickens have not been shown to have this ability. Additionally, the chicken embryo joint was only able to regenerate when the surrounding tissues were kept intact via a window excision model. In contrast, we have shown that even when all associated joint tissues are completely removed, adult zebrafish are able to engage resident skeletal cells to correctly activate developmental genes to rebuild multiple joint tissues. Our finding of sox9a upregulation lining the resected bone stubs is consistent with a model wherein the initial re-expression of joint development genes occurs in periosteal cells. Future lineage tracing experiments are required to determine whether the periosteum serves as a source for regenerating joint cells.

The activation of developmental mechanisms we observed in zebrafish whole-joint regeneration presents similarities to other vertebrate models of joint repair. For example, the re-expression of Sox9 was also observed in a neonatal mouse model of joint regeneration6. Similar to our model, regeneration of a joint in the mouse digit also involved the expression of Prg4, bone formation through endochondral ossification, and replenishment of cell populations through proliferation (though chondrocyte proliferation was not observed). A published model of embryonic chicken elbow joint regeneration also demonstrated induction of SOX-9 and GDF-5, though this was observed as early as 48 hours after surgical excision48. Developmental programs in our model, however, are not reactivated until 3-7 dpjr (Fig. 4p). These differences may be due to the stage at which each respective injury is conducted – the chicken elbow joint excision surgery was performed in embryos; therefore, it is likely that organisms retain greater capacity to recapitulate developmental events at this stage. The time course in our model is more similar to other adult joint regeneration models, such as zebrafish ligament regeneration, where ligament fate markers are reactivated by 7 days post-injury18. Determining the degree to which zebrafish whole-joint regeneration either redeploys developmental programs or activates regeneration-specific mechanisms can be valuable for optimizing therapeutic approaches for degenerative joint diseases. Unraveling the cues guiding resident skeletal cells to successfully regenerate joint tissues in response to resection injury in adult zebrafish lays the foundation for awakening a similar regenerative program in human joints.

Methods

Zebrafish Lines

All zebrafish experiments were approved by the Columbia University Institutional Animal Care and Use Committee. Published lines used in this study include AB, Tg(scxa:mCherry)fb30149, Tg(sp7:EGFP)b121250, Tg(thbs4a_p1:eGFP)el91218, Tg(actab2:loxP-BFP-STOP-loxP-DsRed)sd2751, Tg(sox9a:GFP)zc81tg52, Tg(Mmu.Sox10-Mmu.Fos:Cre)zf38421, Tg(sox10:CreERT2)el77732, and Zebrabow - Tg(ubb:LOX2272-LOXP-RFP-LOX2272-CFP-LOXP-YFP)a1334. Embryos were raised in a methylene blue salt solution at 28.5 °C, while juvenile and adult fish were maintained in groups of 15-30 animals. Joint resection surgeries were performed in size-matched male and female 3-9 mpf adult zebrafish using standard body length (SL) measurements.

Adult Joint Resection Surgery

Fish were anesthetized with Tricaine MS-222 at a concentration of 0.17 mg/mL and placed into a damp sponge ventral side up. 3 mm Vannas spring scissors (Fine Science Tools, cat. #1500000) were then used to make 3 cuts to remove the entirety of the quadrate-articular joint on the left side. The surgery also removed the retroarticular bone between the IOM ligament and anguloarticular bone. The first cut was performed anterior to the quadrate through the anguloarticular bone, and the second cut was performed posterior to the quadrate through the interopercular bone (Fig. 1a). Student Dumont #5 Forceps (Fine Science Tools, cat. #9115020) were used to grasp the joint tissue before making the third cut dorsal to the joint along the quadrate bone. The forceps were then used to remove the joint as well as any residual tissue or bone fragments in the resection site. Fish were then revived by circulating clean system water over the fish’s gills, housed on-system, and monitored for three days to ensure normal swimming and behavior post-surgery. For euthanasia at experimental endpoints, fish were placed into a lethal dose of Tricaine MS-222 for 30 minutes on ice.

MicroCT

For microCT, adult zebrafish were euthanized, fixed in 4% PFA overnight, and rinsed twice in 1x PBS. The scans were performed on the µCT system (MILabs, Netherlands) using a fixed 100 µm aluminum primary filter and a standard 400 µm aluminum secondary filter. The parameters were as follows: voltage of 20 kV, current of 0.07 mA, and exposure time of 38 ms, resulting in a 20 micron voxel volume. Raw data was reconstructed in 3D Slicer image computing platform.

Functional Testing

Functional testing was conducted by quantifying slow motion footage of jaw activity during swimming and feeding. All fish used for functional testing were wild-type ABs ~8-9 months old, with an equal number of males and females tested, and fish were selected to be a similar weight and length for their respective sex. Videos of the fish were collected one day before whole-joint resection, 3 hours after whole-joint resection, and then again for 1, 3, 7, 14, 21, 28, and 56 days after whole-joint resection, all at the same time of day. Footage was captured using a Canon EOS Rebel T6 DSLR Camera with an EF-S 18-55 mm lens. Videos were taken using a resolution of 1280×720 at 60fps, then imported to Microsoft Clipchamp software where they are slowed down to one tenth speed for analysis.

Histology

Adult zebrafish samples were prepared for histological analysis as previously described18. Briefly, following euthanasia and overnight fixation in 4% PFA/1xPBS at 4 °C, samples were washed twice for 30 minutes in 1xPBS. The head was then separated from the trunk for embedding of desired structures. Samples were decalcified in 0.5 M EDTA solution for 10 days rocking at room temperature. Prior to embedding, the tissue was dehydrated with a series of ethanol/DepC-treated water dilutions (DepC, 30% EtOH, 50% EtOH, 70% EtOH, 95% EtOH, 100% EtOH) followed by a second series of ethanol to Hemo-De Xylene Substitute dilutions (50% Hemo-De, 75% Hemo-De, 100% Hemo-De) (Electron Microscopy Sciences, 23412-01). The samples were then incubated in a 1:1 Hemo-De:Paraffin solution at 65 °C for an hour. After an overnight incubation in 100% paraffin at 65 °C, samples were embedded in freshly melted paraffin. Paraffin sectioning was performed using Thermo HM355S automatic microtome to collect 5 μm sections.

For hematoxylin & eosin (H&E) staining, 5 μm paraffin sections were deparaffinized with Hemo-De Xylene Substitute (Electron Microscopy Sciences, 23412-01) and rehydrated through an ethanol series to distilled water. Sections were then stained in hematoxylin (Sigma-Aldrich, HHS16) for 2 min followed by a brief rinse in acetic acid (Fisher Scientific, A38-212), 20x dip in water, 2 min in Scott Tap Water Bluing Reagent (Mercedes Scientific, 11160), 2 x 15x dips in water, and 2 x 15x dips in 95% ethanol. The sections were then stained in Eosin (Sigma-Aldrich, E6003) for 30 s followed by 3 ×1 min washes in 95% ethanol and 2 ×1 min washes in 100% ethanol. After 2 ×2 min in Hemo-De, samples were mounted with cytoseal (Fisher Scientific, 23-244256) for imaging.

For Safranin O / Fast Green staining, 5 μm paraffin sections were deparaffinized with Anatech Pro-Par Clearant Xylene Substitute (Rankin, 510), rehydrated through an ethanol series to 70% ethanol, and air dried. Sections were then stained in hematoxylin (Sigma-Aldrich, HHS16) for 7 minutes and washed gently in running tap water for 10 minutes. Samples were then stained with 0.08% Fast Green (Fisher Scientific, A1652006) for 3 minutes, rinsed in 1.0% acetic acid (Fisher Scientific, A38-212) for 10 seconds, and stained with 0.1% Safranin O (Millipore Sigma, TMS-009-C) for 5 minutes. After 1 dip in 0.5% acetic acid, samples were dipped in two changes of distilled water and then air dried. One dip in Pro-Par Clearant cleared the samples before mounting with cytoseal (Fisher Scientific, 23-244256) for imaging.

For Alcian Blue / Nuclear Fast Red staining, 5 μm paraffin sections were deparaffinized with Anatech Pro-Par Clearant Xylene Substitute (Rankin, 510) and rehydrated through an ethanol series to distilled water. Then, samples were placed in an alcian blue solution made of 2.5 g Alcian blue powder (Fisher Scientific, AC400460100), 7.5 mL acetic acid (Fisher Scientific, A38-212), and 242.5 mL distilled water overnight. Sections were rinsed in running tap water for 5 mins followed by a second quick rinse of distilled water. Samples were counterstained with Nuclear Fast Red Solution (Fisher Scientific, N01841G) for 5 mins, washed 2 ×1 min in distilled water, and dehydrated through an ethanol series to 100% ethanol. After 2 ×1 min in Pro-Par Clearant, samples were mounted with cytoseal (Fisher Scientific, 23-244256).

Tissue Clearing

CUBIC tissue clearing protocol was adapted from Susaki et al., 201453 and used for zebrafish as described in Anderson & Mo et al., 202318. Whole adult zebrafish were fixed using 4% PFA/PBS at 4 °C overnight. After samples were washed in PBS twice for 30 minutes each while rocking, heads were separated from the trunk of the body. At a ratio of 5-8 heads per 50 mL, samples were incubated in a CUBIC-1 solution in the dark at 37 °C with rocking for 3-4 days. Tissue was then transferred to fresh CUBIC-1 solution for another 2-4 days. After removing CUBIC-1, samples were washed twice in PBS for 30 minutes each at RT and transferred to a 20% sucrose/PBS solution for ~2 hours at RT (all washes with rocking). Then, the samples were transferred to CUBIC-2 solution in the dark at RT, with gentle rocking for 2-3 days before imaging. Once sufficiently cleared, heads were mounted in a chambered coverglass with a drop of CUBIC-2 (Fisher Scientific, 12-565-335) and imaged using a Leica SP8 confocal microscope.

Both CUBIC-1 and CUBIC-2 solutions were prepared at 100 °C while covered. CUBIC-1 was a mixture of 25 wt% urea, 25 wt% N,N,N’,N’-tetrakis(2-hydroxypropyl) ethylenediamine (Fisher Scientific, AAL16280AP) and 15 wt% Triton X-100. Deionized water was heated and added to N,N,N’,N’-Tetrakis(2-hydroxypropyl)ethylenediamine. After dissolving, urea was added to the solution and mixed until homogenous. Solution was then removed from heat, and after cooling to room temperature, Triton-X-100 was added at RT with gentle stirring.

CUBIC-2 was prepared as a mixture of 50 wt% sucrose, 25 wt% urea, 10 wt% 2,2′,2′’-nitrilotriethanol (Fisher Scientific, AAL044860E), and 0.1% (v/v) Triton X-100. Deionized water was heated and added to 2,2′,2′’-nitrilotriethanol. When the mixture was homogenous, urea was added. After the urea was sufficiently dissolved, sucrose was mixed into the solution. Once the solution was clear, it was removed from heat and allowed to cool to room temperature. Triton-X-100 was then added at RT with gentle stirring. Both CUBIC-1 and CUBIC-2 solutions were stored in the dark at RT after preparation and used within 1 week.

Calcein Blue Bone Staining

Calcein blue bone staining solution was prepared by adding Calcein blue powder (Millipore Sigma, C1429) to 1X PBS to a final concentration of 1 mg Calcein blue/2 mL 1X PBS. After fixation, samples were stained for 2-3 days at 4 °C in a Calcein blue solution, covered to minimize light exposure. Samples were washed twice in 1X PBS before proceeding to tissue clearing and imaging.

Intubation for Long-term Live Imaging

Intubation rig design was adapted from Castranova et al., 202254. On the day of imaging, a long-duration anesthetizing solution was created by dissolving Tricaine MS-222 into two liters of system water to a concentration of 126 mg/L. A water heater was added to the water and set to 29-30 °C to ensure a temperature of 26.5-28.5 °C in the water that reaches the fish. Once anesthetized with tricaine at a concentration of 0.17 mg/mL, fish were placed ventral side up on a damp sponge in a Petri dish. A peristaltic pump was utilized for intubation. Inflow and outflow tubes, consisting of food-safe silicone tubing with an outer diameter of 3/16” and an inner diameter of 1/8”, were connected to the rollers in opposite directions. The inflow tube provided anesthetizing water to the fish and was connected first to a pulse dampener to reduce the impact of the flow and ensure smooth imaging, before a second tube on the other end of the pulse dampener culminated in a narrower 1/8” outer diameter tube to place in the fish’s mouth for intubation. For the outflow tube, one end was placed within the Petri dish that held the fish and the other end was placed into the anesthesia water reservoir to remove excess water and complete the water cycle. To intubate the anesthetized fish, the peristaltic pump was set to 60 rpm. Using forceps, the narrow inflow tube was guided to the zebrafish’s mouth and gently inserted, intubating as shallow as possible while allowing the tubing to remain within the mouth. Once the inflow tube was inserted, the peristaltic pump was adjusted to 100 rpm to maintain anesthetization. Fish were then revived by circulating clean system water over the fish’s gills.

Immunofluorescence, HCR, and RNAscope

Immunofluorescence and smFISH experiments were performed on 5 μm formalin-fixed paraffin-embedded (FFPE) tissue sections. For immunofluorescence, sections were deparaffinized with 3 Hemo-De washes and rehydrated through an ethanol series (100%, 80%, 50%). Antigen retrieval was performed using sodium citrate buffer, and samples were blocked for 1 hour with GeneTex immunostain blocker solution (GTX73323). After an overnight 4 °C incubation with primary antibody (PCNA, 1:1000, Sigma-Aldrich, P8825; Sox9a, 1:200, GeneTex, GTX128370; Acetylated Tubulin, 1:200, MilliporeSigma, T7451; MF20, 1:20, DSHB, AB_2147781), samples were washed with 1X PBS-T and incubated with secondary antibody (1:500, Invitrogen, A-21245, A-11001) diluted in ImmunoStain blocker solution (GeneTex, GTX73323) and Hoechst 33342 nuclear stain (1:1000, Fisher Scientific, 51–17). Sections were mounted with Fluoromount-G (Southern Biotech, 0100-01).

Single-molecule fluorescence in situ hybridization was performed according to manufacturer guidelines using RNAScope Multiplex Fluorescent Reagent Kit v2 Assay (ACD Bio) or Hybridization Chain Reaction (HCR) RNA-Fish (Molecular Instruments), except that slides were covered in parafilm for the HCR overnight incubation step in probe solution. Target retrieval was performed for 4 min in a steamer (using 1x Target Retrieval Reagent for RNAscope and sodium citrate buffer for HCR).

RNAScope probes used were Dr-sox9a-C3 (543491-C3), Dr-gdf5-C2 (1255581-C2), Dr-prg4b-C2 (848211-C2), Dr-mki67-C1 (574951-C1), Dr-hmga1a-C3 (1732481-C3), Dr-tnfaip6-C2 (840371-C2) from ACD Bio. Opal 520, 570, and 690 fluorophores were used for visualizing expression (1:1000; Akoya Biosciences, cat. #FP1487001KT, #FP1488001KT, #FP1497001KT). HCR probes used were sox9a-B1 (0.4pmol, Molecular Instruments), nkx3-2-B3 (0.4pmol, Molecular Instruments), and fn1a-B5 (0.4pmol, IDT). The amplifiers used were B1-647, B3-546, and B5-488 (Molecular Instruments). Sections were mounted with Fluoromount-G + DAPI (Southern Biotech, 0100-02).

Drug Treatments

One treatment with 5 µM (Z)-4-Hydroxytamoxifen, ≥98% Z isomer (Sigma-Aldrich, H7904) was used to convert sox10:CreERT2 lines in the dark from 24 to 36 hpf. Following drug treatment, embryos were washed 2×15 min in methylene blue water and then screened for conversion using a Leica M165FC stereo microscope before being housed on-system.

Imaging

Images of histological sections were captured using a Zeiss AxioLab 5 Microscope with a Axiocam 208 color camera (Zen v3.9.101.01000).

Live imaging was performed using a Leica M165FC stereo microscope with Leica K5 sCMOS camera and LASX software v3.7.4.23463. Fluorescence imaging was performed using a Leica SP8 confocal microscope with LASX software v3.5.7.23225. All images were processed in Fiji55.

Single-cell RNA sequencing

Whole-joint resection (n = 18 joints for 1 dpjr, 10 joints for 3 dpjr, 7 dpjr, and 14 dpjr, 12 joints for 28 dpjr, and 13 joints for 70 dpjr) was performed on size-matched Sox10Cre;actb2:loxP-BFP-loxP-DsRed; flk1:GFP fish at 3-6 mpf. Regenerated jaw joint tissues from each injured fish and uninjured controls (32 joints) were microdissected and incubated in [0.25% trypsin (Life Technologies, 15090–046), 1 mM EDTA, and 400 mg/ml collagenase D (Sigma-Aldrich, 11088882001) in HBSS] for 1-1.5 hrs at 28 °C. Mechanical dissociation of joints was achieved by nutation and agitation with a P1000 pipette every 5 min for the duration of incubation period. Samples from timepoints with more calcified tissues (i.e., uninjured or 70 dpjr) were manually dissociated for longer than samples from timepoints with fewer calcified tissues (i.e., 7 dpjr). Efficiency of dissociation was visually scored such that different samples from distinct timepoints were dissociated to a similar extent. Regenerated joints are represented by two independent samples per time point (1 sample of pooled joints and 1 sample from a single animal), and uninjured joints are represented by 3 independent samples (1 sample of pooled joints and 2 samples from single animals). Live cells were sorted with FACS and immediately processed to create single-cell RNA Sequencing libraries using the Chromium Single Cell 3′ Library & Gel Bead Kit v2 (10X Genomics), following the manufacturer’s recommendations. Libraries were sequenced at the Columbia Single Cell Analysis Core on the NovaSeq 6000 (Illumina, San Diego, CA) with 101 bp sequencing for Read1, 10 bp sequencing for Index1, 101 bp sequencing for Read2, and 10 bp sequencing for Index2. Each sample was sequenced to a mean read depth of greater than 86,000 reads per cell. CellRanger v6.1.2 (10X Genomics) was used for alignment to GRCz11 with default parameters to generate cell-by-gene count matrices.

scRNAseq libraries were analyzed using R version 4.4.0 in RStudio with Seurat (v5.0.3.) CellRanger outputs for each sample were converted to Seurat objects and merged. Outputs were filtered for cells between 600 and 2,500 unique features per cell and less than 7% mitochondrial RNA detected per cell to remove low-quality cells and doublets. Next, data were processed with log normalization, selection of 2500 variable features, and scaled using all features expressed in 3 or more cells. Samples were then integrated with RPCA-based integration, and principal component analysis was used to perform linear dimensional reduction to compute 150 principal components. Jackstraw permutation identified all 150 principal components as significant (p < 0.05), and K-nearest-neighbor graph construction (KNN) was performed. After quality control filtering, a total of 63,305 high-quality cells with a median 600 genes per cell were obtained. Metadata for pooled and single-animal samples from the same injury time points were merged for downstream analysis. Broad superclusters were identified by performing clustering at a low resolution of 0.02. Clusters and gene expression plots were graphed using UMAP-non-linear dimensional reduction to maintain global variability.

To analyze scRNAseq data from skeletal cells, the ‘Skeletal 1’, ‘Skeletal 2’, and ‘Perivascular’ superclusters were subset and merged. Unprocessed gene expression data from these cells was re-processed with log-normalization, feature selection, and feature scaling. Principal components 1-150 were used for KNN and graphed using UMAP-non-linear dimensional reduction before clustering at a resolution of r = 0.9 (Supplementary Fig. 5a,b). Two clusters (cluster 18 and 22) expressed epithelial cell markers (epcam) and were excluded as a likely doublet cluster. Unprocessed data from cells in remaining clusters were then re-processed with log normalization, feature selection, and feature scaling. Principal components 1-150 were used for KNN and graphed with UMAP-non-linear dimensional reduction, and clustering was performed once more at a higher resolution of r = 0.95. Cluster markers were identified with function FindAllMarkers using a Wilcoxon Rank Sum test for features expressed in at least 10% of the cluster and a minimum of 0.2 log-fold-change greater expression than the other cells in the skeletal clusters.

Pseudotime Analysis

R package Monocle3 (v1.3.7) was used for pseudotime trajectory analysis. The pre-analyzed Seurat object containing all skeletal cells was converted to a monocle format. To limit bias, the root of the trajectory was specified programmatically rather than with manual selection. A helper function was used to group cells from the 1 dpjr sample according to which trajectory graph they are nearest to, calculate the fraction of cells at each node from the earliest time point, and designate the node that is most heavily occupied by early cells as the root (regenerative mesenchyme). The output was used to assess pseudotime distribution and cell type composition of 1 dpjr cells (Fig. 5d,e). To identify genes upregulated along osteoblast and chondrocyte trajectories, the osteoblast, chondrocyte, chondrogenic fibroblasts, and regenerative mesenchyme clusters were subset from the skeletal cell object and converted to a monocle format. Pre-processing, alignment, and UMAP dimensionality reduction were then performed. Cells were clustered using Monocle3’s algorithm and a principal graph within each partition was fit using the learn_graph function. The same helper function described above was then used to programmatically select regenerative mesenchyme as the root node. Cells were plotted and colored by both pseudotime and cluster (Fig. 4e). The output was used for further analyses, including finding genes that change as a function of pseudotime (Fig. 4f), finding modules of co-regulated genes (Supplementary Figs. 7 & 8), and branch trajectory analysis (Supplementary Figs. 9 & 15).

Quantification and Statistical Analysis

Proliferation was measured as the percentage PCNA+ cells relative to the total number of Hoechst+ cells within a region of interest (ROI) in the uninjured joint domain or regenerating joint injury domain between the resection sites. Additional ROIs were defined within the larger joint area, targeting specific regions such as the uninjured chondrocyte domain, regenerating chondrocyte domain, uninjured epithelium domain, injury epithelium domain, and injury-adjacent epithelium domain. ROIs were determined based on the regional nuclear morphology in the tissue section. Hoechst+ cells were quantified using FIJI’s StarDist nuclei detection and segmentation tool, while PCNA+ cells were manually scored by an observer blinded to both the time point and ROI identity.

Expression levels of fn1a, nkx3.2, sox9a, and gdf5 were quantified from smFISH images using ImageJ. Uninjured and regenerating joint domains were delineated based on regional nuclear morphology within the tissue section. Regenerating joint domains were specifically outlined adjacent to the cut sites to encompass the entire joint injury area, capturing bone stubs, regenerating chondrocyte islands, injury epithelium, and other regenerating tissues located between the cut sites. The corrected total fluorescence was calculated by determining the integrated density within each ROI for each channel. Tissue autofluorescence and background signal was accounted for by calculating the average mean gray value from 3 distinct background regions within each image. The corrected total fluorescence (CTF) was determined using the following formula: CTF = Integrated Density – (Area of ROI × Average Mean Gray Value).

Expression levels of hmga1a and tnfaip6 were quantified from smFISH images using ImageJ. Uninjured and regenerating joint domains were delineated based on regional nuclear morphology within the tissue section. Regenerating joint domains were specifically outlined adjacent to the cut sites to encompass the entire regenerating mesenchyme, while omitting bone stubs, regenerating chondrocyte islands, and epithelium. The normalized mean fluorescence intensity was calculated by determining the mean gray value within each ROI for each channel. Tissue autofluorescence and background signal was accounted for by calculating the average mean gray value from 3 distinct background regions within each image. The normalized mean fluorescence intensity was determined using the following formula: Normalized mean fluorescence intensity = Mean Gray Value – (Average Mean Gray Value of Background).

To quantify the output of regeneration after joint resection, slow motion footage of zebrafish suction feeding was analyzed and assigned a score based on premaxillary protrusion as a function of musculoskeletal coordination. A double-blind verification was used to confirm the assigned functional scores which maintained the observed trend. Additional recordings were taken of the time in between consecutive suction feeding actions, measured as the time in milliseconds between the closing of the mouth and the very beginning of the next opening movement. These values were normalized relative to the uninjured time between feeding actions for each individual zebrafish to attain a fold change in the time between consecutive feeding actions.

All statistical analyses were conducted using GraphPad Prism 10.3.0. Proliferation in the whole joint region and epithelial domains was evaluated using ordinary one-way ANOVA with Tukey’s multiple comparison test (F-value = 9.639, DFn = 3, DFd = 15 for whole joint region and F-value = 7.99, DFn = 4, DFd = 15 for epithelial region), while proliferation in the chondrocyte domains was assessed with a two-tailed unpaired t-test (t-value = 25.72, df = 6). Transcript expression of fn1a, nkx3-2, and sox9a across different post-injury timepoints was assessed using ordinary one-way ANOVA with Tukey’s multiple comparison test (F-value = 24.9, DFn = 2, DFd = 9 for fn1a; F-value = 61.9, DFn = 2, DFd = 6 for nkx3-2; and F-value = 22.4, DFn = 2, DFd = 9 for sox9a). The expression of gdf5 transcripts was evaluated with ordinary one-way ANOVA with Tukey’s multiple comparison test (F-value = 6.936, DFn = 2, DFd = 6). Transcript expression of hmga1a and tnfaip6 was assessed using 2-tailed unpaired t-test (t-value = 3.034, df = 4 for hmga1a and t-value = 2.802, df = 4 for tnfaip6). µCT resection length was assessed using a one-way ANOVA (F-value = 1.63, DFn = 2, DFd = 9). Restoration of premaxillary protrusion ability throughout joint regeneration was evaluated using a chi-square test. Comparisons of time intervals between consecutive suction feeding motions was evaluated using a one-way ANOVA (F-value = 5.066, DFn = 4, DFd = 25). Error bars in all statistical analyses represent standard error of mean (SEM).

Archetype Analysis

To perform archetype analysis (AA), the gene expression encodings of cells within the Perivascular, Skeletal 1, and Skeletal 2 superclusters, derived from the dimensionality reduction procedure outlined in the Single-Cell RNA Sequencing section, were utilized. Each cell’s expression encoding was represented by:

\(\widetilde{{x}_{i}}\in {{\mathbb{R}}}^{m}\), where m = 150 principal components (PCs) were selected.

Archetype was defined as a convex combination of the PC encodings \(\widetilde{{x}_{i}}\) in the dataset, as follows:

$${\,z}_{k}=\mathop{\sum }\limits_{j=1}^{N}{\beta }_{{kj}}{\widetilde{x}}_{j},{{\rm{with}}} \, {\beta }_{{kj}}\ge 0,\,\mathop{\sum }\limits_{j=1}^{N}{\beta }_{{kj}}=1$$

where N denoted the number of all cells in the scRNAseq dataset.

Each gene expression encoding \(\widetilde{{x}_{i}}\) was subsequently modeled as a convex combination of the archetypes:

$$\widetilde{{x}_{i}}\approx \mathop{\sum }\limits_{k=1}^{K}{\alpha }_{{ik}}{z}_{k},{{\rm{where}}} \, {\alpha }_{{ik}}\ge 0,\mathop{\sum }\limits_{k=1}^{K}{\alpha }_{{ik}}=1,$$
(2)

where αik corresponds to the coordinates of cell i with respect to archetype k, allowing the positioning within the simplex, and K is the total number of archetypes.

To find the archetype vectors zk and coefficients α,β the original data reconstruction error was minimized:

$${min }_{\left\{\alpha,\beta \right\}}{\mathop{\sum }\limits_{i=1}^{N}\left|\left|{\widetilde{x}}_{i}-\mathop{\sum }\limits_{k=1}^{K}{\alpha }_{{ik}}{z}_{k}\right|\right|}^{2},$$
(3)

subject to the constraints:

$${\alpha }_{{ik}}\ge 0,\mathop{\sum }\limits_{k=1}^{K}{\alpha }_{{ik}}=1,{{\rm{for\; all}}} \, i=1,\ldots,N.$$
$${\beta }_{{kj}}\ge 0,\mathop{\sum }\limits_{j=1}^{N}{\beta }_{{kj}}=1,{{\rm{for\; all}}} \, k=1,\ldots,K.$$

The Seurat object, containing the PC encodings, was converted to a Scanpy object using the R package MuDataSeurat (v0.0.1), enabling further analysis in Python. To optimize the resulting objective function28, the Projected Gradient Descent algorithm56, as implemented in the scikit-learn-compatible Python package archetypes, was utilized with default parameters. The analysis used a forked version of the package (October 19, 2024) available at https://github.com/wangzcl/archetypes.git (original repository at https://github.com/aleixalcacer/archetypes). This package provides the α coefficients, referred to as similarity scores, which quantified each data point’s association with each archetype. The scores ranged from 0 to 1, where 0 indicated maximal distance from an archetype, and 1 signified perfect alignment.

Gene Ontology Enrichment at Archetype Locations

Raw gene expression counts were normalized using the NormalizeData function from the Seurat package with the default parameter normalization.method = ‘LogNormalize’. For each archetype, cells were classified into three groups based on the value of similarity scores. The classification criteria were as follows: close cells with a similarity score greater than 0.9, medium cells with a similarity score between 0.8 and 0.9, and distant cells with a similarity score less than 0.8. Using the Seurat function FindMarkers, differentially expressed genes between close and distant cells were identified with a minimum expression threshold of 0.1 (min.pct = 0.1) and a log-fold change threshold of 0.2 (logfc.threshold = 0.2). The top 150 genes, ranked by average log-fold change and with the adjusted p-value significance less than 0.001, were selected for Gene Ontology (GO) Enrichment Analysis. The GO analysis was performed with the following parameters: ontology ont = ‘BP’ (Biological Process), adjustment method pAdjustMethod = ‘BH’ (Benjamini-Hochberg correction), pvalueCutoff = 0.01, and qvalueCutoff = 0.05.

Zebrabow Clonal Lineage Tracing

For clonal analysis using the ubi:Zebrabow lines, images were captured on a Leica SP8 confocal microscope with LASX software v3.5.7.23225 and processed with Lightning. Using representative slices throughout the Z-stack, cells were categorized into cell type (1: ligamentocyte, 2: adipocyte, 3: chondrocyte) based on cell location and morphology (Supplementary Fig. 17a-c). Cell-type codes were manually assigned using the Point Tool in Fiji. Confocal images were then analyzed using a custom application developed in MATLAB 2024a. The input to the application was a set of image pairs consisting of the raw, unlabeled confocal image (Supplementary Fig. 16a) and its corresponding cell-coded image (Supplementary Fig. 16b). For each pair, the difference between the two images was calculated to provide a ‘cell label’ only image (Supplementary Fig. 16c). This was then scanned to sequentially identify labels indicating cell markers and cell-type code, providing the location of the cell and its cell type, respectively (Supplementary Fig. 16d). RGB values were extracted from the raw images of the cells based on the position delineated by each cell marker and averaged across these pixels to provide a mean hue and saturation value for a given cell (Supplementary Fig. 16e). The number associated with that marker was then recognized to identify cell type. The corresponding cell hue and saturation were then plotted in a polar plot as angle and radius, respectively (Supplementary Fig. 16f). The polar coordinates were used to calculate distance in color space for every potential pairwise comparison of individual cells, allowing for determination of cell lineage, using the standardized equation:

$$D=\sqrt{{r}_{1}^{2}+{r}_{2}^{2}-2{r}_{1}{r}_{2}\cos ({\theta }_{1}-{\theta }_{2})}$$
(4)

where D is distance, r is radius, and θ is the angle in radians. This application allows for tabulation of cells, quantification of distances, and polar plot visualization with interactive functionality that allows users to click on points in the graph to determine that corresponding cell information and visualize its position in the image.

To determine the distance in color space between known clones, the uninjured contralateral side of the joint was imaged. Hue and saturation were recorded for cells with similar color, location, and morphology in these control samples, as these cells were assumed to be clones. The distance in color space between known clones was calculated. Using this value of expected variance in color space for clonally related cells, cells in regenerated joints were then determined to be either predicted as clones or non-clones. Five contralateral uninjured control and five regenerated samples were included in the analysis.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.