Abstract
Canine lymphoma, a phenotypically and genetically heterogeneous disease, represents a significant proportion of canine cancers. We present a large-scale study of 238 dogs with lymphoma to better understand the genetic landscape of canine lymphoma, as well as the relationship to clinical outcomes. Using a targeted next-generation sequencing panel comprising 308 genes, we screened somatic and germline mutations in matched tumor and normal samples. Our findings revealed key associations between genetic alterations and lymphoma subtypes, with certain somatic variants linked to significant differences in response to common chemotherapy regimens. Recurrent mutations in genes such as KMT2C, KMT2D, NOTCH2, TRAF3, CCND1, ARID1A, CREBBP, and TP53 were observed, with TRAF3 mutations standing out for their significant association with prolonged progression-free survival and overall survival in B-cell lymphomas. In contrast, mutations in PIK3CD and CREBBP were associated with inferior outcomes in T-cell lymphomas, highlighting the immunophenotype-specific impact of genetic alterations on treatment responses. These findings support the integration of comprehensive genomic profiling in planning treatment strategies and optimizing clinical outcomes in canine lymphomas.
Similar content being viewed by others
Introduction
Canine lymphoma shows striking similarities to human non-Hodgkin lymphoma1 and is a disease marked by its phenotypic, clinical, and genetic heterogeneity. This neoplastic condition constitutes a significant part of canine cancers, representing 6% of all cancers and 90% of hematopoietic malignancies2,3. Given the known roles of certain cancer-related genes in human oncology, we hypothesized that somatic mutations found in genes of functionally relevant pathways—such as DNA damage response, immune signaling, or cell cycle regulation—may correlate with clinical outcomes in dogs with lymphoma.
Genetic mutations play a fundamental role in cancer biology, influencing tumorigenicity, disease progression, and clinical responses to chemotherapeutic drugs. A recent study has identified potential prognostic biomarkers across canine cancers, noting that somatic mutations in genes with established roles in human cancer biology, such as CCND1, SMARCB1, FANCG, CDKN2B, MSH6, and CCND3, correlate with shorter progression-free survival (PFS) in pan-cancer studies of canine patients4. Previous studies in canine lymphoma subtypes have predominantly focused on B-cell lymphoma (BCL), identifying recurrent somatic mutations in TRAF3 and MAP3K14, as well as associations between mutations in POT1, MYC, and TP53 with shorter PFS and overall survival (OS) in dogs with diffuse large B-cell lymphoma (DLBCL)5. However, the clinical relevance and consequences of genetic alterations in T-cell lymphoma (TCL) remain largely unexplored, underscoring the need for deeper investigation.
Human oncology has benefited from the development of gene panels for targeted therapy and prognostication. Gene panels employing targeted next-generation sequencing (tNGS) technology, notably the Oncomine panel and the MSK-IMPACT panel, have received FDA approval for both in pan-cancer and cancer type-specific contexts6,7. Genetic screenings in canine oncology are also increasing, with targeted mutation analysis in genes such as C-KIT and BRAF increasingly guiding clinical decisions. However, the routine use of comprehensive cancer hotspot panels in dogs, akin to those in human oncology, remains unfulfilled8,9. There is a pressing need for a systematic approach to integrate genomic information into everyday veterinary clinical practice.
Integration of NGS technology into clinical practice faces unique challenges, particularly in veterinary oncology. A crucial aspect of this integration involves establishing robust relationships between genotypes of neoplastic conditions and phenotypic characteristics. There is also variability in treatment regimens across different veterinary practices, contributing to a scarcity of data linking genetic mutations to specific treatment outcomes. This study aims to address these gaps through a large-scale analysis of 238 dogs with lymphoma. Each case includes matched tumor and normal samples, which have been screened using a tNGS panel comprising 308 genes pertinent to cancer biology and pharmacogenomics curated from the COSMIC database10 and literature11,12,13. We investigate both somatic and germline single-nucleotide variants (SNVs) and insertion-deletion (INDEL) mutations, comparing mutational profiles and immune subtypes against treatment outcomes.
Results
Study population and characteristics
The cohort of 238 dogs that met the inclusion criteria for this retrospective study was selected from a pool of 4256 client-owned dogs treated for lymphoma between 2017 and 2023. A summary of the patient demographics, immunophenotypic distribution, and the observed survival characteristics for first-line treatment can be found in Table 1. The incidence rates of BCL and TCL observed in this study reflect those of the general canine population14. BCL cases predominantly involved medium (69/153, 45%) and large-cell lymphoma (72/153, 47%). Within TCL cases, the CD4+ T-helper subset was the most prevalent (60/83, 72%). The most represented breed in the study cohort was Labrador Retriever (30/238, 13%), followed by Boxer (26/238, 11%) and Golden Retriever (25/238, 11%). Among the remaining 158 dogs, 79% were purebred, representing 49 different breeds, while 12% were of mixed or unknown breed (Supplementary Table 1). This study documented 433 responses to various regimens consisting of one to five drugs (Supplementary Fig. 1). Monotherapy regimens were the most frequently observed treatment group, accounting for 41%, followed by CHOP and COP-like regimens (38%), MOPP-based regimens (9%), double-agent regimens (6%), and LOPP-based regimens (4%).
Mutational landscape of canine lymphoma
We evaluated the most common somatic mutations observed in relation to patient signalment, treatment status (treatment-naive or relapsed), stage, and immunophenotype (Fig. 1). Mutation profiles did not differ significantly for patient sex (P > 0.4), age (P > 0.9), or stage of the disease (P > 0.9). However, we did note several mutated genes associated significantly with certain immunophenotypes (Supplementary Fig. 2). In BCLs, TRAF3 ranked as the third most frequently mutated gene, showing a higher mutation frequency in CD21+ DLBCL and a lower occurrence in CD4+ TCL (P < 0.001). FBXW7 and POT1 were also significantly associated with DLBCLs (P < 0.05), aligning with previous studies15,16,17,18. We also observed several novel mutational patterns across BCLs that have not been previously reported. In DLBCL, SETD2 mutations were significantly more frequent compared to the rest of the cohort (P < 0.001). Mutations in MYC and TBL1XR1 were observed more frequently in BCLs than TCLs, but were not specific to cell size. MEF2C mutations were predominantly found in medium-sized BCLs (P < 0.05). TCLs were characterized by frequently mutated SATB1, a transcriptional repressor mainly expressed in thymocytes, exhibiting a mutation rate of 23% (P < 0.001) as observed in previous studies19,20. The serine and/or threonine protein kinase genes—MTOR and PIK3CD—showed significantly higher mutation rates in CD4+ TCL (P < 0.001 and <0.05, respectively).
In our analysis of somatic mutations across canine lymphoma immunophenotypes, several genes emerged as recurrently mutated. Notably, the chromatin remodeling genes KMT2C and KMT2D exhibited the highest mutation rates, detected in 54% and 50% of cases, respectively (Supplementary Fig. 2). These rates align with findings in certain human hematopoietic cancers, i.e., acute myeloid leukemia (AML) and B-cell lymphomas21. Other key epigenetic modifiers, namely ARID1A and CREBBP, showed significant mutation rates, 29% and 26%, respectively. SETD2, identified as a putative oncogenic driver in canine cancers, was mutated in 17% of all cases and in 26% of BCL cases, aligning with previous reports5,22,23. We also identified several genes with recurrent mutations that, to the best of our knowledge, have not been previously reported in canine cancers. BCL11B, a protein-coding gene implicated as a driver in human T-cell acute lymphoblastic leukemia (T-ALL)24, was the third most frequently mutated gene in our analysis, present in 46% of the cohort. The fourth most mutated gene (45%) in our cohort was NOTCH2, which is recognized for its role in human T-cell leukemia25.
Missense mutations constituted the majority of the somatic mutations observed across our study. Amongst the top mutated genes, SETD2, TP53, and TRAF3 had the highest frequencies of nonsense mutations: 17%, 13%, and 25%, respectively. Notably, the frequency of nonsense mutations in TP53 closely aligns with the rate reported in a previous study of canine DLBCL4. KMT2D and KMT2C exhibited a high incidence of missense mutations, accounting for 64% and 72% of the mutation category in these genes, respectively. This trend was also observed in other highly mutated genes, including TP53, SETD2, FBXW7, and TRAF3. Interestingly, SETD2 and TRAF3 had a higher proportion of frameshift mutations than the other top mutated genes: 17% and 25% respectively (Supplementary Fig. 3). Co-occurrence was the most frequently observed between KMT2C and NOTCH2 (28%), followed by KMT2D and KMT2C (27%), KMT2C and BCL11B (24%), KMT2D and BCL11B (23%), and KMT2D and NOTCH2 (23%) (Fig. 2). In terms of odds ratio (OR), co-occurrence was the most significant between TRAF and SETD2 (P < 0.001; OR = 5.7), TRAF3 and POT1 (P < 0.01; OR = 4.7), TRAF3 and TBL1XR1 (P < 0.05; OR = 6.4), TP53 and SETD2 (P < 0.05, OR = 4.2) and POT1 and FBXW7 (P < 0.05; OR = 6.6).
Influence of TRAF3 mutation on clinical outcomes
Given the high frequency of somatic mutations in TRAF3 across all immunophenotypes in our study, we assessed the relationship between TRAF3 mutation status and the clinical outcomes of the patients. The distribution of nonsense and frameshift mutations in TRAF3 spanned multiple domains, with the most prevalent nonsense mutation being c.1267C > T, which leads to a truncation of the protein at Arg 423. This mutation, identified in 13 patients, is located at the N-terminal end of the meprin and TRAF homology (MATH) domain, which is crucial for the trimerization of TRAF family proteins26. We compared the clinical outcomes of the patients with loss-of-function (LOF) mutations localized to this domain, relative to those patients designated as wild-type (WT) for TRAF3 (Fig. 3). In terms of hazards ratio (HR), Putative LOF mutations in TRAF3 were significantly associated with extended PFS (HR = 0.32, 95% CI of 0.12–0.91) and longer OS (HR = 0.49, 95% CI of 0.24–0.99), and a trend towards a longer duration of response (P = 0.052). In contrast, we did not observe a significant association with time to response (TTR). In contrast, neither somatic mutations rated as low, modifier impact or germline truncating variants in TRAF3 were significantly associated with any clinical outcomes (Supplementary Fig. 4).
Immunophenotype-specific mutations relevant to treatment outcomes
PFS following first-line treatments differed significantly between the two main immunophenotypes, BCLs and TCLs. Moreover, in dogs treated with CHOP, both PFS and OS varied significantly between these two primary immunophenotypes. The median PFS and OS following CHOP treatment were 139 days (95% CI of 112–211) and 290 days (95% CI of 188–488), respectively. BCLs had a superior median PFS of 170 days (95% CI: 127–203) compared to 92 days (95% CI of 84–142) observed for TCLs. These results are consistent with the treatment outcomes previously reported in a retrospective analysis of CHOP chemotherapy27,28.
Given the influence of immunophenotype on survival, we analyzed the influence of the somatic mutations for the BCLs and TCLs separately. Among BCLs, we identified four mutated genes associated significantly with response to CHOP and Tanovea-based regimens (Fig. 4). Specifically, mutated TRAF3 and KMT2D were significantly associated with longer PFS (HR = 0.21, 95% CI of 0.07–0.58; HR = 0.37, 95% CI of 0.14–0.98) while mutated FBXW7 was associated with a significantly extended OS (HR = 0.26, 95% CI of 0.08–0.88) in the patients receiving CHOP therapy. Mutated CCND1 was associated with a notably shorter PFS (HR = 3.6, 95% CI of 1.1–11.4). Among TCLs, we also identified four mutated genes related to response to specific treatment regimens (Fig. 4). Somatically mutated PIK3CD and CREBBP were associated with significantly shorter PFS (HR = 4.2, 95% CI of 1.5–11.7; HR = 4.9, 95% CI of 1.6–14.7) in the patients treated with CHOP or lomustine-containing regimens. Conversely, the patients with mutated KMT2C or NOTCH2 achieved significantly longer PFS following the CHOP treatment protocols (HR = 0.24, 95% CI of 0.06–0.92; HR = 0.09, 95% CI of 0.02–0.43).
The influence of TRAF3, PIK3CD, and CREBBP mutations on survival remained significant during multivariate analysis of each gene, wherein the traditional prognostic factors—immunophenotype, stage, and disease status (naïve vs relapsed)—were included for comparison (Table 2). Immunophenotype was a significant covariate in every per-gene Cox PH model constructed for PFS, except for TRAF3. Treatment status (naïve vs relapsed) was a significant covariate for OS in every model. In addition to the above eight genes, we discovered that the somatic mutation status of TP53 was significantly associated with shorter OS following the first-line therapies.
Discussion
Our study represents one of the largest tNGS-based studies of canine lymphoma to date, involving an analysis of the clinical, genetic, and phenotypic data of 238 dogs with spontaneously occurring lymphoma for 308 genes. We discovered several significant associations between immunophenotypic subtypes, treatment outcomes, and specific gene alterations, with some showing promise as potential therapeutic targets. We identified eight genes—TRAF3, FBXW7, KMT2D, CCND1, PIK3CD, NOTCH2, KMT2C, and CREBBP—in which somatic mutations may serve as potential novel biomarkers of clinical response. By focusing our univariate survival analysis specifically on somatic mutations within distinct immunophenotypes, we sought to identify driver mutations rather than incidental passenger mutations. This methodological choice also allowed for a more controlled analysis, accounting for the variability in survival outcomes attributable to differences in immunophenotypes.
TRAF3, which emerged as one of the top somatically mutated genes in BCLs in our study, has been recognized as a potential driver of B-cell lymphomagenesis in both canine and human studies5,7. As a known negative regulator of the non-canonical NF-κB pathway, TRAF3 is a putative tumor suppressor in the realm of human and translational research29,30. We also observed that somatic LOF mutations in TRAF3 are almost exclusively seen in BCLs and positively associated with longer OS and PFS. These findings suggest that TRAF3-mutated BCLs have an increased sensitivity to conventional chemotherapeutics. In contrast, the function of NF-kB signaling affects only the late-stage differentiation of CD4+ T-cells31. Interestingly, in human DLBCL models, deletion of the TRAF3 gene induces reliance of cancer cells on non-canonical NF-κB signaling for survival32. This pathway dependency may render cells more vulnerable to disruption by DNA alkylators such as doxorubicin and cyclophosphamide, both components of CHOP therapy. If this mechanism is conserved in canine lymphoma, it may also highlight components of the non-canonical NF-κB pathway as potential therapeutic targets.
PIK3CD and CREBBP emerged as potential biomarkers for inferior outcomes in TCLs, specifically associated with shorter PFS in CHOP-like treatments and CCNU-containing treatments, respectively. PIK3CD encodes an isoform of the phosphoinositide 3-kinase (PI3K) enzyme, an integral component of the PI3K/AKT/mTOR pathway. Notably, novel therapeutics are under development to target components of this pathway in human oncology33,34. PIK3CD promotes the activation of c-MYC and p-AKT, both of which drive cell proliferation and survival35. Mutations in PIK3CD may therefore lead to constitutive survival signaling, which in turn may reduce the efficacy of DNA-damaging agents such as doxorubicin and cyclophosphamide—both key components of CHOP—by promoting cell survival and impairing apoptosis in response to chemotherapy. PIK3CA has been previously linked to shorter PFS in a canine pan-cancer study36. PIK3CA mutations are also associated with inferior prognosis in triple-negative breast cancer37, and PIK3CD mutations have been implicated as potential indicators of poor prognosis in breast carcinoma. On the other hand, CREBBP encodes a histone acetyltransferase that functions as a transcriptional co-activator and chromatin modifier. LOF mutations in CREBBP can impair histone acetylation, leading to a more condensed chromatin state and transcriptional repression of tumor suppressor genes, contributing to lymphomagenesis38,39. The efficacy of alkylating chemotherapeutics such as lomustine can be diminished as tumors with CREBBP loss may have enhanced capacity to maintain chromatin compaction or evade apoptosis despite DNA damage. Histone deacetylase inhibitors (HDACi) have been shown to sensitize tumor cells to DNA-damaging agents by disrupting chromatin structure and impairing access to DNA repair machinery40. In this regard, CREBBP-deficient TCLs may be less responsive to lomustine-based therapies unless paired with an epigenetic sensitizer.
Other putative prognostic biomarkers in TCLs identified in our study include NOTCH2 and KMT2C. NOTCH2 is implicated in T-cell lineage commitment, while KMT2C plays a role in epigenetic regulation. However, the impact of these genes on cancer prognosis remains less understood and is a subject for further investigation. Interestingly, somatic mutations in both KMT2D and KMT2C were associated with longer PFS in BCLs and TCLs, respectively. KMT2 family genes have been cited as some of the most highly mutated genes across several large-scale sequencing studies in human hematopoietic cancers41. While the roles of these epigenetic regulators in oncogenesis are well established, the association of their mutational status with prognosis remains inconsistent. LOF mutations in KMT2C and KMT2D have been associated with improved outcomes in pancreatic ductal adenocarcinoma, whereas in mantle cell lymphoma and non-small cell lung cancer, KMT2D mutations have been linked to poorer survival42,43. These discrepancies may reflect differences in tissue-specific enhancer programs or the differentiation state of the tumor, factors that influence whether KMT2D loss disrupts oncogenic or tumor-suppressive transcriptional programs34. Further research is needed to understand the role of these genes in canine lymphoma, although our findings point to a shared potential of KMT2 genes as predictors of positive outcome.
Other epigenetic regulators found to have significant associations in our cohort included SETD2, more frequently mutated in DLBCLs, and SATB1, frequently mutated in TCLs. Somatic SETD2 mutations have been recurrently reported in canine DLBCLs and, in some studies, associated with shorter PFS20. In humans, SETD2 encodes a histone methyltransferase responsible for H3K36 trimethylation, a mechanism critical for recruiting double-stranded DNA break repair machinery. Loss of function can lead to increased mutational burden and genomic instability, and is theorized to contribute to tumorigenesis, particularly in B-cell lymphocytic leukemias44. However, the prognostic relevance of SETD2 mutations remains inconsistent across studies in canine lymphoma, and in our cohort, they were not significantly associated with PFS45. Further research is needed to clarify whether SETD2 functions similarly in canine lymphoma. SATB1, a known chromatin organizer, plays a key role in T-cell differentiation and has been implicated in the pathogenesis of human cutaneous T-cell lymphoma. Its frequent mutation in our TCL cohort may reflect dysregulation of these pathways46.
Our analysis of 238 canine samples was broadly representative of the immunophenotypic distribution observed in clinical practice, with 64% of dogs diagnosed with BCL and 31% with non-TZL TCL, compared to 71% and 20%, respectively, in a U.S. study of canine nodal lymphomas47. Among our TCL cases, 60 of 83 were classified as CD4+, consistent with the 72% CD4+ prevalence reported for non-TZL TCLs in the same study. As a result, the prognostic utility of the genotypes identified here is largely reflective of this subtype, which is associated with significantly longer OS than its CD8+ or double-negative counterparts. While these findings are valuable, they also highlight the limitations imposed by the phenotypic heterogeneity of TCLs. The significance of the biomarkers reported here may not be generalizable to less common T-cell subtypes. There is a significant need for research to evaluate genetic biomarkers within a broader spectrum of TCL subtypes in dogs. A future study in an expanded cohort of TCLs, including CD8+, double-positive, double-negative, and T-zone lymphoma, could provide further insight into the mutational landscape and prognostic value of genetic biomarkers found within these subtypes. In addition, breed-specific predispositions to lymphoma subtypes have been reported, but our univariate analyses were not stratified by breed to maintain sufficient case numbers per treatment group. Validation in a breed-controlled cohort would help assess the generalizability of our findings.
Our study focused on gene-level associations with immunophenotype and clinical outcome to offer greater statistical power and interpretability of the results. A follow-up study in a larger cohort would enable variant-level association testing and improve the ability to detect and validate statistically significant hotspot mutations. This, in turn, could guide the design of functional studies to better understand the mechanisms underlying both sensitivity and resistance to chemotherapy in canine lymphoma. Functional validation of the putative loss-of-function mutations identified in this study, particularly in TRAF3, would help confirm causality and clarify their therapeutic relevance. This could include in vitro gene knockout experiments to assess sensitivity to CHOP agents, or ex vivo drug sensitivity testing using patient-derived cells with and without the reported TRAF3 variants.
Given that driver mutations are typically somatic, matched germline samples were included for variant calling. However, the use of whole blood as normal tissue in a systemic malignancy such as multicentric lymphoma presents limitations. To reduce the likelihood of false-positive somatic calls, we applied a conservative allele frequency threshold of >0.05 in the matched blood samples. Thus, it is possible that true somatic variants present at low frequencies in blood due to the presence of circulating tumor cells may have been excluded. Incorporating a panel of normal samples from healthy dogs and application of a lower allele frequency threshold would improve the robustness of somatic variant detection. Our analysis did not include an assessment of copy number alterations (CNAs), which are known to contribute to the pathogenesis of several canine cancers. Future studies validating our initial findings in whole-exome or whole-genome sequencing data would also be better suited to assess CNAs in canine lymphoma.
In conclusion, our findings provide significant insights into the mutational landscape of canine lymphoma. This research, encompassing 238 dogs, reveals critical correlations between genetic mutations and clinical outcomes in various chemotherapeutic regimens. The identification of recurrent mutations in genes such as KMT2C, KMT2D, NOTCH2, TRAF3, CCND1, ARID1A, CREBBP, and TP53, with their varied implications in both BCLs and TCLs, highlights the potential of genomic profiling to inform future treatment strategies. While additional validation is needed to support clinical implementation, these findings represent an important step toward realizing precision medicine in veterinary oncology.
Methods
Patient selection and immunophenotyping
The patient selection criteria were defined as follows. Dogs diagnosed with lymphoma by cytology, confirmed by a board-certified veterinary pathologist, and treated at veterinary oncology clinics between 2017 and 2023 were included in the study (N = 4256). From this population, a subgroup of 248 dogs met the inclusion criteria for the study: (1) Availability of matching tumor and normal specimens; (2) both immunophenotyping and ex vivo drug sensitivity data generated; (3) documented clinical response to at least one well-established chemotherapy regimen within a month following biopsy sampling. Ten dogs were excluded from analysis due to poor sequencing quality, leaving a cohort of 238 dogs for analysis. All specimens were collected under informed consent, and the experiments were approved by the internal review board (IMVLSA1223.18), as well as the ethics committee of the participating veterinary hospitals.
To group the cohort into clinically relevant immunophenotype subtypes, immunophenotyping was conducted using a combination of flow cytometry (FC) and PCR for antigen receptor rearrangement (PARR) as previously described48. Briefly, BCLs were categorized by the expression of the mature B-cell marker CD21, while TCLs were identified by the expression of mature T-cell markers CD3, CD4, CD5, and CD8. Lineage confirmation of the samples was achieved through PARR. FC analysis further divided the cohort into nine subtypes. BCLs were classified by relative cell size—small, medium, or large (DLBCL). TCLs were further subtyped based on surface antigen expression: helper-like CD4+ T-cells (CD3+/CD4+/CD8-), CD8+ cytotoxic-like T-cells (CD3+/CD4-/CD8+), CD45- T-zone lymphoma (TZL), and cases with atypical patterns of antigen expression. These atypical presentations included uncommon phenotypes such as double-negative (CD3+/CD4-/CD8-), double-positive (CD3+/CD4+/CD8+), and aberrant phenotypes lacking CD3 expression (CD3-/CD4+/CD8- and CD3-/CD4-/CD8-/CD5+). Due to the indolent disease progression of TZL in canine lymphoma, this subtype was separated from the TCLs during survival analysis.
Gene selection and tNGS library preparation
Fine needle aspirates (FNAs) of affected lymph nodes were processed by ImpriMed, Inc. (Mountain View, CA, USA) as previously described43. Matching whole blood samples, representing normal patient cells, were also provided with the tumor biopsy. Genomic DNA was extracted from FNA and whole blood cells, sheared to a median insert size of 300 bp and prepared using the Enzymatic Fragmentation v2.0 NGS library preparation kit (Twist Bioscience, CA, USA), and barcoded using the Unique Dual Index plates (A-D). The 308-gene panel used in this study was curated based on relevance to lymphoma biology, drug response, and mutational recurrence in both canine and human cancers. Gene inclusion was guided by prior reports in canine lymphoma, entries from the COSMIC database, literature linking genes to chemoresistance or disease progression, and membership in known oncogenic pathways10. For genes identified in human studies, known canine orthologs were included when available, using annotations from COSMIC and Ensembl. The capture and enrichment of targeted genes was achieved using the Fast Hybridization kit and a custom probe panel of 308 genes spanning 0.98 Mbp, spread over 4667 regions. Whole genome and post-capture libraries were quantified by the Qubit Flex Fluorometer (Life Technologies, CA, USA). Quality was assessed using the Tapestation 6800 (Agilent, CA, USA). Target-enriched libraries were sequenced on either the Illumina HiSeq 4000 or Illumina NovaSeq 6000 system with 2 × 150 bp paired-end reads.
Gene alignment and quality control
All targeted sequencing reads were passed through FastQC (v0.11.9) before and after adapter trimming to obtain baseline sequencing quality metrics, including quality scores, GC content, and adapter content. Adapter sequences and low-quality bases were trimmed from the reads using BBDuk (v38.95). Trimmed reads were aligned to the CanFam 3.1 reference genome, sourced from the Ensembl database with BWA-MEM (v0.7.17). Aligned reads were preprocessed for variant calling by harmonization between matched tumor and normal samples using Samtools (v1.9) and deduplication using Picard MarkDuplicates (v2.22.3). Qualimap (v2.2.1) was used to conduct quality control analysis on the preprocessed bam files.
Variant calling and annotation
Both somatic and germline variants were called with VarDict (v4.3), using the pre-built CanFam3.1.86 database with default parameters as previously described49. Briefly, variants with lower than 5× read coverage, mean base quality <22.5, and a variant allele frequency <0.05 were excluded. False positives were excluded by filtering out variants with mean mismatches in reads ≥5.25. To account for the potential presence of a neoplastic cell fraction in peripheral blood, somatic calls were excluded if its harmonized sequence in the normal sample had an allele frequency >0.05. Called mutations included SNVs and small INDELs. Functional variant annotation was performed using a combination of SnpEff and SnpSift (v4.3)50,51. SnpEff was used for primary annotation of a variant’s IMPACT ratings, protein amino acid level alterations, and associated transcript Ensembl IDs and Uniprot accession numbers. SnpSift was used to obtain Sorting Intolerant from Tolerant scores for missense variants. Annotated Variant Call Format (VCF) files were used for all downstream analysis. RStudio (v4.3.0) was used to extract mutational features from VCF files for survival analysis and subtype correlation, using the VariantAnnotation, GenomicRanges, and Rsamtools packages52,53,54. Deleterious mutations were filtered based on their IMPACT rating as defined by the Ensembl Variant Effect Predictor. Mutations with moderate and high impact, as determined by sequence ontology, were included in the survival analysis and for comparing mutation frequencies between groups defined by baseline clinical and molecular characteristics. For univariate survival analysis focusing on putative LOF mutations, frameshift variants, stop lost, start lost, stop gained, and missense variants with a SIFT score <0.05 were included.
Clinical data collection and curation
Longitudinal clinical data were generated based on the medical charts provided by the participating veterinarians. A contracted medical professional, well-versed in clinical annotation, was responsible for the collation of these records. Date and treatment response were documented for every clinical visit before and after the biopsy sampling date. Treatment responses were categorized into progressive disease, stable disease, partial response (PR), or complete response. Additionally, dates of euthanasia or cessation of treatment were recorded. Time to initiation of and clinical response to treatment was noted relative to the sampling date. Using the collated data, we computed PFS and OS according to VCOG v1.0 guidelines, as well as duration of response (DOR) and TTR for each patient across the treatment regimens they received55,56. PFS was defined as the time interval from the initiation of treatment to the first observation of disease progression. OS was determined as the time from the start of treatment to the date of death or last follow-up. DOR was defined as the time interval from the first instance of a positive response (PR or better) to the first incidence of disease progression, while TTR was defined as the time from the start of treatment to the first instance of positive response.
Chemotherapy regimens were categorized into CHOP, MOPP, LOPP, other combinations, and single or double agents by a previously described methodology57. CHOP-based treatments were defined as at least one single cycle of cyclophosphamide, vinca alkaloids, antitumor antibiotics (doxorubicin or mitoxantrone), and corticosteroids, with or without L-asparaginase induction. COP protocols excluded antitumor antibiotics and were included in the CHOP treatment category unless otherwise specified. MOPP treatments were defined as one cycle of mechlorethamine, vinca alkaloids, procarbazine, and corticosteroids, while LOPP treatments included CCNU in lieu of mechlorethamine with or without L-asparaginase induction. Single and double-agent therapies used one or two drugs, respectively, with or without corticosteroids. Protocols not fitting these classifications were grouped as other combinations.
Statistical and survival analysis
Baseline clinical and molecular characteristics were compared using the chi-square test or Fisher’s exact test for categorical variables, and a two-sample t-test or Mann–Whitney U test for continuous variables. Co-occurrence and mutual exclusivity of variants were analyzed using odds ratios, with the significance of the relationships evaluated by Fisher’s exact test or the chi-square test. P-values were corrected using Bonferroni method when multiple tests were indicated. OS and PFS were estimated using the Kaplan–Meier method. Hazard ratios and their 95% CIs were estimated using Cox proportional hazards models. The impact of somatic mutations on PFS, OS, DOR, and TTR was analyzed using both multivariate and univariate models. Non-parametric log-rank tests were applied to compare survival outcomes.
Data availability
The datasets generated and/or analyzed during the current study are not publicly available due to privacy and security concerns, but are available from the corresponding author upon reasonable request.
References
Marconato, L. et al. The dog as a possible animal model for human non-Hodgkin lymphoma: a review. Hematol. Oncol. 31, 1–9 (2013).
Seelig, D. M. et al. The comparative diagnostic features of canine and human lymphoma. Vet. Sci. 3, 11 (2016).
Vail, D. M. & MacEwen, E. G. Spontaneously occurring tumors of companion animals as models for human cancer. Cancer Invest. 18, 781–792 (2000).
Chon, E. et al. Novel genomic prognostic biomarkers for dogs with cancer. J. Vet. Intern. Med. 37, 2410–2421 (2023).
Giannuzzi, D. et al. The genomic landscape of canine diffuse large B-cell lymphoma identifies distinct subtypes with clinical and therapeutic implications. Lab. Anim. 51, 191–202 (2022).
Nagahashi, M. et al. Next generation sequencing-based gene panel tests for the management of solid tumors. Cancer Sci. 110, 6–15 (2019).
Zehir, A. et al. Mutational landscape of metastatic cancer revealed from prospective clinical sequencing of 10,000 patients. Nat. Med. 23, 703–713 (2017).
Rossman, P. et al. Phase I/II trial of vemurafenib in dogs with naturally occurring, BRAF-mutated urothelial carcinoma. Mol. Cancer Ther. 20, 2177–2188 (2021).
McNiel, E. A. et al. Microsatellite instability in canine mammary gland tumors. J. Vet. Intern. Med. 21, 1034–1040 (2007).
Tate, J. G. et al. COSMIC: the catalogue of somatic mutations in cancer. Nucleic Acids Res. 8, 941–947 (2019).
Plassais, J. et al. Whole genome sequencing of canids reveals genomic regions under selection and variants influencing morphology. Nat. Commun. 10, 1489 (2019).
Lindblad-Toh, K. et al. Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature 438, 803–819 (2005).
Wong, A. K. et al. A comprehensive linkage map of the dog genome. Genetics 184, 595–605 (2010).
Thalheim, L. et al. Lymphoma immunophenotype of dogs determined by immunohistochemistry, flow cytometry, and polymerase chain reaction for antigen receptor rearrangements. J. Vet. Intern. Med. 27, 1509–1516 (2013).
Bushell, K. R. et al. Genetic inactivation of TRAF3 in canine and human B-cell lymphoma. Blood 125, 999–1005 (2015).
Elvers, I. Exome sequencing of lymphomas from three dog breeds reveals somatic mutation patterns reflecting genetic background. Genome Res. 25, 1634–1645 (2015).
Fanelli, A. POT1 mutations are frequent and associated with Ki-67 index in canine diffuse large B-cell lymphoma. Front. Vet. Sci. 9, 968807 (2022).
Smith, P. A. D. et al. The prevalence and characterisation of TRAF3 and POT1 mutations in canine B-cell lymphoma. Vet. J. 266, 105575 (2020).
Harris, L. J. et al. Canine CD4+ T-cell lymphoma identified by flow cytometry exhibits a consistent histomorphology and gene expression profile. Vet. Comp. Oncol. 17, 253–264 (2019).
Fanelli, A. et al. The K9 lymphoma assay allows a genetic subgrouping of canine lymphomas with improved risk classification. Sci. Rep. 14, 18687 (2024).
Xu, J. et al. KMT2D deficiency promotes myeloid leukemias which is vulnerable to ribosome biogenesis inhibition. Adv. Sci. 10, 2206098 (2023).
Krysta, M. C. et al. Shared and distinct genetic features in human and canine B-cell lymphomas. Blood Adv. 6, 3404–3409 (2022).
Sakthikumar, S. et al. SETD2 is recurrently mutated in whole-exome sequenced canine osteosarcoma. Cancer Res. 78, 3421–3431 (2018).
Gutierrez, A. et al. The BCL11B tumor suppressor is mutated across the major molecular subtypes of T-cell acute lymphoblastic leukemia. Blood 118, 4169–4173 (2011).
Laky, K. & Fowlkes, B. J. Notch signaling in CD4 and CD8 T cell development. Curr. Opin. Immunol. 20, 197–202 (2008).
Sunnerhagen, M., Pursglove, S. & Fladvad, M. The new MATH: homology suggests shared binding surfaces in meprin tetramers and TRAF trimers. FEBS Lett. 530, 1–3 (2002).
Curran, K. & Thamm, D. H. Retrospective analysis for treatment of naïve canine multicentric lymphoma with a 15-week, maintenance-free CHOP protocol. Vet. Comp. Oncol. 14, 147–155 (2016).
Koo, J. et al. Predicting dynamic clinical outcomes of the chemotherapy for canine lymphoma patients using a machine learning model. Vet. Sci. 8, 301 (2021).
Zhang, B. et al. An oncogenic role for alternative NF-κB signaling in DLBCL revealed upon deregulated BCL6 expression. Cell Rep. 11, 715–726 (2015).
Whillock, A. L. et al. Author Correction: TRAF3 regulates the oncogenic proteins Pim2 and c-Myc to restrain survival in normal and malignant B cells. Sci. Rep. 9, 17502 (2019).
Liu, T., Zhang, L., Joo, D. & Sun, S. C. NF-κB signaling in inflammation. Signal Transduct. Target. Ther. 2, 1–9 (2017).
Cui, W. et al. PIK3CD promoted proliferation in diffuse large B cell lymphoma through upregulation of c-myc. Tumour Biol. 37, 12767–12777 (2016).
He, Y. et al. Targeting PI3K/Akt signal transduction for cancer therapy. Signal Transduct. Target. Ther. 6, 425 (2021).
Hua, H. et al. Targeting mTOR for cancer therapy. J. Hematol. Oncol. 12, 71 (2019).
Rao, R. C. & Dou, Y. Hijacked in cancer: the KMT2 (MLL) family of methyltransferases. Nat. Rev. Cancer 15, 334–346 (2015).
Wu, K. et al. Analyses of canine cancer mutations and treatment outcomes using real-world clinico-genomics data of 2119 dogs. NPJ Precis. Oncol. 7, 8 (2023).
Wang, X. et al. KMT2D and PIK3CA mutation as potential factors to predict adjuvant chemotherapy efficacy in surgical triple negative breast cancer. Cancer Res. 82, 5084 (2022).
Zhang, W. et al. Etoposide, an anticancer drug involved in therapy-related secondary leukemia: enzymes at play. Transl. Oncol. 14, 101169 (2021).
Zhang, J. et al. The CREBBP acetyltransferase is a haploinsufficient tumor suppressor in B-cell lymphoma. Cancer Discov. 7, 322–337 (2017).
Staberg, M. Inhibition of histone deacetylases sensitizes glioblastoma cells to lomustine. Cell Oncol. 40, 21–32 (2017).
Fagan, R. J. & Dingwall, A. K. COMPASS ascending: emerging clues regarding the roles of MLL3/KMT2C and MLL2/KMT2D proteins in cancer. Cancer Lett. 28, 56–65 (2019).
Dawkins, J. B. et al. Reduced expression of histone methyltransferases KMT2C and KMT2D correlates with improved outcome in pancreatic ductal adenocarcinoma. Cancer Res. 76, 4861–4871 (2016).
Ferrero, S. et al. KMT2D mutations and TP53 disruptions are poor prognostic biomarkers in mantle cell lymphoma receiving high-dose therapy: a FIL study. Haematologica 105, 1604–1612 (2020).
Mar, B. G. et al. SETD2 alterations impair DNA damage recognition and lead to resistance to chemotherapy in leukemia. Blood 130, 2631–2641 (2017).
van der Heiden, A. D. et al. Characterization of the genomic landscape of canine diffuse large B-cell lymphoma reveals recurrent H3K27M mutations linked to progression-free survival. Sci. Rep. 15, 4724 (2025).
Li, M. Y. et al. TRAF3 loss-of-function reveals the noncanonical NF-κB pathway as a therapeutic target in diffuse large B cell lymphoma. Proc. Natl. Acad. Sci. USA 121, e2320421121 (2024).
Harris, L. J. et al. Clinical features of canine nodal T-cell lymphomas classified as CD8+ or CD4-CD8- by flow cytometry. Vet. Comp. Oncol. 18, 416–427 (2020).
Bohannan, Z. et al. Predicting likelihood of in vivo chemotherapy response in canine lymphoma using ex vivo drug sensitivity and immunophenotyping data in a machine learning model. Vet. Comp. Oncol. 19, 160–171 (2021).
Lai, Z. et al. VarDict: a novel and versatile variant caller for next-generation sequencing in cancer research. Nucleic Acids Res. 44, e108 (2016).
Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6, 80–92 (2012).
Cingolani, P. et al. Using Drosophila melanogaster as a model for genotoxic chemical mutational studies with a new program, SnpSift. Front. Genet. 3, 35 (2012).
Obenchain, V. et al. VariantAnnotation: a Bioconductor package for exploration and annotation of genetic variants. Bioinformatics 30, 2076–2078 (2014).
Lawrence, M. et al. Software for computing and annotating genomic ranges. PLoS Comput. Biol. 9, e1003118 (2013).
Morgan, M. et al. Rsamtools: binary alignment (BAM), FASTA, variant call (BCF), and tabix file import. R package version 1.0, 677–689 (2016).
Vail, D. M. et al. Response evaluation criteria for peripheral nodal lymphoma in dogs (v1.0)—a Veterinary Cooperative Oncology Group (VCOG) consensus document. Vet. Comp. Oncol. 8, 28–37 (2010).
Delgado, A. & Guddati, A. K. Clinical endpoints in oncology—a primer. Am. J. Cancer Res. 11, 1121–1131 (2021).
Bennett, P., Williamson, P. & Taylor, R. Review of canine lymphoma treated with chemotherapy-outcomes and prognostic factors. Vet. Sci. 10, 342 (2023).
Acknowledgements
This study was made possible through the collaboration between ImpriMed Inc. and a dedicated network of veterinary oncologists and general practitioners. We extend our heartfelt gratitude to the dogs and their owners for their invaluable participation. We also thank Dr. Kevin Choy, Dr. Douglas Thamm, and Dr. Carlos Bustamante for their insightful feedback on the manuscript. Special thanks to Amanda Polley for supporting the analysis of the NGS results.
Author information
Authors and Affiliations
Contributions
J.T. was responsible for conceptualization, data collection, curation, writing—original draft, writing—review and editing. Q.J.Y. was responsible for data curation and analysis. S.K., J.C., and S.S. were responsible for data collection and curation. M.K. contributed to data analysis. G.C. and S.C. contributed to data analysis and writing—review. D.S. and S.P. contributed to data collection and curation. S.L. was responsible for funding acquisition and also contributed to conceptualization. I.H. was responsible for conceptualization, supervision, writing—reviewing and editing. J.K. was responsible for conceptualization, writing—review and editing. I.H. and J.K. are co-corresponding authors.
Corresponding authors
Ethics declarations
Competing interests
J.T., S.L., and I.H. have a patent application about the methodology used to identify genetic mutations and project treatment outcomes described in this study. No disclosures were reported by the other authors not listed in this section.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Tsang, J., Yap, Q.J., Kapoor, S. et al. Identification of novel genetic mutations for the treatment prognostication of canine lymphoma. npj Precis. Onc. 9, 174 (2025). https://doi.org/10.1038/s41698-025-00988-5
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41698-025-00988-5






