Introduction

In certain particular situations, archival pathologic specimens may become the last source of biological material available for forensic casework, such as when a putative parent was deceased with no other material available, when pathological tissues were suspected to be mixed up in medical disputes, when disaster victims or missing persons are required for individual identification, or other special situations. Most of the archival material, however, is composed of solid tumors known to harbor high rates of genetic alterations randomly and abundantly throughout the tumor genome. Therefore, genetic instability of short tandem repeats (STRs) in cancerous tissues is also possible1. STR instability, which manifests as loss of heterozygosity (LOH) or microsatellite instability (MSI), could lead to the genotyping inconsistency between tumor and normal tissues, making forensic identification involving with tumor samples remains challenging in forensic genetics. Although the status of STR instability has been demonstrated in a variety of malignant tissues, including gastric cancer2colorectal cancer3lung cancer4leukemic5pancreatic cancer3breast cancer6papillary thyroid cancer7and et al. The status of STR instability and its clinical significance in hepatocellular carcinoma (HCC) remains to be comprehensively elucidated.

Hepatocellular carcinoma (HCC), the predominant subtype (80–85%) of liver cancer8continues to be the most formidable malignancies due to the lack of early diagnostic methods and efficient therapeutic strategies. A recent investigation in 2022 reported that HCC ranked the sixth most commonest malignancy and the third in cancer-related mortality with approximately 830,000 deaths worldwide annually9. In China, HCC ranked the fourth most prevalent, and the second leading lethal cancer close behind lung cancer10. A recent study by Anqi Chen et al. 2024 demonstrated that 63.33% (19/30) HCC cases harbored genetic alterations in at least one STR locus, with an overall STR alteration rate of 10.88%3. But this study was limited by a small sample size and no clinical data. To gain a deeper understanding of the patterns and degree of STR instability, as well as its clinical significance in HCC, we carried out a comprehensive investigation with a total of 101 matched HCC DNA from cancerous and adjacent non-cancerous tissues.

Results

Widespread STR alterations in HCC

In this study, 20 STRs and the Amelogenin gene were all amplified and genotyped successfully in the 101 matched HCC samples. 79.21% (80/101) of HCC cases exhibited STR alternations in least one STR locus. Among these investigated HCC cases, 20.79% of patients (21/101) displayed alterations with 1 ~ 2 STRs, 47.52% of patients (48/101) with 3 ~ 7 STRs, and 10.89% of patients (11/101) with a number of STRs exceeding 7 (Supplementary Materials S1).

Types and frequencies of STR alterations in HCC

Four types of STR alterations, namely partial loss of heterozygosity (pLOH), complete loss of heterozygosity (cLOH), occurrence of an additional allele (Aadd) and occurrence of a new allele (Anew) were all detected in HCC samples (Fig. 1). A total of 338 STR alternations were observed in 101 HCC tissues. pLOH exhibited as the predominant type of STR alterations with an overall rate of 13.91%, followed by cLOH, Aadd, and Anew with an overall frequency of 1.83%, 0.79%, and 0.2%respectively (Table 1). All investigated STRs exhibited varying degrees of alternation rates ranging from 4.95 to 47.52% (Fig. 2), significantly higher than the average rate of 0.2% in healthy germlines11. Notably, FGA and D16S539 exhibited a distinctively high frequency of alterations, reaching up to 47.52% and 42.57% respectively.

Fig. 1
figure 1

Four typical types of STR alterations in HCC. (A) partial loss of heterozygosity (pLOH), (e.g., allele 18/20 → allele 18/20, but the peak of ‘20’ allele in cancer tissues was significantly reduced); (B) complete loss of heterozygosity (cLOH) (e.g., allele 8/9→ allele 9/9); (C) an additional allele (Aadd) (e.g., allele 22/23 → allele 21/22/23); (D) the occurrence of a new allele (Anew) (e.g., allele 17/20 → allele 18/20).

Table 1 Types and frequencies of the alterations at the 20 STR loci in HCC.

Comparison of the average alteration rate of the forensic STRs across different types of malignant tumors

To better understand the degree of STR alterations in HCC, we compared the average rate of STR alterations in HCC with other types of malignant tumors. Interestingly, we found that HCC exhibited the highest level of genetic instability, followed by colorectal cancer (CRC)3gastric cancer (GC)2lung cancer (LC)4renal cell cancer (RCC)3pancreatic cancer (PC)3breast cancer (BC)3 and papillary thyroid cancer (PTC)7 (Fig. 2; Table 2). Our results indicated that HCC individuals are intrinsically much more susceptible to genetic alterations than any other malignant cancers reported to date. Furthermore, although both the incidence and overall frequencies of STR alterations in this study are a little higher than those in previous study for HCC3no statistically significant differences existed in the incidence of STR alterations between the two HCC groups.

Fig. 2
figure 2

Average alteration frequency of the 19 STRs across different tumor types: PTC: papillary thyroid cancer7BC: Breast cancer3PC: pancreatic cancer3RCC: renal cell cancer3HCC: hepatocellular cancer3LC: lung cancer4GC: gastric cancer, CRC2: colorectal cancer2.

Table 2 Comparison of the alteration frequency of STRs in different malignant cancers (%).

Association of STR alternations with clinicopathological characteristics

To assess the clinical implication of STR hypermutability in HCC, we analyzed the relationship between STR alterations and various clinicopathological characteristics including gender, age, serum a-fetoprotein (AFP), cirrhosis, HBsAg status, tumor size, and degree of tumor differentiation. However, no statistically significant association was found between STR alternations (grouped by presence or absence of STR alterations) and any clinicopathological features (p > 0.05). But when the 101 HCCs were divided into 3 groups based on the cumulative number of altered STR loci, namely the stable group (n = 0), the low-alterated group (n = 1 ~ 2), and the high-alterated group (n ≥ 3), the results revealed a significant association between the accumulation of STR alterations and the presence of positive HbsAg (p = 0.020) as well as the moderate-poor/poor differentiation (p = 0.049) (Table 3). Moreover, LOH at FGA was found to be associated with moderate-poor/poor differentiation (p = 0.002) (Table 4). And LOH at D16S539 was found to be significantly associated with an elevated AFP level (p = 0.042) as well as a larger tumor size (p = 0.040) (Table 5), indicating that LOH at FGA and D16S539 are most common genetic events in HCC.

Table 3 Association of the accumulation of STR alterations with clinicopathological characteristics of HCC.
Table 4 Association of STR alterations at FGA with clinicopathological characteristics of HCCs.
Table 5 Association of LOH at D16S539 with clinicopathological characteristics of HCCs.

Discussion

In current study, we performed a genome-wide investigation of the status of STR alterations in 101 paired HCCs using “classical” forensic STRs widely used in forensic genetics. Our results revealed that 79.21% (80/101) of HCCs exhibited STR alterations in at least one locus, with 16.73% of STR loci altered across all samples. Although these frequencies were slightly higher than those in the previous study for HCC3 no statistically significant differences in the incidence of STR alterations was observed between the two‌ HCC groups. And the slight differences might partly due to the differences in sample size, geographical region, inclusion criteria for HCC volunteers, or various clinical characteristics. By reviewing existing literature, we compared the average frequencies of the STRs alterations in different types of malignancies with sufficiently powered samples (n ≥ 20) for statistical validation. The results revealed that the average frequencies of the STR alterations varied in different tumors, and HCC exhibited a significantly higher alteration frequency compared to any other malignancy reported to date, indicating a remarkable high degree of genetic instability in HCC. Furthermore, our investigation also revealed that different tumors exhibited varied alteration tendencies across the STR loci (Table 2), and no significant regularity could be observed. Intriguingly, both wide ranges and high frequency of STR alterations were detected in CRC, GC and HCC, while the STRs were selectively altered in RCC, PC, BC, LC, and PTC, indicating the inclined alteration in STRs. Although four types of STR alterations were all detected in HCC, LOH (including pLOH and cLOH) was revealed as the predominant type of STR alterations (Supplementary Fig. 1), in agreement with previous literature in which LOH was demonstrated as a major event in almost all malignant tumors12. In actual forensic practice, only the types of cLOH, Anew and Aadd could lead to STR genotype alteration (STRGA), complicating the forensic evaluation. Whereas in this study STRGA occurred in 31 of 101(30.69%) HCC cases, and STRGA was detected at almost all locus except D2S1338, D2S441 and D7S820 in this study. Obviously, the “Gold standard” method of STR-genotyping in forensic application remains challenging when involving with malignant tissues. Alternative approaches such microhaplotypes (MHs) strategy, genome-wide sequencing or high-throughput genotyping of single nucleotide polymorphism (SNPs) might be powerful options.

Increasing evidence has demonstrated that there may be associations between STR alterations and medical conditions13,14,15. In recent years, it has been proposed that STR alteration in cancerous tissue can be used as a biomarker for certain cancers or as a predictor of certain susceptibility disease3,6,16,17. Although in our study no significant associations were found between the presense or absence of STR alterations and age, sex, tumor size, hepato-cirrhosis or AFP level, a significant association was observed between the accumulation of STR alterations and HbsAg as well as degree of differentiation, indicating that HCC patients with hepatitis B virus or poor differentiation are more susceptible to STR alterations.

LOH, indicative of tumor suppressor gene (TSG) pathway, has been established as an important mechanism in the carcinogenesis and/or subsequent progression of human cancers. LOH at 4q has been documented in several human cancers including cervical carcinoma18oesophageal carcinoma19squamous cancer in head and neck20small-cell lung cancer21HCC22,23,24 and et al. However, LOH at 4q occurred more frequently in HCC than in all other cancers. Among which LOH at 4q21-22, 4q28, 4q34, 4q35, 4q24-26, 4q34.3-35 have been frequently observed in HCC. And several HCC-associated tumor suppressor genes (TSGs) including PLAC8 (4q21.22), PRDM5 (4q26), PCDH10 (4q28.3), FBXW7 (4q31.3), SORBS2 (4q35.1) have been identified from these regions25,26,27,28,29,30,31. Consistent with previous reports on high frequency of LOH at 4q31 in HCC22,23our data not only revealed a distinctively high frequency of LOH at 4q31.3 (FGA), but also a significant association with moderate-poor/poor differentiation of HCC, indicating an important TSG associated with HCC may reside at the region of 4q31.3. Interestingly, the STR marker of FGA is exactly located in the third intron of the fibrinogen α (FGA)(4q31.3) gene, which encodes the alpha subunit of the coagulation factor fibrinogen. Mutations in FGA have been identified in several diseases including hypofibrinogenemia, afibrinogenemia, dysfibrinogenemia as well as renal amyloidosis32,33,34,35. A recent study has established FGA as a “drive” gene regulating the progression and metastasis of HCC36. And Xi Han et al. 2024 further validated that FGA acted as a TSG suppressing invasion and metastasis in HCC via the PI3K/AKT signaling pathway37. Whereas our findings provide novel clinical evidence for the critical role of FGA in HCC.

LOH at 16q occurs frequently in various human malignancies including breast cancer38ovarian cancer39prostate cancer40 and HCC23,41,42suggesting that one or more tumor suppressor genes (TSGs) may lie within the region of 16q. Although thus far no definite TSGs associated with HCC at chromosome 16q have yet been identified, LOH on 16q is yet a most frequent genetic events in HCC and occur more frequently with increased tumor size, poor differentiation and metastasis in HCC1,23,43,44. Intriguingly, in this study our data not only demonstrated a markedly high frequency of LOH at 16q24(D16S539), but also revealed a significant association with increased tumor size as well as elevated AFP level. To our knowledge, it is the first to report that LOH on chromosome 16q is associated with an elevated AFP in HCC. These findings indicated that an important TSG, which could not only regulate hepatocyte growth but also involve in postnatal re-expression of AFP, may reside within the region of 16q24.1. But further study is required to clarify this issue.

Conclusions

Taken together, our findings advanced our understanding of the genetic basis of HCC with a robust sample size and clinically relevant correlations. More importantly, our findings holds considerable potential for advancing both forensic genetics and oncology. Future research will focus on the potential roles and underlying mechanisms of FGA as well as D16S539 in hepatocellular carcinoma.

Materials and methods

Participants and samples

101 HCC individuals were recruited from the Hunan Provincial People’s Hospital (the First Affiliated Hospital of Hunan Normal University), Changsha, PR China during 2022–2024. Fresh cancerous and corresponding control tissues were collected with histopathologically verified HCC. Informed consent was obtained from each HCC volunteer.

Patients who had received any treatment including chemotherapy, radiotherapy, or other cancer-related treatments prior to surgery were excluded from the current research cohort. This study included 84 male and 17 female with ages ranging from 30 to 82. This study was approved by the Ethics Committee of Academy of the Hunan Provincial People’s Hospital, China.

DNA preparation

DNA was extracted utilizing the DNeasy®Blood & Tissue Kit (Qiagen, Venlo, The Netherlands) according to the instructions. The DNA concentration was acquired using a NanoDrop One (Thermofisher Scientific, Waltham, MA, Germany).

STR genotyping

PCR was amplified using the MicroreaderTM21ID kit (MR21, MicroReader, Beijing, China) comprising 20 autosome loci (13 CODIS STRs, D2S441, Penta E, Penta D, D2S1338, D19S433, D12S391, and D6S1043) and the Amelogenin gene (Amel). The primer sequences were those detailed in supplementary S3, and the PCR reaction conditions were as follows: 37 °C for 5 min, then 96 °C for 4 min, followed by 28 cycles at 94 °C for 5 s and 60 °C for 70 s, then 60 °C for 30 min and hold at 4 °C. Genotyping was conducted using the ABI 3500 Genetic Analyzer (Applied Biosystems, USA), and STR genotyping data were validated with an allelic ladder and positive and negative controls using GeneMapper ID-X Software (Applied Biosystems, USA). All data from paired tissues were analyzed in a double-blind manner.

To distinguish genuine alternations from genotyping errors, 101 paired samples with inconsistent genotypes or null alleles were re-analyzed to confirm the genotypes using the PowerPlex 21 kit (Promega, Madison WI, USA), which includes 20 autosomal loci (13 CODIS STRs, D1S1656, Penta E, Penta D, D2S1338, D19S433, D12S391, and D6S1043) and the Amel gene. The PCR reaction conditions were as follows: 96 °C for 1 min, followed by 25 cycles at 94 °C for 10 s and 72 °C for 30 s, then 60 °C for 20 min and hold at 4 °C. Four types of STR alterations in HCC cases (HCCs) were observed as previously described, i.e. occurrence of an additional allele (Aadd) (e.g., allele 22, 23 →allele 21, 22, 23); Occurrence of a new allele (Anew) (e.g., allele 17,20→ allele 18,20); and complete loss of heterozygosity (cLOH), and specimens with the ratio of peak intensity in normal/tumor pair < 0.5 or > 2.0 were defined as partial loss of heterozygosity(pLOH)2, both cLOH and pLOH were classified as LOH. In addition, considering that Aadd (occurance of an additional allele) and Anew (occurrence of a new allele) were characterisized by length alterations of simple repeats and caused by DNA mismatch repair deficiency, both Aadd and Anew are also generally referred to as microsatellite instability (MSI).

Statistical analysis

The statistics were executed using SPSS software (version 27.0). the count data are submitted as proportions (%) or frequencies (n), the χ² test or Fisher’s exact probability method was employed to analyze and compare the data between different cohorts. The p-value < 0.05 was considered statistically significant.