Introduction

Recent advances in single-cell technologies have unveiled an unprecedented level of detail and insight into the heterogeneity and complexity of biological systems. The creation of comprehensive atlases for various organs and tissues has provided a vast repository of information, enabling the intricate deconvolution of cell populations and a deeper understanding of their diversity1,2,3. The application of single-cell technologies has facilitated the prediction of cellular trajectories, the identification of rare cell populations, a more profound comprehension of malignant cell transformation and numerous other discoveries4,5,6,7. Despite the significant progress achieved, the clinical translation of these technologies still encounters several challenges that must be addressed to fully realize their potential and expand their applications.

Current approaches to investigate the heterogeneity in human tissues are costly or demand access to large amounts of material while optimized for a small number of tissue types8,9,10,11. Moreover, most of these methodologies have been developed with fresh tissues as the starting material, inherently assuming that samples can be immediately processed. This poses significant logistical challenges, particularly because samples are often collected from multiple locations distant from research laboratories thereby increasing costs and restricting availability. Consequently, maintaining the integrity of fresh samples during transport becomes a critical issue. Additionally, the process of dissociating tissues into a single-cell suspension typically involves extensive mechanical and enzymatic steps over several hours to isolate live cells12. This procedure induces alterations in transcriptional profiles and introduces sequencing noise as well as cell-type dissociation biases, thereby compromising the quality of downstream analyses12,13,14,15. Moreover, for many research projects, the use of fresh samples is not feasible because a comprehensive pathological characterization is required to select appropriate samples for analysis. The detailed characterization of the tissue often precludes its immediate processing, making it difficult to adhere to protocols designed for fresh specimens. Therefore, expanding the applicability of single-cell technologies to low-input cryopreserved material in repositories would be beneficial for researchers.

Numerous protocols and methods have been developed to efficiently extract nuclei from diverse tissues prior to single-nuclei RNA sequencing (snRNA-seq)16,17. Transcriptomic analyses from isolated nuclei have demonstrated equivalency to those obtained from live cells isolated from fresh tissues, suggesting that nuclei are a reliable proxy18,19,20,21,22,23. Additionally, snRNA-seq has been shown to provide datasets that more accurately reflect the composition of cell populations within the tissue8,13,24,25,26. These methodologies have proven successful in profiling samples that have been cryopreserved for extended periods of time, although they typically require a minimum starting material of around 30 to over 100 mg to provide reliable results9,18,27,28. While some approaches have been adapted for low-input material, most protocols are optimized for specific tissue types8,29,30 complicating their application across different tissues. Optimizing conditions for nuclei extraction is crucial for a successful snRNA-seq experiment. The process requires careful mechanical homogenization and an optimal balance between high-quality nuclei and minimal cellular debris31,32. Standardizing these approaches to efficiently use low-input cryopreserved material for snRNA-seq is crucial to fully leverage the potential of clinical samples and expand the range of tissues studied with these techniques.

Developing an adaptable and versatile method compatible with multiple tissues for generating snRNA-seq data is a significant challenge due to the diverse biophysical characteristics of different tissues. In this study, we propose a method to extract high-quality nuclei from low-input cryopreserved human tissues. Our approach optimizes the solubilization process for cancer tissues with distinct biophysical properties (brain, bladder, lung, and prostate). By incorporating nuclei sorting to purify samples before tagging and sequencing, we ensure that only the most intact nuclei are used for analysis. Our results were comparable to those from existing cellular atlases, suggesting that our approach is both robust and suitable for a wide range of tissues with varying biophysical characteristics.

Methods

The detailed protocol is available as supplementary information.

Tissue specimen

The research conducted within this study involved the use of human cancer samples. Prior to the collection of all human samples, the research protocols were reviewed and approved by the Institutional Review Boards (Comité d’éthique de la recherche du CHU de Québec-Université Laval and Comité d'éthique de la recherche de l’Institut universitaire de cardiologie et de pneumologie de Québec – Université Laval). Informed consent was obtained from all participants. Bladder and prostate samples were obtained from the URO-1 biobank of the CHU de Québec-Université Laval (Projects No. 2012-1002 and 2024-6885). Brain samples were obtained from the CHU de Québec – Université Laval (Project No. 2023-6392 and 2024-6885). Lung samples were obtained from the Biobanque du Réseau de recherche en santé respiratoire du Québec (biobanque.ca) – site Institut universitaire de cardiologie et de pneumologie de Québec – Université Laval (Project No. MP-10-2024-4071). All experiments were performed in accordance with relevant guidelines and regulations.

Tissue homogenization

Cryopreserved samples were minced in a pre-cooled mortar on dry ice using a scalpel, transferred into 15 mL tubes prior to adding 3 mL of ice-cold lysis buffer (10 mM Tris–HCl pH 7.4, 10 mM NaCl 10, 3 mM MgCl2.6H2O, 0.05% NP-40). A Dounce homogenizer was used with different pestles: pestle A (loose, clearance: 0.0025–0.0055 inches, Fisher Scientific K885301-0002) or pestle B (tight, clearance: 0.0005–0.0025 inches, Fisher Scientific K885303-0007). The number of strokes and the selection of the pestle were optimized for each tissue (Table 1). Following homogenization, samples were transferred, and 2 mL of ice-cold lysis buffer were added on ice and incubated for 5 min. The lysis reaction was stopped with the addition of 5 mL of ice-cold nuclei washing buffer (0.5X PBS, 5% BSA, 0.25% Glycerol, 40 units/mL Protector RNAse inhibitor). Following filtration on 30 µm MACS strainers (Miltenyi Biotech, Bergisch Gladbach,Germany), samples were centrifuged for 10 min at 1000 g (4 °C). Pellets were resuspended in 1 mL of nuclei washing buffer and then 1 mL of a 50% (wt/vol) solution of iodixanol (Optiprep Millipore Sigma, Burlington, USA) was added. The nuclei suspension was gently transferred on top of a 2 mL cushion of 29% (wt/vol) iodixanol. The pellet was resuspended in 300 µL of nuclei washing buffer. Microscopy images were obtained using the ZEIZZ Axio Observer 7 (Carl-Zeiss-Stiftung, Oberkochen, Germany) and the Fiji image processing package (v 1.54j).

Table 1 Optimized mechanical homogenization for cryopreserved tissues.

Nuclei sorting

Following homogenization, nuclei were stained with 7-AAD (Thermo Fisher Scientific Waltham, USA, Cat. No. A1310) for 10 min before sorting using a BD FACSAria Fusion (Becton Dickinson, New Jersey, USA). A 70 µm nozzle with a 4-way purity mask was used to increase the quality of the sorted nuclei and reduce the final volume. To estimate the relative size of the nuclei, non-fluorescent flow cytometry size calibration beads were used with sizes of 7.88,10.1, and 16.4 uM (Spherotech Cat. No. PPS-6 k). Fluorescent-positive events within size limits were collected and centrifuged at 1000 g for 10 min at 4 °C. Approximately 70 µL of the supernatant was kept to resuspend the purified nuclei before quantification and quality control by microscopy. Sorting plots were generated by FCSExpress (v7 DeNovo Software, Pasadena, USA).

Single-nuclei sequencing

Tagging of single-nuclei and library preparation was performed through Gel Beads-in-emulsion following the 10X Genomics (10 × Genomics, Pleasanton, USA) guidelines (CG000204 Rev. D) by the Genomics platform from the Centre de Recherche du CHUQ de Québec – Université Laval. An estimated 15,300 to 30,300 nuclei were loaded per run. The quality of cDNA was assessed with High Sensitivity D5000 ScreenTape Assay on 4200 TapeStation (Agilent, Santa Clara, USA). Samples that passed the quality control were sequenced on a NovaSeq 6000 (Illumina, San Diego, USA) to a minimal depth of 20,000 paired-end reads (100 bp) per expected nuclei.

Data analysis

Following sequencing, FastQ files were used as input for Cell Ranger (v7.1.0)33. Single-nuclei matrices were examined and filtered to remove ambient RNA using SoupX (v1.6.2)34. Central processing of snRNA-seq was made using Seurat (v5.1.0)35. Values for total number of transcripts (UMI) detected in a single nucleus (nCount_RNA), the number of unique features detected in a single nucleus (nFeature_RNA), and the mitochondrial percentage precent in each dataset was computed. The following filters were applied to ensure quality data: nCount_RNA ≥ 300, nFeatures between 250 and 2500 and mitochondrial percentage < 10%. Additionally, we examined the percentage of ribosomal and hemoglobin genes and the ratio of genes per UMIs (Figure S1). Following filtrations, the dataset was processed to detect and eliminate doublets with DoubletFinder (v2.0.4)36. Processed and filtered datasets were integrated with single-cell atlases downloaded from CZ CELLxGENE (Table S1)37. Prior to integration the SCTransform (SCT) method was applied to each dataset for normalization35. Then integration anchors were identified between the datasets to find common cell types or states. The identified anchors were used to integrate the datasets into a single Seurat object. Batch effects were corrected using the Harmony (v1.2.0) algorithm38. Principal component analysis (PCA) was performed on the integrated data to reduce dimensionality and identify key components. Clustering and dimensionality reduction through Uniform Manifold Approximation and Projection (UMAP) were performed for downstream analyses and visualizations.

For cell type annotations, the nomenclature of cellular populations found within single-cell atlases was used as a reference for manual annotation. Next, we performed curation to refine and validate the annotation by cross-referencing the preliminary labels with single-cell atlases to ensure proper cell type assignment. Discrepancies between the gene expression patterns and marker genes were resolved to ensure robust and biologically meaningful classification of the cell populations. Plots were generated by Seurat (v5.1.0)35, and ggplot2 (v3.5.1)39.

Results

Isolation of single-nuclei starting from low-input cryopreserved tissues

To determine the feasibility of generating quality single-nuclei data from low-input cryopreserved material, we initially tested traditional nuclei isolation involving solubilization and centrifugation. However, significant material loss during the process made this approach unsuitable for low-input material (Figure S2). To develop a more efficient and versatile approach, we investigated alternative methods aiming to work with less than 15 mg of cryopreserved tissue, an amount typically found in a standard biopsy for solid tumors. Through iterative testing and refinement, we developed a protocol optimized for low-input cryopreserved human tissues for single-nuclei transcriptomics (Fig. 1). This protocol is divided into four modules: (1) Homogenization; (2) Nuclei Sorting; (3) Quality Control, and (4) Tagging and Sequencing. We validated the protocol using four tissues of significant interest to cancer research: brain, bladder, lung, and prostate. These tissues exhibit a range of biophysical properties, from low stiffness (brain) to high stiffness (prostate)40,41,42. In the following sections, we will detail the necessary steps to achieve high-quality results.

Fig. 1
figure 1

Workflow for single-nuclei isolation starting with low-input cryopreserved tissues. Outline of the main steps of the process for isolating and preparing single-nuclei for transcriptomics analyses starting with low-input cryopreserved human tissues. Module 1—Homogenization: Tissue samples are mechanically homogenized to release single nuclei while preserving nuclear integrity using a Dounce homogenizer. Module 2—Nuclei Sorting: The homogenized mixture is subjected to fluorescence-activated cell sorting (FACS) to isolate and purify the nuclei based on integrity and size. Module 3—Quality Control: Isolated nuclei undergo quality control assessment with microscopic examination to confirm their integrity and purity. Module 4—Tagging and Sequencing: Nuclei are tagged with unique molecular identifiers (UMIs) and barcoded using the Chromium platform. The tagged nuclei are then subjected to library preparation and high-throughput sequencing followed by data analysis.

Homogenization of cryopreserved tissues requires optimization

Tissue homogenization is the first step of our method to isolate nuclei from cryopreserved tissues. A Dounce homogenizer was used to solubilize the tissue, using specific conditions. The type of pestle and the number of strokes were optimized for each tissue. Two metrics were used to assess nuclei recovery: the number of nuclei per mg of tissue (Fig. 2A–D) and the percentage of good-quality nuclei (Fig. 2E–H). The number of nuclei per mg of tissue ranged between ~ 10,000 (lung) and ~ 75,000 (brain). For single-nuclei applications, we aimed for minimally ~ 50,000 total nuclei starting with between 10 and 15 mg of cryopreserved tissue. In all cases, the total number of nuclei per mg of tissue was sufficient for downstream sequencing application. Therefore, the percentage of good-quality nuclei was used as the main determinant for the best conditions (Fig. 2E–L). The selected conditions ranged from 6 strokes of pestle A (loose) for the brain to 10 strokes of pestle B (tight) for the prostate (Table 1). These results are in line with the higher stiffness of the prostate. It is worth noting that tissue aggregates were still visible suggesting that samples were not over solubilized (Fig. 2I–L). Therefore, determining the homogenization conditions empirically is a critical step for each tissue to be used with our method.

Fig. 2
figure 2

Different cryopreserved tissues require specific homogenization conditions. Homogenization conditions need to be optimized for each tissue. (AD) Quantification of the number of nuclei per mg of tissue for each sample. Error bars represent the standard deviation for a biological triplicate (only duplicate for bladder due to tissue constraints). (EH) Quality assessment of the extracted nuclei. Microscopy images were quantified manually. Nuclei were considered high-quality if: the membrane integrity was intact, the shape of the nuclei was consistent, and nuclei aggregates were low. The bars in red represent the optimal number of strokes required to obtain the best quality nuclei. Error bars represent the standard deviation for a biological triplicate. (IL) Representative microscopy images for the optimal conditions highlighted in red (EH). Nuclei were stained with 4′,6-diamidino-2-fenilindol (DAPI). Scale bars represent 100 and 50 µm respectively.

Sorting allows recovery of a pure suspension of quality nuclei

Obtaining high-quality single-nuclei data relies on preserving the nuclear integrity, which requires specific conditions and proper handling. Traditional protocols involving centrifugation and washing steps often result in the loss and damage of cells and nuclei. As an alternative, sorting intact nuclei before sequencing has proven effective9,43,44,45. The fluorescent DNA intercalator 7-aminoactinomycin D (7-AAD) has been shown to be safe for single-cell applications20,46,47. We utilized 7-AAD in combination with calibration beads of different sizes (8–16 µm) to set the gate for fluorescence-activated cell sorting (FACS) of nuclei (Fig. 3A). For each tissue, nuclei positive for 7-AAD and between 8 and 16 µm were selected (Fig. 3B–E). We obtained between 307 and 607 nuclei/µL (Table S2). A minimal concentration of 300 nuclei/µL was set as the lower threshold for tagging and sequencing to maximize the number of single nuclei profiled according to 10X Genomics guidelines. Overall, cell sorting minimally affected the recovery process while significantly reducing the presence of debris.

Fig. 3
figure 3

Nuclei are purified using fluorescence-activated cell sorting. Homogenized nuclei were subjected to fluorescence-activated nuclei sorting (FANS) to enrich high-quality nuclei while removing ambient RNA and debris. (A) Left—Scatter plot representing the nuclei distribution of an unlabeled sample based on the Side Scatter-Area (SSC-A) and 7-AAD intensity. The FANS gate (blue box) was set based on standard size beads ranging from 8 to 16 µm. Right – Histogram of the calibration beads using the forward scatter (FSC) signal. (BE) Left—Scatter plots showing the distribution of nuclei for the brain (B), bladder (C), lung (D) and prostate (E) samples. Nuclei within the FANS gate were sorted and used for downstream analyses. Right – Distribution of the calibration beads and the nuclei (blue) using the FSC to approximate the size of nuclei for the different tissues.

Low-input cryopreserved tissues yield similar heterogeneity to single-cell atlases

Following purification, nuclei were tagged using the 10X Genomics Chromium droplet technology. Following library preparation, samples were sequenced at a minimal depth of 20,000 reads per cell. The quality of the data was assessed using established methods to ensure reliable and accurate data (see methods and Figure S1). We sequenced between 1550 and 7468 nuclei per sample (Figure S3). To compare the heterogeneity of the profiled samples with the tissue of origin, we integrated our results with publicly available single-cell atlases (Fig. 4 and Table S1). Low-input cryopreserved samples and single-cell atlases were normalized, and batch corrected to be combined into the same data set. For each tissue, our sequencing results overlapped with the cluster distribution of the single-cell atlases (Fig. 4A–D). Then, the different cell populations were identified and annotated using available signatures (Fig. 4E–H). For all tissues, combinations of tissue-specific subtypes in addition to immune and endothelial cells were recovered as expected for heterogenous tumors samples. For example, in the prostate tissue we identified all the expected cell populations including luminal, basal, immune and endothelial cells (Fig. 4H). Taken together, these results support the feasibility of our method using low-input cryopreserved tissues to generate representative single-nuclei datasets.

Fig. 4
figure 4

Low-input cryopreserved tissues yield similar heterogeneity to single-cell atlases. To evaluate the representativity of single-nuclei data starting with low-input material, we integrated our results with publicly available single-cell atlases. (AD) Uniform manifold approximation and projection (UMAP) plots of the individual cells identified in each tissue (red) with a corresponding single-cell atlas (grey). Sources for the atlases are available in Table S1. Each dot represents a single nucleus or cell. (EH) UMAP plots color-coded according to distinct cell types for the brain (9), the bladder (11), the lung (15) and the prostate (10). The cell annotations represent the cell populations found within the reference atlas for the specific tissue. Each dot represents a single nuclei or cell.

Discussion

Access to well-characterized samples in sufficient quantities is a major barrier for many single-cell experiments, as the high costs of these studies often limit the number of samples and biological replicates that can be processed. To address this issue, we propose a versatile method for extracting nuclei from cryopreserved human tissues available in tissue repositories, which requires minimal input material (Fig. 1). This method has been successfully applied to profile cells from brain, bladder, lung, and prostate tissues (Fig. 4), demonstrating its broad applicability. Notably, some tissues, such as the prostate, have historically posed challenges due to their highly fibrous nature, making them difficult to process for many applications. By focusing on nuclei rather than live single-cell suspensions, we eliminate the constraints associated with extracting viable whole cells. Our findings indicate that the solubilization of the tissues is the most variable and critical step (Fig. 2), necessitating careful optimization for each specific tissue type to ensure the extraction of high-quality nuclei prior to purification (Fig. 3). Once good-quality nuclei are obtained, the subsequent sequencing results are typically robust and reliable (Fig. 4). These results demonstrate that our method is applicable to a wide range of tissues with varying biophysical properties and can be potentially expanded to make the technology accessible to more tissues.

A significant limitation of current single-cell approaches is the substantial amount of tissue required. Samples with high clinical value for diagnostics are frequently available only in limited quantities, and cryopreserved samples, which cannot be replenished, are allocated with caution. Access to these samples is typically granted only when the potential for generating valuable information is sufficiently high. Our results demonstrate that using less than 15 mg of tissue, significantly less than the amounts required by standard reference atlases, yields the expected cellular heterogeneity for each tissue (Fig. 4). Notably, some cell types, such as epithelial cells in the lung are difficult to extract and often show biases in protocols that start with fresh tissues25,48. In this regard, extracting nuclei from cryopreserved tissues may offer a distinct advantage, as we profiled these cell populations that are sensitive to dissociation processes used for fresh samples. By using our method to extract nuclei from low-input cryopreserved tissues, we expand access to valuable samples. This methodology holds significant potential for advancing single-cell research, particularly in clinical and diagnostic contexts where sample availability is a critical constraint.

The development of methods to extract single-cell information from cryopreserved human tissues offers a significant advantage by enabling long-distance collaboration and the molecular analysis of retrospective studies. Our approach has successfully profiled cryopreserved samples stored for long periods (over 20 years), showcasing its versatility and robustness. Compared to other methods, which often require larger sample sizes and fresh tissues, our technique requires minimal tissue input while still delivering high-quality nuclei extraction and cellular heterogeneity across diverse tissues. Reducing the reliance on fresh tissues will enable the investigation of a broader array of tissue types and conditions, enhancing our understanding of tissue heterogeneity. Integrating single cell approaches with well-characterized tissues and long-term clinical follow-up will offer an effective strategy to tackle complex biomedical questions.

Limitations of study

While our method is applicable to at least four cancer tissue types: brain, bladder, lung, and prostate, it has not been extensively tested on additional normal or tumor tissues. The method was developed with a focus on optimizing conditions for low-input cryopreserved human tissues that range different grades stiffness, so we did not benchmark it against different protocols using the same starting material. Although our method is efficient, other conditions may be better suited for specific tissues. For example, protocols tailored for liver tissues often employ different enzymatic treatments to handle the higher metabolic activity26,49 while those designed for cardiac tissues typically use gentle mechanical dissociation to preserve delicate cell structures50. Additionally, we did not compare the cellular heterogeneity obtained from large versus small amounts of starting material, which limits our ability to conclude whether using a low input reduces the likelihood of identifying rare cell populations or introduces biases in cell proportions. Finally, our efforts were focused on transcriptomics, and we did not explore other single-cell approaches, such as those investigating chromatin features, which could provide complementary insights.

Conclusion

Overall, we present a functional, versatile, and robust method for isolating nuclei for single-nuclei transcriptomic analyses across multiple human cancer tissues, with the potential for adaptation to a broader range of applications and tissue types.