Introduction

Motor neuron disease (MND), the third most prevalent neurodegenerative condition after Alzheimer’s and Parkinson’s, is a highly fatal, rapidly progressive, and complex disease. Typical patient survival is 2–3 years post symptom onset, and given the greatest incidence is observed between 50 and 85 year olds, there is a rapid expected increase in global burden as populations age1. Current therapeutic options yield minimal benefit in broad patient populations2,3,4, and as such there is a strong demand for a new pipeline of developed therapeutics.

Unfortunately, modelling MND for drug discovery purposes remains challenging, partly due to the range of pathological mechanisms implicated, many of which are backed by genetic associations5,6,7,8,9,10. Given the cellular complexity and mechanistic heterogeneity of MND, strong support exists for use of patient derived iPSC derived motor neurons as a model to enable disease interrogation and preclinical drug discovery.

However, these iPSC models encounter significant reproducibility challenges. When in a pluripotent state, they are highly sensitive to external environmental cues, modifying internal genomic architecture, becoming unsuitable for differentiations11. Whilst protocols using transcription factor driven differentiation can overcome some of these challenges, demonstrating greater reproducibility, purity, and decreased cost/generation time12,13,14, these genetic methods fail to mimic neurodevelopment.

Given the importance of cellular identity for hypothesised subtle phenotypes within MND, mimicking these developmental pathways are key. As such, preference exists for small molecule driven protocols that can provide highly guided developmental routes via promotion of key signalling cascades. Such protocols have been developed, and provide motor neurons reproducibly from iPSCs15,16, however inconsistencies remain17.

Here, we examine suitability of the widely used Hall et al. small molecule driven protocol for motor neuron differentiation18 to support early drug discovery in an industrial environment. We assess variability in the purity of differentiated cells and using statistical analysis, attempt to identify which factors (namely operator, induction set and cell line) are the main sources of variability. We conclude that assessing iPSC genomic stability is an accessible monitoring factor for reducing model variability.

Results

Variability is high and more greatly influenced by non-genetic factors

Using data generated across 15 separate rounds of differentiation (where a round can include more than one cell line), herein termed “induction sets”, and 8 cell lines (with a minimum of 3 induction sets per cell line, and a minimum of 3 induction sets per operator), motor neuronal characteristic, herein referred to as QC (Quality Control) metrics (detailed in Table 1), were analysed to describe variability and assess robustness of the differentiation protocol described by Hall et al.18. Statistical testing focused on exploring the magnitude of variance generated, and how much variation was attributable to key factors (namely cell line, induction set, and operator). Selection of factors was based upon importance described from literature in similar settings19, and those pertaining to handling of cells, namely “induction set” given each differentiation attempt will be subject to expected undefinable variations in environmental and reagent conditions.

Table 1 Definitions of QC criteria for motor neuronal cultures; cells assessed at neural precursor cell stage (NPC) and after terminal differentiation for 3 days (D3) or 10 days (D10) to produce early and later motor neurons respectively; image analysis scripts run in signals image artist (Revvity).

Coefficient of variance analysis suggested that most quantifiable factors displayed high levels of variation (> 30% – see Table 2), with over half of these metrics exceeding 40%, far beyond acceptable levels for an industrial environment (ideally < 20%)20. Further linear modelling suggested that “Operator”, followed by “Induction Set” explained large amounts of variation within each of the QC metrics as demonstrated by high R2 values (i.e. the amount of variation that can be attributed to this factor) and low p-values associated with each (p < 0.05 in all cases). Whilst “Cell Line” was a significant explanatory factor for variation (p < 0.05 in all cases), generally this remained low in magnitude, with R2 values < 12% in the majority of cases.

Table 2 Variability analysis of motor neuron differentiations per quantitative QC metric as performed in R; R2 values represent percentage of variability attributable to “Cell Line”, “Induction Set” or “Operator” per QC metric where “-” indicates no significant contribution; adjusted R-squared values indicate strength of linear model (range 0–1, adjusted R-squared > 0.60 acceptable).

Testing to assess independence between key factors and metrics with a binary outcome (i.e. “Yes” or “No”) highlighted that a significant association existed between “Induction Set” and whether non-neuronal morphology was present or cultures possessed cellular clusters > 5000µm2 (Non-neuronal morphology: χ2 = 20.79, df = 12, p < 0.05; Cellular clusters > 5000µm2: χ2 = 23.04, df = 12, p < 0.05). This relationship was however lacking when examined against “Cell Line” and “Operator”.

Overall statistical data highlights that variability in differentiation quality can be linked strongly to “Induction Set” and “Operator”, and less so to “Cell Line”. Whilst some variability continues to remain attributable to cell line (see Table 2), it remains low and provides remit to investigate factors more amenable to being controlled (such as handling procedures) to improve differentiation quality.

iPSC genomic stability is a key determinant of differentiation protocol performance and variability

Following previous studies in the literature, it is known that iPSC genomic architecture can change over time whilst these cells are in culture11 and this can have a noticeable impact on differentiation protocols21. We assessed iPSC karyotypic abnormalities on motor neuron differentiations using the Hall et al. protocol18, to understand if this knowledge can be used to reduce variability of QC metrics in produced material.

Using a bulk RT-qPCR based assay, with primers designed to target the nine most common karyotypic abnormalities in human iPSCs (as designed by STEMCELL Technologies), five rounds of differentiations were carried out across two cell lines to assess the impact of genomic stability. Cell lines with chromosomal copy numbers < 1.5 or > 2.5 (< 0.7 or > 1.3 for chromosome X for lines from male donors) were considered “abnormal”. One-way ANOVAs comparing chromosomal copy number across all loci with post hoc Tukey test p-value adjustment were then used to confirm these chromosomal deletions.

Differentiations from cell lines with no detectable abnormalities commonly displayed decreased coefficient of variance values for QC metrics when compared against all differentiation sets, consequently meaning these were less variable (see Table 3). Differentiations from these cells also showed significantly greater purities at the neural precursor and terminal differentiation day 3 stage, as determined by quantitative immunocytochemistry co-expression analysis for relevant neural precursor and motor neuronal markers (see Table 3 for details).

Morphological abnormalities were clearly present in cell lines harbouring confirmed or suspected chromosomal deletions, with confirmed cases (i.e. copy number < 1.5, adjusted p-value < 0.05) displaying complete lack of motor neuronal organisation (see Fig. 1A) and suspected cases (i.e. copy number < 1.5, adjusted p-value > 0.05) retaining some neuronal networks but with extremely dense clusters and rapidly proliferating cells with non-neuronal morphology (see Fig. 1B). In cell lines with no chromosomal deletions, typical neuronal structuring with no clear morphological abnormalities were observed (see Fig. 1C). Detected chromosomal abnormalities are detailed in Table 4.

Table 3 Variability and QC data for differentiations performed on iPSCs where chromosomal stability status was known; “All sets” include cell lines with and without abnormalities whilst “No detectable abnormalities” only includes data from the latter, as confirmed via a bulk qPCR assay targeting common iPSC chromosomal abnormalities; coefficient of variance data highlighted in bold indicates an improvement in value; QC metrics as previously defined; * = p < 0.05, ns = p > 0.05, as determined by a student’s T-test.
Fig. 1
Fig. 1
Full size image

Representative images of motor neurons terminally differentiated for 10 days subject to karyotypic abnormality analysis with states as follows: (A) detected chromosomal copy numbers of Chr8q, Chr17q, ChrXp < 1.5, lowest adjusted p-value < 0.034; (B) detected chromosomal copy numbers of Chr1q, Chr18q < 1.5, lowest adjusted p-value < 0.93; (C) detected chromosomal copy numbers normal. Targeted karyotyping for common abnormalities performed via bulk qPCR assay. Merge image colourings: Blue = Hoechst; Red = MAP2; Orange = ISL1; scale bar = 100 μm.

Table 4 Chromosomal copy number as assessed via bulk qPCR assay; five rounds of differentiation completed across two cell lines; One-way ANOVA performed comparing copy number against all loci; copy number < 1.5 or > 2.5 (< 0.7 or > 1.3 for chromosome X for lines from male donors) and p < 0.05 with post hoc Tukey adjustment indicate abnormality.

Discussion

“Induction set” shown to be greatest contributor to variability of culture purity.

Variance and statistical analysis show that factors affecting variability in order of largest to smallest contributor were: induction set > operator > cell line.

Large scale studies into reproducibility of differentiation success from iPSCs have described similar contributors22, largely grouped into genetic and non-genetic factors, with each contributing 20% and 40–60% to overall variation respectively19. These numbers align with calculated values from variability studies carried out in this study for genetic (i.e. cell line) and non-genetic factors (i.e. induction set and operator) (~ 2–30% and 30–70% respectively).

Interestingly, variability attributable to genetic factors (i.e. cell line) remained low yet persistent across most QC metrics. Given this is an inherent biological property that cannot be readily controlled or changed, it is encouraging that the magnitude of this response is low across metrics and multiple studies19,22. However, proposed approaches to account for this genetic variation, including use of “Rosetta lines” to normalise data between multiple cell lines and remove technical variation19, warrant further investigation. Cell lines used in this study are widely available from the HipSci (Human Induced Pluripotent Stem Cell Initiative) project23 and National Institute of Neurological Disorders and Stroke (NINDS) repository, and as such, provide datasets to normalise studies against if required.

Variability attributable to non-genetic factors (i.e. induction set and operator) by far outweigh contribution to variance from genetic factors (i.e. cell lines). This was explored further in the data with linear modelling of self-reported operator years of cell culture experience (general, complex model, and iPSC, ranging 6–20 years), however no results of significance were observed (see supplementary table S1). This most likely suggests handling techniques that cannot be accurately recorded, unique to each operator, play a key role and should be considered in future similar studies. It has been previously demonstrated variation due to cellular heterogeneity (i.e. cell line) is amplified during technical handling of iPSCs22. As handling time increases, this variation continues to grow and amplify, aligning with statistical findings here that non-genetic factors/overall handling was the biggest contributor to variability. Furthermore, additional studies examining gene expression profiles show a higher variability in iPSC derived models when compared against the primary cell culture counterparts they are intended to model24 confirming that the differentiation process itself (analogous to “induction set” in our analysis) is introducing a significant amount of variation, supporting our statistical analysis. The use of large scale automated systems have been shown to reduce this source of variation25, effectively controlling for “induction set” and “operator” given the repeatable environment and handling cells are subject to. However, such systems are not widely available, and other handling techniques beyond operator training to reduce this variability are required.

iPSC genomic instability dictates differentiation culture purity and variability

Assessment of iPSC karyotypic abnormality status via bulk qPCR was shown to provide a quantifiable approach to reducing differentiation failure rates and improving variability. It has been well characterised that iPSCs with abnormalities possess reduced differentiation capacity, for example loss of neuroectoderm differentiation capacity following a 20q11.21 gain, or a failure to complete hepatic progenitor generation following a 12p13.31 gain26,27.

Interestingly, instability of chromosomes 12,17,18q and 20q all pertained to increased oncogenic activity in cells28,29,30,31 – this is a major issue within a clinical transplantation setting but also not desirable within a drug discovery setting. Within differentiations that were impure, a high proliferation rate was seen of either motor neuronal cells, or those of unknown origin. In the case of a 17q deletion, an extremely large increase in cells was observed in cultures, akin to previous studies29. Whilst they may appear motor neuronal in origin, these are effectively cancerous cells that are not physiologically representative or usable within a drug discovery setting.

Given low level chromosomal mosaicism has been shown in multiple routine iPSC cultures32,33, continued monitoring of cells for genomic stability is key to prevent oncogenesis and adverse differentiations. Ultimately this will reduce overall variability of produced cellular material.

Conclusion

In summary, this study highlights that whilst variation in motor neuronal purity remains high using the Hall et al. protocol (with coefficient of variance values exceeding 20%), this can partially be controlled for via regular iPSC chromosomal assessment. This assessment can be performed through simple, accessible and cost efficient methods such as RT-qPCR. Current variability defined here (both genetic and non) remains largely in line with values described elsewhere in literature19, and whether this can be reduced further remains a current hurdle within the field. A significant portion of this variability is attributable to non-genetic factors (~ 30–70%), and as such further in-depth technical analysis is required and encouraged for groups using this differentiation protocol regularly. At the very least both induction set and operator need to be included as factors in experimental design and as potential confounders in statistical analysis.

However in lieu of this knowledge, we show focussing on defining chromosomal stability is an accessible strategy for providing reproducible cellular models, especially within the motor neuronal space. Subsequently, this facilitates improved model interrogation and therapeutic discovery strategies within the MND field.

Methods

Cell line details

Cell lines sourced from HipSci (line ID): WTSli010-A, WTSli022-A, WTSli076-A; cell lines sourced from National Institute of Neurological Disorders and Stroke (line ID): NH500217, NH50305, NH50306, NH50218, NH50219. Cells thawed from minimum of passage 10 and maximum of passage 20. Cell line authenticity, sterility and mycoplasma free status confirmed internally.

iPSC maintenance

6 well tissue culture plates (Corning) for iPSC growth were coated with Geltrex (Gibco) at a 1:100 dilution in DMEM/F12 (Gibco) – plates were incubated at 37 °C, 5% CO2 1 h minimum/1 week maximum prior to cell seeding.

iPSC vials frozen in liquid nitrogen were thawed using E8 media with supplement (Gibco) and 10µM Y-27,632 (ROCK inhibitor; STEMCELL Technologies) (original Y-27632 stock dissolved in water). Media was changed next day for E8 with no Y-27,632 – cells were incubated at 37 °C, 5% CO2.

Cells were passaged with 0.5mM EDTA (Invitrogen) in PBS (Thermo Fisher), receiving daily media changes of 2mL/well E8 media with supplement warmed to room temperature.

Motor neuron differentiation

Protocol originally described by Hall et al.18. Briefly, iPSCs were allowed to reach 90–100% confluency during routine maintenance before starting a differentiation. Basal differentiation media was made as 50:50 aliquots of Neurobasal plus and DMEM/F12 with 1X B27 plus supplement, 1X Glutamax, 1X non essential amino acids, 1X N2 supplement, 10µM 2-mercaptoethanol, and 0.5 µg/mL insulin. Relevant small molecules were added for each differentiation stage, and possessed a final DMSO concentration of < 0.01%.

Daily differentiation media changes (warmed to room temperature) were performed (minimum of 2mL/well) and incubated at 37 °C, 5% CO2. Cells were treated with differentiation media across an 18 day period as follows: Day 0–6 = 1µM Dorsomorphin (Sigma), 2µM SB431542 (STEMCELL Technologies), 3µM CHIR99021 (STEMCELL Technologies); Day 7–13 = 0.5µM Retinoic acid (Tocris), 1µM Purmophamine (Sigma); Day 14–17 = 0.1µM Purmorphamine (Sigma); Day 18 (19–38 optional for cycling at neural precursor stage) = 10ng/mL FGF (Peprotech); Day 19 earliest for motor maturation = 0.1µM Compound E (Enzo Life Sciences).

Cells were split using dispase (Gibco) dissolved in PBS (10 mg/mL), and reseeded in appropriate differentiation media with 10µM Y-27,632. Splits were performed no earlier than day 4 of the differentiation (1:2 splitting ratio), and no more than 2 additional times (1:3 splitting ratios) over the whole 18 day period. Once cells had reached day 18 and achieved a neural precursor identity, cells were passaged with accutase (Invitrogen) and reseeded in differentiation media containing 10ng/mL FGF and 10µM Y-27,632.

Maturation was performed in 96 well imaging plates (Revvity) coated with Geltrex. Neural precursor cells were seeded in differentiation media containing 10ng/mL FGF and 10µM Y-27,632, and after a 24 h incubation at 37 °C, 5% CO2 for 24 h, media was aspirated and replaced with differentiation media containing 0.1µM Compound E.

Cell culture purity assessment – Immunocytochemistry

Motor neurons were matured in 96 well plates and fixed with 4% formaldehyde for 20 min at room temperature. Cells were then washed with PBS, permeabilised with 0.5% triton in PBS (Sigma) for 20 min, blocked with 10% bovine serum albumin (Sigma) and 20% fetal bovine serum in PBS (Gibco), and treated with primary antibody in 0.25% triton and 5% bovine serum albumin overnight at 4 °C. Wells were then washed with PBS and treated with secondary antibody in 0.25% triton and 5% bovine serum albumin for 1 h in the dark at room temperature. Cells were then washed with PBS and stained with Hoechst 33,342 (Thermo Fisher) in PBS for 10 min in the dark at room temperature. Cells were then washed in PBS, and imaged on the INCell Analyzer 6500 (GE Healthcare).

Images were analysed in Signals Image Artist (Revvity) using an automated pipeline and custom built script for each plate, with intensity thresholds for positive expression defined by iPSCs seeded on the plate (known to not express targets of interest28 and Hoechst intensity thresholds defining cell viability where appropriate.

Primary antibodies: NANOG (MABD24 – Merck), OCT4 (ab19857 – abcam), PAX6 (ab195054 – abcam), OLIG2 (157H00010215-M02 – Tebu), SMI32 (801701 – Biolegend), ISL1 (ab109517 – abcam), MAP2 (ab92434 – abcam).

Secondary anitbodies: Alexa Fluor 488 (A32723 – Invitrogen, Alexa Fluor 594 (A32740 – Invitrogen), Alexa Fluor 647 (A32933 – Invitrogen).

iPSC karyotypic abnormality assessment

iPSCs were cultured as previously described and prior to starting a differentiation, DNA was extracted from iPSCs grown in parallel from the same starting cellular material and purified using a DNA isolation kit (STEMCELL technologies) following the manufacturers’ protocol. Samples were frozen at -20 °C until ready to process.

Upon thawing on ice, DNA concentration and quality was assessed on a nanodrop (Thermo Fisher). Material with an A260/280 reading of 1.8–2.0 and an A260/230 reading of 1.9–2.2 was considered acceptable for further processing. Material was then prepared for qPCR assessment via a genetic analysis kit as per the manufacturers’ protocol (STEMCELL technologies) – primers were designed to detect 70% of the most recurrent karyotypic abnormalities reported in human stem cell cultures found in Chr 1q, 8q, 10p, 12p, 17q, 18q, 20q, Xp (primers available from STEMCELL technologies - #07550).

The ViiA 7 Real-Time PCR System (Thermo Fisher) was used to take readings in a 384 well format, ΔCT values per loci were calculated via normalisation against Chr 4p (following manufacturer recommendations), and data was analysed using one way ANOVAs across all loci and post-hoc tukey tests to confirm p-values.

Linear model variability analysis

The following linear model was used to calculate variation attributable to key factors:

$$\:{y}_{QC}=\:{\beta\:}_{1}a\:+{\beta\:}_{2}b+{\beta\:}_{3}c\:$$

Where:

$$\:{y}_{QC}=Quantitative\:QC\:metric$$
$$\:{\beta\:}_{x}=Linear\:Model\:Coefficient$$
$$\:a=Operator$$
$$\:b=Induction\:Set$$
$$\:c=Cell\:line$$

Analysis was performed in R, with linear models being created for each quantitative QC (quality control) metric as defined in the QC pipeline. Diagnostic plots (residuals vs. fitted, normal Q-Q, scale-location, and residuals vs. leverage) were examined in each case to determine suitability of linear modelling – in cases where models were not acceptable, data was log transformed if appropriate.

Statistics

Significance testing via t-tests (for quantitative data) and chi-squared tests (for qualitative data) were performed in in Prism (version 9.4.1; GraphPad) and Excel (Microsoft). Variance testing (F testing) was carried out prior to significance testing to determine whether heteroscedastic or homoscedastic testing should be applied.