Transcriptomic profiling and machine learning uncover gene signatures of psoriasis endotypes and disease severity

Rider, Ashley; Grantham, Henry J.; Smith, Graham R.; Watson, David S.; Casement, John; Cockell, Simon J.; Gisby, Jack; Foulkes, Amy C.; Henkin, Rafael; Iqbal, Wasim A.; Ewen, Tom; Amarnath, Shoba; Ng, Sandra; Zuliani, Paolo; Dand, Nick; Stocken, Deborah; Traini, Christopher; Thomas, Elizabeth; Kalyana-Sundaram, Shanker; Rajpal, Deepak K.; Smith, Kathleen M.; Barker, Jonathan N.; Griffiths, Christopher E. M.; Di Meglio, Paola; Smith, Catherine H.; Warren, Richard B.; Barnes, Michael R.; Reynolds, Nick J.

doi:10.1038/s43856-025-01325-4

Download PDF

Article
Open access
Published: 21 January 2026

Transcriptomic profiling and machine learning uncover gene signatures of psoriasis endotypes and disease severity

Communications Medicine volume 6, Article number: 65 (2026) Cite this article

3876 Accesses
1 Citations
68 Altmetric
Metrics details

Subjects

Abstract

Background

Despite increased understanding of psoriasis pathogenesis, molecular classification of clinical phenotypes and disease severity is poorly defined. Knowledge gaps include whether molecular endotypes of psoriasis underlie distinct clinical phenotypes and the positive and negative molecular regulators of disease severity across tissue compartments.

Methods

We performed comprehensive RNA sequencing of skin and blood (n = 718) from prospectively-recruited, deeply-phenotyped discovery and replication cohorts of 146 subjects with moderate-to-severe chronic plaque psoriasis initiating TNF-inhibitor (adalimumab) or IL-12/23-inhibitor (ustekinumab) therapy.

Results

Here we show, using two complementary dimensionality reduction methods, that co-expressed gene modules and factors within skin and blood are significantly associated with psoriasis phenotypes and disease severity. We identify a 14-gene signature negatively associated with BMI in nonlesional skin and with disease severity in lesional skin. Genotype integration reveals that HLA-DQA1*01 and HLA-DRB1*15 genotypes are positively associated with baseline psoriasis severity. Using explainable machine learning models, we define two disease severity-associated gene modules in lesional skin - one positive, one negatively-associated - and a 9-gene signature in lesional skin predictive of disease severity. Disease severity signatures in blood are only seen following adalimumab exposure, suggesting greater systemic impact of adalimumab compared to ustekinumab, in line with its side effect profile. In contrast, a gene signature in blood linked to HLA-C*06:02 status is independent of disease severity or drug.

Conclusions

These findings delineate gene-environmental and genetic effects on the psoriasis transcriptome linked to disease severity.

Plain language summary

Psoriasis is a common and debilitating skin disease, linked to other inflammatory conditions. A lot is known about what causes psoriasis and the factors that influence it, but doctors still cannot offer personalised treatments. This is because it has been difficult to understand what makes psoriasis more or less severe, why people respond differently to treatment, or why some people develop related diseases. To help address this, we collected skin and blood samples and personal information from people with severe psoriasis across the United Kingdom. Using computer-based methods, we found shared biological processes that link the disease with obesity and help predict its severity.

Integrated bioinformatic analysis of gene expression profiling data to identify combinatorial biomarkers in inflammatory skin disease

Article Open access 07 April 2022

Psoriasis

Article 26 June 2025

Integrated single-cell and spatial transcriptomics reveals heterogeneity of fibroblast and pivotal genes in psoriasis

Article Open access 10 October 2023

Introduction

Psoriasis is a common, multifaceted immune-mediated inflammatory disease (IMID) characterised by symmetrical erythematous, hyperplastic and scaly plaques affecting the skin, with associated systemic inflammatory disorders including psoriatic arthritis, cardiovascular disease and metabolic syndrome, which contribute to premature mortality¹. The aetiology and pathophysiology of psoriasis is complex and multifactorial¹. Over the last two decades significant progress has been made in understanding the pathophysiology of psoriasis and the contributions of various factors, including: genetic predisposition²; environmental factors including infection, trauma and diet; and acquired immune (e.g. T helper 1 (Th1), T17 cells, IL-17, IL-23 cytokines) and innate autoinflammatory factors (e.g. TNF, IL-36) which represent targets for highly effective biologic therapies. However, less progress has been made in translating these advances into individualised patient care. In part, this relates to significant knowledge gaps along the translational pathway that include: (a) whether molecular endotypes within clinically homogeneous stable plaque psoriasis underlie distinct clinical phenotypes (e.g. sub-groups of subjects with specific comorbidities), (b) the positive drivers and negative molecular regulators of disease severity across tissue compartments, and (c) the relationship between molecular endotypes and clinical response to therapy, including the side effect profiles of targeted therapies.

To address these questions, Psoriasis Stratification to Optimise Relevant Therapy (PSORT), an academic-industrial UK stratified medicine consortium³, prospectively recruited formally-powered, deeply-phenotyped discovery and replication psoriasis patient cohorts during the early phase of treatment with two distinct biologics, adalimumab (TNF inhibitor) and ustekinumab (IL-12/23 inhibitor) (Fig. 1a, b). Utilising this large multiomic dataset, we aimed to identify gene networks linked to specific disease endotypes, defined by clinical and phenotypic features measured at baseline (e.g. BMI), and to disease severity endotypes, defined by Psoriasis Area and Severity Index (PASI)-associated gene expression profiles (Fig. 1c, d).

**Fig. 1: Summary of study design, including patient recruitment, sample collection and analysis methodology.**

Gene expression is influenced by multiple factors across different cell types. Network analysis of bulk RNA-seq data from large clinical cohorts can capture coordinated patterns of expression by identifying underlying latent factors and co-expression modules. Moreover, these gene networks can more accurately represent key biological processes and generate signatures for disease classification and therapeutic response prediction, compared with analyses of individual genes⁴. We therefore hypothesised that such network signatures would map to specific clinical phenotypes of psoriasis, including disease severity over time in response to biologic therapy.

Our integrative multiomic analysis across lesional and nonlesional skin and whole blood identifies gene signatures in distinct tissue compartments and cell types that classify psoriasis endotypes and associate with disease severity. These signatures highlight both pathogenic pathways active in psoriasis and systemic immune processes detectable in blood, distinguish disease endotypes linked to genetic factors, and identify reproducible biomarkers of disease severity. Together, these findings provide a framework for understanding the molecular heterogeneity in psoriasis.

Methods

Prospective observational study

This study included 146 subjects with moderate to severe chronic plaque-type psoriasis (PASI > 10) recruited prospectively into the Psoriasis Stratification to Optimise Relevant Therapy (PSORT) study at 6 centres in the UK between May 2015 and May 2018 and due to start biologic therapy (ustekinumab or adalimumab) as part of routine clinical practice³. Exclusion criteria included use of systemic or biologic treatments in the two weeks prior to study entry (or four x t½ of last treatment, whichever was longer), use of PUVA therapy for 3 months or UVB for 1 month prior to study entry or use of topical treatments at the site of biopsies (except for emollients) for 2 weeks prior to study entry, as well as serious/uncontrolled systemic disease. We studied 89 psoriasis subjects within the discovery cohort and replicated findings in a further cohort of 57 subjects. For the replication cohort, samples for RNA sequencing were selected from patients whose treatment response (PASI 50/75/90 and non-responders) broadly matched those in the earlier discovery cohort. Selection was based solely on clinical response and did not consider molecular or demographic data. Subjects commencing adalimumab by subcutaneous injection received 80 mg at baseline, then 40 mg at week 1, then 40 mg every 2 weeks as per label and those starting ustekinumab received 45 mg or 90 mg according to body weight, as per label. Participants self-administered doses that fell between study visits and the time and date of these were recorded in the Case Report Form. The Psoriasis Association provided Patient and Public Involvement and Engagement, which influenced the study design. The study was conducted in accordance with the declaration of Helsinki, was approved by the London Bridge research ethics committee (REC reference: 14/LO/1685PSORT) and subjects provided written informed consent.

Patients completed detailed demographic questioning, including reporting information on comorbidities and concomitant and previous medication. Disease severity and response to therapy were assessed using the PASI, Physician Global Assessment (PGA) and DLQI. Clinical samples, including blood and lesional skin punch biopsies (edge of psoriasis plaque with site preference for lower back or buttock), were collected under local anaesthetic at baseline, one week (prior to the second injection of adalimumab) and 12 weeks of treatment. Biopsies were derived from the same body sites at each time point and, if possible, lesional biopsies were taken from the same plaque as baseline. Non-lesional skin with a minimum distance of 2 cm from the edge of nearest plaque was also collected at baseline and week 12 (and a minimum distance of 2 cm between initial and subsequent biopsies) and a further blood sample was taken at 4 weeks. Patients had been identified in line with recommendations for initiation of biologic therapies in the UK. Screening investigations had been completed prior to recruitment. Adverse events were recorded but did not form part of primary analysis.

Power calculation

Based on our affiliated pilot investigation⁵, using the method of Guo et al.⁶, we calculated the requisite sample size to achieve 90% power to detect differential expression associated with response. Imposing a 5% FDR threshold and a target log fold change of 1.5, we determined that a study would require 40 subjects to achieve 90% power to identify transcriptomic markers of biologic response for patients with chronic plaque psoriasis. Power curves projected across an expected range of fold changes at 1% and 5% differential expression in Supplementary Fig. 1.

RNA extraction and quality control

Skin samples

RNA was preserved in the skin punch biopsies using RNAlater Stabilization Solution (Invitrogen AM7022). Biopsies were stored at 4 °C in RNAlater overnight and the solution removed prior to long term storage at −80 °C (according to the manufacturer’s instructions).

Biopsies were transferred into pre-cooled 2 ml lysing tubes (Precellys, CK mix) containing lysis buffer (10 µL 2-ME/mL RLT Plus) supplied in the Qiagen AllPrep DNA/RNA Kit (Cat. No. 80204). Homogenisation was performed in the TissueLyser LT (Qiagen Cat. No. 85600) over 10 ×2-min cycles at 50 Hz. Samples were cooled on wet ice for one minute between cycles. Tissue debris was pelleted at 13,000 RPM for 3min, and the supernatant was transferred to an Allprep DNA spin column. DNA/RNA was then extracted following the Qiagen AllPrep kit’s protocol. RNA concentration/integrity was checked using the Agilent Bioanalyzer 2100 with the RNA 6000 Nano assay (Agilent: 5067-1511); only samples with an RNA integrity number (RIN) of 8 or more were sequenced.

Blood samples

RNA was isolated from human whole blood collected in PAXgene blood RNA tubes (Qiagen #762165) utilising the QIAsymphony SP (Qiagen #9001297) with the QIAsymphony PAXgene Blood RNA kit 96 (Qiagen #762635). The manual processing of the whole blood samples was performed following the manufacturer’s protocol and loaded onto the QIAsymphony SP. The RNA isolation protocol implemented was a custom protocol based on the standard automation protocol PAXgene_RNA_V5.xml. The custom protocol is PAXRNA_CR22332_2915.xml and contains the following modification to the standard protocol “elution buffer taken out of accessory trough. Accessory trough will be displayed as ETOH on the touch screen.” The elution buffer used was Invitrogen UltraPure DNase/RNase-free distilled water #10977015. Whole blood RNA samples were eluted into 80 μL of Invitrogen UltraPure DNase/RNase-free distilled water #10977015 and plated into 8×12 elution plates. RNA quality control for quantity was performed with Qubit RNA Broad Range (BR) (ThermoFisher Scientific #Q10211) on a Molecular Devices Gemini plate reader following the manufacturer’s protocol for reagent and sample preparation. RNA quality control for integrity was performed with an Agilent TapeStation 4200 (Agilent #G2991BA) using the RNA Screentape assay (Agilent #5067-5576,77,78) following the manufacturer’s protocol.

RNA sequencing

The sequencing libraries for the PSORT-D skin samples were prepared using the Illumina Truseq stranded mRNA kit and sequenced on an Illumina HiSeq 3000 with 2x101bp read length. The sequencing libraries for the PSORT-D blood samples were prepared from total RNA using the Kapa mRNA HyperPrep kit and depleted of rRNA and globin mRNA using the QIAseq FastSelect RNA Removal Kit by Qiagen; sequencing was done on an Illumina HiSeq 4000 using 2x75bp read length. The sequencing libraries for the PSORT-R skin samples were prepared using the Illumina Truseq stranded mRNA kit and sequenced on an Illumina NovaSeq 6000 with 2x250bp read length. As the approved participant consent forms did not include permissions for the sharing of personally identifiable raw sequencing data, we provide raw and adjusted gene count data from our RNA-seq analysis. The count data are available at Array Express under accession number E-MTAB-14509. This approach allows us to share valuable processed data for replication, validation and further analysis while respecting participant privacy and adhering to governance and ethical guidelines.

Genotype data and HLA imputation

DNA was isolated from blood using standard methods. Genotyping was performed with Illumina HumanOmniExpressExome-8 v1.2 and v1.3 BeadChips, followed by quality control with standard tools, as previously described⁷. HLA-C*06:02 (HLA-Cw6) genotype was imputed using SNP2HLA (version 1.0.3) based on the Type 1 Diabetes Genetics Consortium reference panel⁸.

Genomic and transcriptomic data analysis workflow

Analysis was conducted in the R statistical computing environment (R Core Team, 2021). Sample data analysis scripts can be found on our GitHub repository (https://github.com/C4TB/PSORT), along with extended supplemental markdown documents.

A graphical summary of the samples used for analysis is available in supplementary fig. 21. Several blood samples were RNA sequenced but identified as technical failures based on exploratory analysis and so were excluded. Following RNA sequencing of all samples, reads were pseudo-aligned using Kallisto⁹. Transcript counts aggregated gene-wise and were TMM normalised prior to modelling¹⁰ and transformed to the log₂-CPM scale. An expression filter was applied to ensure that a gene has at least one count per million (CPM) in at least 5% of all libraries, leaving 16,172 genes.

Exploratory data analysis

Principal component analysis identified tissue, time, and disease activity as the key drivers of transcriptome variation (Supplementary Fig. 22c), with relatively limited influence of demographic and other factors previously associated with therapeutic response.

Differential expression analysis

Differential expression was tested using heteroskedastic linear models and empirical Bayes shrinkage as implemented by the voom function in the limma software package¹¹. We compute q-values for each differential expression test using Storey’s method¹², with a false discovery rate (FDR) threshold of 5%.

We built separate models to test a number of related hypotheses. We have two primary goals for this portion of the experiment¹: to identify genes that associate with disease phenotype (e.g BMI) and² to identify genes that correlate with PASI irrespective of time points. We refer to these as the disease, and disease severity endotypes, respectively.

Disease severity model

Disease activity is measured at each time point by PASI. Building on the design of our trial study⁵, we split the data by tissue and analysed samples from both treatment arms with coefficients for each drug. We accounted for repeated observations using the duplicateCorrelation function, which approximates a mixed model design in which patient ID is treated as a random effect. Because preliminary investigations suggested that gene expression is often a nonmonotonic function of PASI, we expanded the model using a cubic spline basis of degree 3. An intercept term was included for each drug and the spline coefficients were also allowed to vary depending on the drug, except for the model underlying the volcano plot in Fig. 6, which was independent of drug. In the lesional skin and nonlesional skin models, an intercept for the Cohort (Discovery/Replication) was included. In addition to the q-value criterion, we define a signed fit range equal to the minimum to maximum range of the fitted log2CPM expression, with the sign given by the sign of the gradient at PASI = 0. In the Venn Diagrams and Volcano plots of Figs. 6, 7, we include only those genes that were assigned to a WGCNA module (see the section “Identification of gene coexpression modules” below).

Dimensionality reduction

WGCNA and ICA decompose transcriptomes of many thousands of gene transcripts into a dataset comprised of a much smaller number of gene modules, overcoming the limitations inherent in gene-level analysis, including lower signal-to-noise ratios and a higher multiple testing burden¹³, enhancing the statistical power to detect true endotype associations. These methods also reduce data complexity, offering a more holistic view of biological pathways and networks by focusing on co-expressed genes organised mutually exclusively into modules (WGCNA) or independent components (ICA), which reduces high-dimensional data into a smaller set of latent variables (or factors), with the objective of describing unobserved processes that explain patterns in gene expression, allowing for genes to belong to multiple pathways. Each module is represented by an eigengene and each factor by a metagene. Eigengenes and metagenes represent summary expression values for modules and factors, respectively. These approaches not only provide robustness against noise but also reveal biologically relevant patterns and potential novel mechanistic insights which may be missed in individual gene-level analyses.

Whereas individual genes can only be assigned to one WGCNA module, ICA allows single genes to be weighted across multiple factors in both up or down regulated states, reflecting the involvement of genes in multiple shared biological processes (Fig. 1d). These approaches are complementary, with the former being more explainable and the latter more likely to represent biological complexity.

Within skin and blood, all samples were used for WGCNA and ICA, i.e. both lesional and nonlesional samples (within skin), both drug cohorts and all time points.

Identification of gene coexpression modules

The following steps were carried out for the PSORT-D skin and blood data separately in order to identify co-expressed gene modules in each tissue compartment. Prior to running WGCNA, the gene-level counts were filtered using a threshold that required at least one CPM in at least n/k libraries, where n equalled the number of samples and k equalled the number of unique combinations of tissue type, drug, and time point. The counts were then normalised using the TMM method and transformed to log2-CPM. Selection of the appropriate soft-thresholding power β was done by plotting the values 1–20 against R², a measure of scale-free topology, and mean connectivity. The lowest value which reached the R² threshold of 0.8 was chosen; a β of 12 was chosen for skin and 5 for blood (Supplementary Fig. 23). The blockwiseModules function was then used to partition the genes into co-expressed modules. In brief, this function calculated the Pearson correlations between each pair of genes and raised these estimates to the selected β power in order to amplify the differences between high and low correlations. These correlations were then used to generate a topological overlap matrix (TOM) and hierarchical clustering of this matrix was used to group genes with similar expression profiles into modules. Parameters to blockwiseModules included a minimum module size of 30, a dendrogram cut height (for merging of similar modules) of 0.1, and the use of a signed network so that the correlations between genes were scaled to lie between 0 and 1.

Module-trait correlations

The moduleEigengenes function was used to derive eigengene values for the skin and blood modules in every sample. An eigengene represents a summary expression score for a module and is analogous to the first principal component of the expression matrix for that module. Although module identification was not carried out in PSORT-R, the module assignments in PSORT-D were used to derive eigengenes for these modules in PSORT-R as well. Pearson correlation (with pairwise complete observations) was used to identify associations between modules and traits of interest. These included disease traits at baseline: age of onset, onset type (early/late), anti-TNF naïve status (Y/N), PsA (positive/negative), sex, age, BMI, and HLA-Cw6 status (positive/negative); and PASI across time in each drug cohort. Binary traits were encoded as one and zero. In skin, the module-trait correlations were carried out separately for the lesional and nonlesional samples. Significant correlations were defined by FDR ≤ 0.05; in skin, replicable correlations were defined by FDR ≤ 0.05 in the discovery cohort and nominal p-value ≤ 0.05 in the replication cohort and correlation of the same sign in both cohorts. Only traits with at least one significant correlation are displayed in the module-trait correlation heatmaps.

Independent component analysis

Independent component analysis (ICA) was applied to identify latent variables separately in both skin (discovery cohort) and blood expression data. In each case, we included samples from both treatments and all timepoints, and we centred the data prior to factor analysis. We used the “imax” method implemented in the ica R package¹⁴, using the maximally stable transcriptome dimension (MSTD) approach to select the optimal number of factors to compute¹⁵ implemented in the ReducedExperiment package¹⁶. The number of factors recommended by MSTD was 24 for the skin expression data and 21 for blood. In order to validate the identified skin signatures, we projected the expression data from the replication cohort into the factor space defined in the discovery cohort. As a result, the feature loadings (i.e., the source signal estimates) for the skin discovery and replication cohorts are equal, permitting the investigation of the same factors in each cohort.

Factor metagenes were calculated by taking the scaled values of the estimated mixing matrix (Supplementary File Factor tables). These factor metagenes were then associated with phenotype using the same modelling approach as we employed for the module eigengenes (see Module-trait correlations, above). For factors, we additionally calculated correlations with HLA genotypes and baseline PASI. This was carried out using the combined discovery and replication cohorts following batch effect correction using limma’s removeBatchEffect function.

For analyses that required a defined set of genes, we selected a set of highly aligned genes for each factor based on their loadings (Supplementary Data 5s). We selected genes with loadings that exceeded a threshold. By default, we defined this threshold at half the maximal loading for that factor (Supplementary Data 5). For some analyses (functional enrichment analysis, BMI associations) we used a more relaxed threshold of 5; where this method resulted in the selection of less than 20 genes, we instead extracted the top 20 features.

Deconvolution

An abundance of cell types was inferred using the CibersortX online tool¹⁷. To infer cell types in skin, a single-cell reference matrix was generated using single-cell RNA sequencing data from 38,274 skin cells across five inflammatory skin conditions, including psoriasis¹⁸ downloaded from the Single Cell portal developed by the Broad Institute of MIT and Harvard (https://singlecell.broadinstitute.org/single_cell). Due to memory limitations imposed by the CibersortX tool, the size of the reference matrix was reduced by downsampling to a maximum of 200 cells per cell-type. The reference matrix was used to generate a Signature Matrix file using recommended CibersortX settings. Bulk RNA-seq data for both PSORT discovery and replication cohorts were used to generate the mixture file. Cell fractions were imputed in absolute mode using ‘B-mode’ batch correction.

To infer cell types in blood, the LM22 signature matrix provided by CibersortX was used. The mixture file was generated from PSORT blood RNA-seq data. Cell fractions were imputed using recommended CibersortX settings.

Correlations of cell type fractions with latent factors and module eigengenes

Per-sample imputed absolute cell fractions for each cell type were tested for association with per-sample WGCNA module eigengene loadings and with per-sample Latent Factor loadings using the cor.test function from the R statistical computing environment. Correlation tests used Kendall’s tau coefficient. Adjusted p-values were calculated with the p.adjust function using the method of Benjamini & Hochberg.

Pathway analysis

Functional analysis of systems-level upstream regulators responsible for observed differential gene expression related to response was performed using the Upstream Regulator function in Ingenuity Pathways Analysis¹⁹, using all genes with nominal response p ≤ 0.05 as input. For all gene set enrichment analyses, a right-tailed Fisher’s exact test was used to calculate a pathway p-value determining the probability that each biological function assigned to that data set was due to chance alone. All enrichment scores were calculated in IPA using all transcripts that passed QC as the background data set. Upstream regulator analysis is based on prior knowledge of expected effects between regulators and their known target genes according to the IPA database. The prediction of activation state is based on the global direction of changes of differentially expressed genes; a z-score is calculated and determines whether gene expression changes for known targets of each regulator are correlated with what is expected from the literature for an activation of this pathway.

Enrichment analysis to annotate the function of WGCNA-defined gene modules was conducted using Metascape, a platform used for inclusive gene list annotation and source analysis (https://metascape.org/)²⁰.

Predicting PASI scores using Gaussian process and ridge regression models

The end goal of this study was to determine the overall disease endotypes and phenotypes responsible for the progression and severity of psoriasis. Machine learning (ML) models are being increasingly adopted across the life sciences as decision-making tools. ML models involve general-purpose algorithms that learn patterns from high-dimensional datasets for performing prediction tasks. On the other hand, statistical models (e.g., linear regression) are often more suitable for inference (i.e., distinguishing whether one or more variables are signals or noise)²¹. Most machine learning algorithms involve supervised tasks, i.e., mapping one or more feature inputs to corresponding labels (i.e., the ground truth) and making predictions for similar but unseen labels. A neural network is a popular machine learning choice for supervised tasks but requires thousands of labelled examples for learning. Given the size of this study’s dataset (n 139 patients and 339 time-points) a Gaussian process regression (GRP) was explored²².

Gaussian processes (GPs), a family of Bayesian models, have been shown to perform well on a range of modelling tasks given a limited amount of data^23,24,25,26.

Two key features allow GPs to model a range of problems with limited data. Firstly, in the absence of testing data, GPs provide measures of uncertainty to determine how close predictions are to examples in the training dataset²². Secondly, GPs attempt to model a distribution over functions f(x). This is specified using a kernel covariance function that makes some basic prior assumptions about the relationship such as whether the functions are smooth, linear or rough. The kernel covariance function also makes the basic assumption that data inputs that are closely related are more likely to have similar labels. When the relationship is unknown, a popular choice of kernel is the non-linear Matern 5/2 kernel, which was adopted in this study²². Further, the kernel function can be decomposed into low-order functions that can be used to model feature inputs additively²⁷. Many relationships can be decomposed into additive parts, for instance, the price of a building can be broken down into the individual building materials. If the relationship depends jointly on additive low-order interactions, the sum of kernels can be used to model the relationship. If not, the kernels will still specify a suitable model. In this study, an additive non-linear kernel function and Gaussian process model were implemented using the GPflow python package (version 2.5.2)²⁸.

During training, the kernel hyperparameters including the length scale ι and variance σ² were tuned for each feature input by maximising the probability of observing the data points, known as the marginal likelihood, on an independent 10-fold validation dataset. Predictions were then obtained from the trained GPR regression.

As a baseline comparison model, a linear ridge regression model was chosen. This model is a special case of a linear regression model that includes an L2 regularisation function to reduce overfitting. A range of hyperparameters were chosen to tune the L2 regularisation parameter on an independent 10-fold validation dataset.

Predictions from the trained models were assessed using shuffled 10-fold held-out testing patient datasets. Performance metrics such as the coefficient of determination (R^2) and mean absolute error (MAE) were calculated for each shuffled dataset.

A total of 4 GPR regression and 4 ridge regression models were trained and tested for predicting PASI scores using several features (i.e. Demographics + clinical features, Skin factors and Skin RNA modules). Notably, to reduce the influence of extreme outliers and allow more direct comparison between feature inputs, models adopted log transformed PASI scores and feature inputs normalised using the robust scaler technique. The robust scaler technique uses the feature median and interquartile range rather than the mean and standard deviation.

We compared the performance of the GPR model to a linear ridge regression model. Through comparison to this baseline method, we were able to determine whether the non-linearity captured by the GPR models improved the predictive accuracy (Supplementary Tables 3–5).

To determine the features driving the model relationships, SHAP (SHapley Additive exPlanations) method²⁹ was adopted. SHAP method is a popular and model-agnostic approach for explaining machine learning outputs. SHAP assesses the impact of each feature on the predicted output while keeping other features unchanged. Higher SHAP values indicate features causing a higher change in the predicted output value and lower SHAP values indicate the opposite.

Statistics and reproducibility

Module identification with WGCNA was done using Pearson correlation with pairwise complete observations. Similarly, trait correlations with WGCNA module eigengenes and ICA latent factors were calculated using Pearson correlation with pairwise complete observations. The trait correlations were calculated in lesional skin, non-lesional skin and blood separately and with different sample subsets for each variable type: correlations with the disease endotype variables (i.e. age of onset, onset type, anti-TNF naive status, PsA, sex, age, BMI, HLA-Cw6 status) were calculated using the baseline samples from both drug cohorts and correlations with PASI were calculated for each drug cohort separately using samples from all time points. To account for multiple testing within each tissue, the p.adjust function with method “fdr” in R was applied to the trait correlation p-values for modules and factors separately. Additionally, correlations of module eigengenes and latent factors with per-sample imputed absolute cell fractions were calculated using Kendall’s tau coefficient and FDR-adjusted p-values were derived as above. P-values from gene-level differential expression modelling of disease severity were adjusted using Storey’s method¹². Further details about statistical methodology are available under the relevant subsections in the materials and methods section.

Results

Study design

Our study design was previously reported^3,30. The timing of sample collection and details of sample numbers are illustrated in Fig. 1 (see main and supplementary Materials and Methods). We studied 89 subjects with stable plaque psoriasis initiating biologic therapy, 82 of whom provided skin biopsies (41 with adalimumab and 41 with ustekinumab; 400 total samples) and 83 of whom provided blood samples (40 with adalimumab and 43 with ustekinumab; 318 total samples) (discovery cohort) (supplementary Materials and Methods). We replicated findings in a further cohort of 57 subjects who provided skin samples (29 with adalimumab, 28 with ustekinumab; 276 total samples). Power calculations based on Guo et al.⁶ and a pilot study⁵ indicated a discovery cohort sample size requirement of 40 (Supplementary Methods and Supplementary Fig. 1). Subject characteristics for included participants are shown in Supplementary Table 1a, b.

Identification of gene expression signatures in skin and blood

To define the relationship between transcriptional signatures and clinical phenotypes, we used two complementary methods for dimensionality reduction, Weighted Gene Correlation Network Analysis (WGCNA) and Independent Component Analysis (ICA) (Fig. 2a, b, Materials and Methods)⁴. In brief,WGCNA groups genes into modules based on co-expression and ICA identifies latent variables that describe patterns of expression variation in the data⁴. WGCNA-derived eigengenes summarise the main trend in co-expressed gene modules, while ICA-derived metagenes emphasise independent patterns of gene expression⁴. To facilitate comparisons between lesional and non-lesional skin across time, all skin samples from the adalimumab and ustekinumab drug cohorts at weeks 0, 1 and 12 were analysed together. Due to extensive transcriptomic variation between skin and blood, these tissues were analysed separately.

**Fig. 2: Systems-level gene modules and latent factors correlate with disease and disease severity endotypes in psoriasis.**

WGCNA identified 34 co-expressed gene modules in lesional skin and nonlesional skin across all time points (Supplementary Fig. 2a). Individual modules contained between 34 and 2677 genes. ICA identified 24 factors in lesional skin and nonlesional skin across all timepoints. WGCNA of the blood RNA-seq data identified 26 co-expressed gene modules (Supplementary Fig. 2b) with individual modules containing between 50 and 1333 genes. ICA identified 21 factors in blood. Many of the factor metagenes were highly correlated with module eigengenes in both tissues (Supplementary Fig. 3a, b), indicating that the methods converged on similar key signatures, providing cross-validation.

To define disease and severity endotypes, module eigengenes and factor metagenes were correlated with clinical phenotypes and HLA-Cw6 genotype status at baseline, which were designated as disease endotypes; and with PASI across all time points, termed disease severity endotypes. These associations were replicated by independent testing within the replication cohort (Fig. 2a, b).

To further define the functional relevance of the co-expressed WGCNA modules and ICA factors, systems analysis was performed. Further information about the modules and factors, including the top pathway enrichments and exemplar aligned genes, is available in Supplementary Data 5 and described in the results below.

We carried out module preservation analysis to assess the degree to which skin modules were preserved in blood and vice versa. The skin modules black (translation) and tan (oxidative phosphorylation) exhibited strong evidence for preservation in blood and were found to have the most significant gene overlap with the gold (translation) module in blood (Supplementary Fig. 4a, b). The green module (mRNA processing) was also strongly preserved in blood and exhibited significant gene overlap with the olivedrab (oxidative phosphorylation) module in blood (Supplementary Fig. 4a, b).

Disease endotype

Significant associations of clinical phenotypes with WGCNA modules and ICA factors were found in both lesional skin and nonlesional skin (Fig. 2a) at baseline. Notably, we observed that the lightyellow module (insulin and hormone secretion) and factor S9 (obesity-associated) each displayed significant and replicable negative associations between their eigengene expression and i) BMI in nonlesional skin and ii) PASI in lesional skin (Fig. 2a, c; Supplementary Fig. 5). This supports our earlier observed association between high BMI and poor clinical response in a larger cohort of patients³¹. Functional enrichments for lightyellow and factor S9 included: secretory pathways, hormone signalling, transport of small molecules and Wnt signalling (Supplementary Data 5). Twenty-one genes within the intersection of lightyellow and factor S9 were independently negatively associated with BMI (Fig. 3a, b). Notably 14 of these genes, including SCGB1D2 (Secretoglobin Family 1D Member 2), MMP7, DNER (Delta/Notch Like EGF Repeat, regulates adipogenesis) and PDE9A (stimulates lipogenesis) were negatively associated with PASI in lesional skin across all time points (Fig. 3c). Deconvolution of bulk RNA-seq data with single cell RNA-seq data from 38,274 skin cells¹⁸ revealed that lightyellow, factor S9, and the 14 gene core BMI/PASI signature (including SCGB1D2, DNER and PDE9A) were highly enriched with genes expressed within the pilosebaceous unit (Supplementary Figs. 6, 7b, 8).

**Fig. 3: A BMI-related transcriptomic signature in non-lesional skin is also linked to disease severity in lesional skin.**

Two blood factors were significantly associated with HLA-Cw6 genotype status (as a binary trait), with factor B8 (inflammatory and HLA-aligned B) being positively correlated and factor B19 (inflammatory and HLA-aligned C) negatively correlated (Fig. 2b). These factors were enriched for antigen processing, graft versus host disease and allograft rejection, with these enrichments being driven predominantly by HLA- genes (Supplementary Data 5). The factors were highly aligned with the expression of genes in the 6p21 gene region, including PPP1R18, HLA-DRB1, HSPA1L, GPSM3, HSPA1A, LY6G5B, CSNK2B and TUBB (factor B8); and HLA-DRA, HLA-DQB1, AGPAT1, PSMB9, HLA-DQA1, HLA-DRA and HLA-DMB (factor B19).

Disease severity endotype

Few previous studies have studied the relationship between gene expression patterns and disease severity across different tissues in psoriasis^32,33. We therefore investigated whether gene expression within lesional skin, nonlesional skin and blood was associated with whole body disease severity scores at the time of sampling. PASI is the gold-standard disease severity measure and represents the average redness, thickness, and scaliness of psoriasis lesions weighted by the area of involvement (Fig. 1b, top panel)³⁴.

Lesional skin

Remarkably, 21 out of 34 WGCNA modules in lesional skin showed highly significant and reproducible correlations with disease severity in at least one of the drug cohorts (Fig. 2a); 12 showed positive correlations and 9 negative correlations with disease severity. Cross-correlation of the key modules and factors identified two distinct blocks (i and ii) (Supplementary Fig. 3a). Strikingly, the first block (i) comprised modules and factors (e.g. turquoise module and factor S1; cytokine and anti-microbial peptide signalling) positively associated with disease severity whereas the second block (e.g. yellow module and factor S9; ECM and Insulin and hormone secretion signalling) (ii) was negatively associated with disease severity (Fig. 2a; Supplementary Fig. 3a). Exemplar scatter plots are shown in Fig. 2c and Supplementary Fig. 9.

To gain further insight, we deconvoluted the bulk RNA-seq data using single-cell RNA-seq data¹⁸. Cluster analysis showed the presence of two main blocks that were distributed according to positive or negative disease severity associations (Supplementary Fig. 6). For example, we observed resolution of keratinocyte subsets into two distinct clusters, consistent with previous single cell³⁵ and spatial transcriptomic studies³⁶. Thus, keratinocyte-1 (KRT1⁺, KRT10⁺ S100A8/9⁺ spinous), keratinocyte-4 (KRT1⁺, KRT10⁺ S100A8/9⁺ spinous), keratinocyte-6 (KRT5⁺, KRT14⁺ KRT1⁺, KRT10⁺ PCNA⁺ supra-basal) and keratinocyte-7 (KRT5⁺, KRT14⁺, COL17A1⁺ basal) subsets (Supplementary Fig. 7a) correlated with positively-associated disease severity modules and factors (e.g. turquoise module, darkgreen module and factor S1) whereas keratinocyte-3 (KRT5⁺, KRT14⁺, KRT1⁺, KRT10⁺ supra-basal), keratinocyte-5 (KRT1⁺, KRT10⁺ IVL⁺ supra-spinous) and keratinocyte-8 (KRT5⁺, KRT14⁺, COL17A1⁺, PCNA⁺ basal)³⁵ subsets (Supplementary Fig. 7a) aligned with negatively-associated disease severity modules and factors (e.g. yellow module, skyblue module and factor S9) (Supplementary Fig. 6). Notably, each cluster comprised basal, suprabasal and spinous subsets, reinforcing a mechanistic model of a switch in keratinocyte differentiation program and phenotype with disease severity progression^18,35. Additionally, and in line with the current pathophysiological understanding of the role of innate and acquired immunity in psoriasis^35,37, positively-associated disease severity modules and factors showed strong associations with myeloid-1 (dendritic cells/macrophages), T cell-2 (Th1, Th17), T cell-3 (cytotoxic T lymphocytes)and venule-2 cells (which regulate the trafficking of immune cells into tissues) (Supplementary Figs. 6, 7e–g). Deconvolution of negatively-associated disease severity modules and factors revealed enrichment for fibroblast-5 (SFRP+, MFAP5+; f2/3 stromal/mesenchymal)³⁸ and Langerhans cells (Supplementary Figs. 6, 7b–d).

Nonlesional skin

Although clinically resembling normal skin, it is recognised from gene and protein expression studies that nonlesional skin represents a pre-psoriatic state that is primed to develop into psoriasis³⁹. We extend these findings to show that in nonlesional skin, eigengene expression of three WGCNA modules (turquoise, darkgreen (cornification 1) and paleturquoise (sphingolipid metabolism) and factor S1 significantly and reproducibly positively correlated with whole body disease severity in the adalimumab group (Fig. 2a, c). Of note, there were no significant negatively associated disease severity modules or factors in nonlesional skin.

Blood

Limited previous studies have systematically investigated mRNA biomarkers of psoriasis disease severity in blood⁴⁰. The khaki (innate immune cell; neutrophil degranulation) module and factor B1 (inflammatory and HLA aligned A; phagosome) were positively associated with disease severity in the adalimumab cohort, and deconvolution indicated enhanced representation of neutrophils (Fig. 2b, Supplementary Fig. 10, Supplementary Data 5). Strikingly, in the ustekinumab cohort there were no correlations reaching statistical significance (Fig. 2b, c).

Predicting disease severity using machine learning

To predict disease severity from modules and factors, we employed additive Gaussian‑process regression (GPR; Supplementary material and methods). GPR was well‑suited to our “small‑n, large‑p” design, which comprised 718 samples from 146 subjects and approximately 20,000 transcripts within 60 modules and 45 factors across skin and blood, as this method combines (i) non‑linear flexibility with (ii) a Bayesian framework that returns per‑sample credible intervals, an essential read‑out for clinical risk‑stratification. Moreover, its additive kernel decomposition facilitated transparent attribution of individual module and factor contributions via SHAP values (Fig. 4b, d), which cannot be obtained as cleanly from tree‑based ensembles or neural networks.

**Fig. 4: Gaussian process regression accurately predicts log PASI from transcriptomic modules and factors.**

A linear ridge‑regression (RR) baseline was trained for comparison. In five‑fold cross‑validation GPR achieved MAE = 0.45 ± 0.07 and R² = 0.53 ± 0.09, outperforming RR (ΔMAE = +0.06; ΔR² = –0.08).

The final GPR and RR models, fitted to the combined Discovery and Replication cohorts, tracked the training data well (R² = 0.59–0.78; Table 1, Supplementary Table 3). Crucially, only GPR supplies calibrated uncertainty bands around each prediction, complementing global metrics such as MAE and R².

Table 1 PASI prediction performance of additive Gaussian process models

Full size table

To assess the GPR and RR models on unseen data points, models maximised on an independent validation set were retrained and assessed on additional held-out subject-level inputs (Table 1). This suggested demographics and clinical features related poorly to disease severity (Supplementary Fig. 11), while both RNA eigengenes and skin factors strongly related to disease severity in both the validation and testing datasets (Fig. 4a and Table 1). To demonstrate how our models can make clinically meaningful predictions, a random selection of subject predictions were plotted over time (Supplementary Figs. 13, 14).

To assess feature importance, we used SHAP (SHapley Additive exPlanations) which shows the contribution of each skin RNA module and skin factor for predicting disease severity (Fig. 4b)²⁹.

Despite differences in model methodology (i.e. linear vs. nonlinear), both regression techniques prioritised similar eigengenes and skin factors, although GPR was more selective than RR (Fig. 4b and Supplementary Fig. 12b). Overall, both methods highlighted turquoise, darkred (PI3K-AKT-mTOR signalling), steelblue (innate/adaptive immunity), violet (complement) and tan (oxidative phosphorylation) modules as important contributors to disease severity prediction (Fig. 4b and Supplementary Fig. 12b). Factors S1 and S2 (mixed inflammatory), which are inter-correlated (Supplementary Fig. 3a), were identified by SHAP analysis of RR and GPR models respectively as the most influential skin factors for predicting PASI (Fig. 4b). Additionally, factors S9 and S6 (ECM signalling) appeared as top features influencing disease severity (Fig. 4b and Supplementary Fig. 12b). By analysing both MAE and R² values, we identified the turquoise module as the strongest positive and the blue module as the strongest negative contributors to disease severity prediction (Table 1). In order to gain a deeper understanding of key genes driving disease severity prediction, the GPR models were retrained with only the top 10 aligned genes from selected modules and factors correlating with disease severity (Fig. 4c and Supplementary Tables 3, 4). A SHAP analysis of the top 10 aligned genes from turquoise and from blue identified CARHSP1, KLK13, CRABP2, and GJB2 (turquoise, cytokine) and THRA, ZNF34, RORC, CRY2 and CACNA2D2 (blue, WNT signalling) as the 9 key genes driving disease severity prediction (Figs. 4d, 6d).

Factors S18 (HLA-DQA1*01/HLA-DRB1*15-associated) and S21 (HLA-enriched factor C) were among the top predictors of disease severity according to both models (Fig. 4b and Supplementary Fig. 12b). Most of the genes highly aligned with these factors were HLA-encoding, with HLA-DQB1, HLA-DQA1, HLA-DRA, HLA-DRB1 and HLA-DRB5 aligning with factor S18, and HLA-E, HLA-DQB2, HLA-DQA2 and GSTM1 aligning with factor S21 (Fig. 5A). Previous studies have reported associations between HLA genotypes and psoriasis severity⁴¹. We therefore investigated associations between HLA genotypes and the expression of genes and factors. Among the strongest genotype-gene correlations were those between HLA-DQA1*01 genotype and the expression of HLA-DQA1 and HLA-DQB1, and between HLA-DRB1*15 and expression of HLA-DRB5 and HLA-DRB1 (Fig. 5B, C). Additionally, the two strongest genotype-factor relationships were between HLA-DQA1*01 and HLA-DRB1*15 genotypes and factor S18 (Fig. 5B). We observed three clusters of patients corresponding to factor S18, its constituent genes and the HLA genes. Individuals with both the DRB1*15 and DQA1*01 genotypes had high factor S18 expression and high expression of HLA-DQB1, -DQA1, -DRB1 and -DRB5; subjects with the DQA1*01 genotype only had moderate factor S18 expression and high expression of HLA-DQB1 and -DQA1, while those with neither allele had low factor S18 expression and low expression of all four of the aforementioned genes (Fig. 5A).

**Fig. 5: Latent factor S18 defines HLA-driven patient clusters associated with baseline disease severity.**

Given that these HLA-aligned factors were not associated with disease severity across timepoints (Fig. 2a), we investigated why they were considered important for prediction (Fig. 4c). There was no apparent clustering of samples by tissue type (Fig. 5A) and the expression of these factors displayed strong correlations between paired lesional and nonlesional samples (Fig. 5D), suggesting that the expression of these factors does not depend on tissue type. We next considered associations between the factors and baseline disease severity. Factors S1, S2, S17 (matrisome and cytokine receptor) and S18 were significantly correlated (FDR < 0.05) with baseline PASI in lesional skin; factor S18 was the most significantly associated (Pearson’s r = −0.28, adjusted p-value = 0.01) (Supplementary Table 5). Furthermore, the patient clusters based on the HLA-DQA1*01 and HLA-DRB1*15 genotypes were significantly associated with baseline PASI (p-value = 0.007) (Fig. 5E, F). These data indicate a link between these genotypes, expression of HLA-DQB1, -DQA1, -DRB1 and -DRB5 and psoriasis severity at baseline.