Introduction

With improved access to early and effective antiretroviral therapy (ART), HIV has become a chronic condition marked by higher risk and earlier onset of aging-related diseases, including cardiovascular disease (CVD)1. People living with HIV (PLWH), even those with viral suppression consequent to ART use, are at higher risk of myocardial diseases2, such as incident heart failure (HF)3,4,5 and atrial fibrillation (AF)6. Advanced HIV disease has been classically associated with left ventricular (LV) systolic dysfunction and dilated cardiomyopathy, but with effective ART, this phenotype has transitioned to one characterized by subclinical abnormalities in cardiac structure and function that may presage clinical HF, particularly HF with preserved ejection fraction (EF)7. We previously demonstrated PLWH have a higher risk of diastolic dysfunction and related structural abnormalities compared to people without HIV (PWOH), including subclinical left atrial (LA) enlargement8,9, findings supported by other data across world regions10,11,12,13. Indexed left atrial volume (LAVi) reflects the cumulative effect of chronically elevated LV filling pressure and is an independent predictor of diastolic dysfunction and clinical HF14.

Mechanisms contributing to HIV-associated myocardial disease are hypothesized to be multifactorial, mediated by chronic immune activation and dysfunction that persists despite ART-induced viral suppression. Hypothesized contributors include direct immune activation and exhaustion due to HIV viral persistence and co-pathogens, metabolic dysfunction due to HIV and use of certain antiretrovirals, higher mucosal permeability and microbial translocation, and higher prevalence of risk factors with known pro-inflammatory effects, including substance use7. However, it remains unclear to what degree mechanisms underlying this risk differ among PLWH compared to PWOH, among whom aging-related immunosenescence and inflammaging may similarly contribute to clinical HF risk. Moreover, therapeutic and preventive strategies among PLWH, particularly for HF, are largely guided by data from clinical trials conducted among PWOH, so evaluating this knowledge gap may have important clinical implications.

Proteomics provides a high-throughput discovery approach to generate mechanistic hypotheses that could lead to improved risk prediction, disease characterization, and targeted pharmacologic therapies. We performed proteomic analyses using the Olink Explore 3072 platform on stored plasma obtained concurrently with cardiovascular magnetic resonance (CMR) among largely ART-treated and virally suppressed PLWH and sociodemographically similar PWOH enrolled in the Subclinical Myocardial Abnormalities in HIV (SMASH) study9, the discovery cohort. Our objective was to identify plasma proteomic signatures that may represent markers of HIV-associated atrial remodeling, defined as features associated with both positive HIV serostatus and higher LAVi. We then sought to evaluate whether identified proteomic signatures were associated with LAVi, incident AF, and incident adjudicated HF among a large external cohort of older PWOH in the Multi-Ethnic Study of Atherosclerosis (MESA). Here, we discover such a proteomic signature that may in part reflect or contribute to HIV-associated atrial remodeling, many features of which are associated with older age and time to incident heart failure in the MESA. This signature is enriched in immune checkpoint proteins, tumor necrosis factor signaling, ephrin signaling, and extracellular matrix organization, which may help identify shared mechanistic pathways in HIV and aging that contribute to a higher risk of clinical HF.

Results

A graphical representation of the study design and analytic flow is depicted in Fig. 1.

Fig. 1: Graphical representation of the study design and analytic flow.
figure 1

Cross-sectional discovery analyses were performed in the Subclinical Myocardial Abnormalities in HIV Study (SMASH) using linear regression with robust variance. Analyses of HIV serostatus adjusted for sociodemographics, substance use, hepatitis C virus infection, and renal function. Analyses of left atrial size further adjusted for HIV serostatus, body mass index, systolic blood pressure, hypertension medication use, dyslipidemia, and diabetes. Both cross-sectional and longitudinal time-to-incident event analyses were performed among an external population of older people without HIV in the Multi-Ethnic Study of Atherosclerosis (MESA). Cross-sectional analyses of left atrial size were performed using linear regression with robust variance, adjusting for field center, sociodemographics, smoking, body mass index, systolic blood pressure, hypertension medication use, dyslipidemia, diabetes, and renal function. Time-to-event analyses of atrial fibrillation and heart failure were performed using Cox proportional hazards regression, adjusting for the same covariates. Proteome feature counts listed are those associated with indicated outcomes at a false discovery rate < 0.05.

SMASH Participant characteristics

Of 468 persons initially enrolled in SMASH, 352 were selected for plasma proteomics based on the availability of complete CMR data, including gadolinium enhancement, and adequate stored plasma volume (Supplementary Fig. S1). Selected participants had a lower representation of females and a lower prevalence of hypertension and hepatitis C virus infection (HCV) compared to unselected participants (Supplementary Table S1). Demographic, clinical, and CMR characteristics of selected participants are summarized in Table 1 by HIV serostatus and by cohort in Supplementary Table S2. Participants were mean (standard deviation, SD) age 55 (6) years old, 25% female, 61% PLWH, and had predominantly normal LVEF. LAVi was higher among PLWH compared to PWOH—29.7 (11.3) and 27.7 (8.3) mL/m2, respectively (p = 0.02 for difference, adjusting for demographics, substance use, and traditional CVD risk factors).

Table 1 SMASH participant characteristics by HIV serostatus (n = 352)

Proteomic signature of HIV serostatus

Of 2596 proteins analyzed, plasma abundances of 439 proteins were significantly associated with positive HIV serostatus following multivariable adjustment (415 positively associated and 24 inversely associated with a false discovery rate, FDR < 0.05) (Fig. 2). These proteins were statistically enriched for many biological processes, including T cell activation, differentiation, and migration; B cell activation; natural killer (NK) cell-mediated cytotoxicity; regulation of mitogen-activated protein kinase (MAPK) and extracellular signal-regulated kinase 1/2 (ERK1/ERK2) cascades; tumor necrosis factor (TNF) signaling; response to interleukin (IL)-1; IL-10 production; and interferon-γ (IFN-γ) production. The strongest association with HIV was that of cytotoxic and regulatory T cell molecule (CRTAM); on average, its adjusted plasma abundance was 1.01 SD higher among PLWH compared to PWOH (95% confidence interval [CI]: 0.84−1.17; FDR = 1.70 × 10−29). Upon exclusion of viremic PLWH, 414 of these 439 proteins remained significantly associated with positive HIV serostatus (390 positively, 24 inversely), among the strongest of which was CRTAM (FDR = 1.51 × 1020). See Supplementary Data 1 for complete results of protein associations with HIV serostatus and Supplementary Table S3 for enrichment details.

Fig. 2: Proteomic signature of HIV seropositivity among people living with and without HIV in the United States (n = 352).
figure 2

a Volcano plot of mean differences in standardized plasma protein abundance comparing PLWH to PWOH vs. -log10 false discovery rate (FDR; Benjamini-Hochberg) estimated using linear regression with robust variance, adjusting for sociodemographics, substance use, hepatitis C virus infection, and renal function. The threshold for significance is indicated by a gray dotted line, FDR < 0.05; 415 of 2596 proteins were positively associated with positive HIV serostatus and 24 proteins were inversely associated; comparing suppressed PLWH vs. PWOH, 414 of these 439 total proteins were associated with suppressed positive HIV serostatus (not depicted). Complete modeling results can be found in Supplementary Data 1. b Enriched biological processes among proteins associated with HIV serostatus, estimated using a two-sided Fisher’s exact test, Gene Ontology: Biological Processes annotations, and a threshold for significance of FDR < 0.05. Protein ratio = proportion of proteins significantly associated with HIV serostatus (n = 439) that map to the given annotation. Proteins mapping to each annotation can be found in Supplementary Table S3. PLWH = persons living with HIV; PWOH = persons without HIV; suppressed HIV viral load = HIV RNA < 50 copies/µL.

Many classical markers reflecting chronic immune activation and systemic inflammation among PLWH were significantly elevated among PLWH compared to PWOH, including the monocyte activation markers soluble CD14 (FDR = 2.60 × 1010) and CD163 (FDR = 0.009), as well as CXCL10 (FDR = 1.73 × 108) and TNFR1 (FDR = 0.014). Notably, IL-6 levels did not differ by HIV serostatus (p = 0.19, FDR = 0.42), consistent with previously reported immunoassay results among this study population15. These inferences were unchanged following the exclusion of viremic PLWH.

Of six protein clusters agnostically identified using weighted gene co-expression network analysis (WGCNA) (details in Supplementary Data 2, Supplementary Table S4 and Supplementary Fig. S2), higher plasma abundance of one cluster (Brown) was independently associated with positive HIV serostatus (p = 3.99 × 106), and this association remained following exclusion of viremic PLWH (p = 1.02 × 104) (Supplementary Table S5). This cluster comprised 42 proteins enriched for regulation of T cell proliferation, TNF signaling, ephrin signaling, cell-cell adhesion, and extracellular matrix (ECM) organization (Supplementary Table S6).

HIV-Associated proteomic signature of left atrial remodeling

Next, we identified proteome features significantly associated with both HIV serostatus and LAVi that may reflect potential contributors to or biomarkers of HIV-associated LA remodeling. Of 439 proteins associated with HIV serostatus, 78 were also associated with LAVi (74 positively, 4 inversely), adjusting for demographics, substance use, HIV, HCV, and estimated glomerular filtration rate (eGFR). Upon further adjustment for traditional CVD risk factors, 73 remained associated (69 positively, 4 inversely) (Fig. 3a and Supplementary Data 3), all of which were associated with concordant directionality compared to positive HIV serostatus (Fig. 3b). In subgroup analyses, 72 (99%) of these proteins differed in mean plasma level between virally suppressed PLWH and PWOH. These proteins were statistically enriched for involvement in T cell activation, differentiation, and proliferation; B cell activation; NK cell-mediated cytotoxicity; cell-cell adhesion; and TNF signaling, amongst others (Fig. 3c and Supplementary Table S7).

Fig. 3: HIV-associated proteomic signature of left atrial size among people living with and without HIV in the United States (n = 352).
figure 3

a Volcano plot of mean differences in left atrial volume index (LAVi) per standard deviation (SD) increment in plasma abundance of 439 HIV-associated proteins vs. -log10 false discovery rate (FDR; Benjamini-Hochberg) estimated using linear regression with robust variance. Purple indicates proteins only associated in Model 1 (M1) when adjusting for sociodemographics, HIV, hepatitis C virus (HCV) infection, and renal function. Orange indicates proteins associated in Model 2 (M2) following further adjustment for education, cardiovascular risk factors, and substance use. The threshold for significance is indicated by a gray dotted line, FDR < 0.05. The most saturated model (orange) yielded 69 proteins positively associated with LAVi and 4 proteins inversely associated. Complete modeling results can be found in Supplementary Data 3. b Beta-beta plot of mean differences in standardized protein abundances comparing persons living with vs. without HIV (PLWH, PWOH) vs. mean differences in LAVi per SD increment in plasma protein abundances. All associations were estimated using linear regression with robust variance, and proteins depicted (n = 73) are restricted to those significantly associated with both parameters with an FDR < 0.05. HIV point estimates (y-axis) were adjusted for sociodemographics, substance use, HCV, and renal function. LAVi point estimates (x-axis) were adjusted for the same covariates, plus HIV and cardiovascular risk factors. c Enriched biological processes among proteins associated with both positive HIV serostatus and higher LAVi (concordant directionality), estimated using a two-sided Fisher’s exact test, Gene Ontology: Biological Processes annotations, and a threshold for significance of FDR < 0.05. Protein ratio = proportion of total signature proteins (n = 73) that map to the given annotation. TNF = tumor necrosis factor; NK = natural killer; PDGF = platelet-derived growth factor. Proteins mapping to each annotation can be found in Supplementary Table S7. d Mean differences in LAVi per SD increment in plasma protein abundances among strata of PLWH and PWOH, estimated using linear regression with robust variance adjusting for sociodemographics, substance use, cardiovascular risk factors, HCV, and renal function. Proteins depicted are limited to those with HIV×protein multiplicative interactions with an FDR < 0.10 (n = 73 proteins tested). Error bars indicate 95% confidence intervals.

The strongest observed protein association with LAVi was that of known HF biomarker, N-terminal pro-B type natriuretic peptide (NT-proBNP). On average, LAVi was 3.09 mL/m2 higher (11% of mean LAVi) per SD increment in plasma NT-proBNP (95% CI: 2.16−4.03; FDR = 7.75 × 10−9). Notably, among the top 5 strongest associations with LAVi was CRTAM, which also had the strongest association with HIV serostatus; LAVi was 2.37 mL/m2 higher on average per SD (95% CI: 1.27−3.47; FDR = 3.40 × 104). The 73 proteins associated with LAVi exhibited pairwise correlations in plasma abundances that ranged widely from weak to strong (Supplementary Fig. S3) with several moderate confidence protein-protein interactions detected via STRING16 (Supplementary Fig. S4).

Next, we evaluated what proportion of the adjusted association between HIV and LAVi was accounted for by further adjustment for each identified proteome feature. When further adjusting the HIV-LAVi analysis for each of these 73 proteins separately (in addition to demographics, substance use, HCV, eGFR, and CVD risk factors), 38 proteins reduced the adjusted HIV association by >30%, and 8 reduced it by >50%—CRTAM, CD27, CD79B, CD160, TNF receptor superfamily (TNFRSF) 8, TNFRSF9, TNFRSF17, and IL18BP (Supplementary Table S8). Notably, adjustment for CRTAM reduced the association between positive HIV serostatus and higher LAVi by 82%, suggesting it may substantially account for that association.

We evaluated these 73 proteins for differences in adjusted LAVi associations by HIV serostatus and found no significant protein × HIV interactions (FDR < 0.05). Given type II error is likely high for these multiplicative tests, we present interactions with higher significance thresholds to broaden hypothesis-generating potential. Five proteins exhibited associations with LAVi that differed by HIV serostatus at FDR < 0.10 and were present only among PLWH: NT-proBNP; CD80, CD83, and TNFRSF9 (CD137), all important immune checkpoint co-stimulatory molecules17,18,19; and folate receptor β (FOLR2), an important folate binding protein expressed by myeloid cells, including infiltrating M2-like macrophages20 (Fig. 3d). See Supplementary Table S9 for complete results of individual protein interaction tests alongside modeling results stratified by HIV serostatus. We detected no protein × age interactions (Supplementary Table S10).

In our multi-analyte approach, we found higher plasma level of the one agnostically defined HIV-associated protein cluster (Brown) was also associated with higher LAVi, adjusting for age, sex, race, ethnicity, eGFR, HCV, and HIV serostatus (2.03 mL/m2 higher per SD increment in Brown cluster plasma level, 95% CI: 0.85−3.21; p = 0.001). Following further adjustment for substance use, education, and traditional CVD risk factors, the estimate of association was only mildly attenuated (1.96 mL/m2 higher per SD, 95% CI: 0.75−3.16; p = 0.002, Fig. 4). We found adjustment for this protein cluster reduced the independent association between positive HIV serostatus and higher LAVi by 36%. Moreover, this association differed by HIV serostatus (p = 0.006) and was only present among PLWH. We did not detect an interaction with age (p = 0.18).

Fig. 4: Relationship between an HIV-associated, agnostically defined cluster of plasma proteins and left atrial size among people living with and without HIV in the United States (n = 352).
figure 4

a Mean difference in indexed left atrial volume (LAVi) per standard deviation (SD) increment in ‘Brown’ protein cluster plasma abundance among all participants, among strata of participants living with and without HIV, and among strata of participants above and below median age in SMASH (55 years), estimated using linear regression with robust variance adjusting for sociodemographics, substance use, cardiovascular risk factors, hepatitis C virus infection, and renal function. Error bars indicate 95% confidence intervals. b Enriched biological processes among 42 proteins comprising the ‘Brown’ protein cluster, estimated using a two-sided Fisher’s exact test, Gene Ontology: Biological Processes annotations, and a threshold for significance of Benjamini-Hochberg false discovery rate (FDR) < 0.05. Protein ratio=proportion of total ‘Brown’ cluster proteins (n = 42) that map to the given annotation. Proteins mapping to each annotation can be found in Supplementary Table S6. c Interaction network of 42 proteins comprising the ‘Brown’ protein cluster generated using STRING, a public database of known and predicted protein-protein interactions. Interactions include direct (physical) and indirect (functional) associations derived from computational prediction, knowledge transfer between organisms, and interactions aggregated from other databases. Line thickness indicates the strength of data support. Szklarczyk D, et al. The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Res. 2023;51(D1):D638-64.

We identified eleven proteins in both single- and multi-analyte approaches—specifically butyrophilin subfamily 2 member A1 (BTN2A1), C-type lectin domain containing 14 A (CLEC14A), delta-like canonical Notch ligand 1 (DLL1), ephrin A2 (EPHA2), endothelial cell adhesion molecule (ESAM), T cell immunoglobulin and mucin-domain containing-3 (TIM-3 or HAVCR2), junctional adhesion molecule 2 (JAM2), layilin (LAYN), TNFRSF1B, TNFRSF14, and TNFRSF21.

Associations between proteomic signature and clinical variables in SMASH

Many components of the identified protein signature were associated with age (p < 0.05), including the Brown cluster and 26 of 73 individual proteins. Notably, higher plasma CRTAM was not associated with traditional CVD risk factors (age, body mass index [BMI], hypertension, dyslipidemia, diabetes) but was associated with substance use (smoking and stimulant use) and many HIV-related factors (HCV, detectable plasma HIV RNA, lower current and nadir CD4 + T cell counts, not receiving ART and shorter ART duration). Similarly, the Brown protein cluster was largely only associated with substance use and HIV-related clinical factors—smoking, opioid use, HCV, detectable plasma HIV RNA, and lower current and nadir CD4 + T cell counts. See Supplementary Fig. S5 for the results of all evaluated protein associations with demographic and clinical characteristics.

External validation of protein signatures with indexed left atrial volume in MESA

A total of 1242 MESA participants had both CMR imaging and Olink Explore 3072 proteomic data (Supplementary Fig. S6), demographic and clinical characteristics of whom are summarized in Table 2. Participants were mean (SD) age 67 (9) years, 52% female, and had predominantly normal LVEF.

Table 2 MESA external validation participant characteristics (exam 5, 2010–2012)

We tested each feature of the HIV-associated proteomic signature of LA remodeling that we identified in SMASH for associations with LAVi in MESA. Among these older PWOH, 8 of 73 plasma proteins were cross-sectionally associated with LAVi following multivariable adjustment (Table 3 and Supplementary Table S11). Following NT-proBNP, the strongest validated association was that of CLEC14A (mean difference: 1.36 mL/m2 higher per SD, 95% CI: 0.65−2.07; FDR = 0.006), followed by collagen type IV alpha 1 chain (COL4A1) (1.20 mL/m2 higher per SD, 95% CI: 0.56−1.85; FDR = 0.006). Moreover, 28 (of 73 proteins) had LAVi associations that differed by age (interaction FDR < 0.05, 41 proteins FDR < 0.10) (Supplementary Table S12); 16 of these were significant associations with LAVi only among a subgroup of MESA participants older than the median age at exam 5 ( ≥ 67 years), and only one (NT-proBNP) was associated among participants age < 67 years (Table 3). Furthermore, plasma level of the Brown protein cluster was associated with LAVi only among older MESA participants (age ≥ 67 years, mean difference in LAVi per SD: 1.23 mL/m2, 95% CI: 0.03−2.43; p = 0.04; age < 67 years: -0.72, 95% CI: -1.75−0.32; p = 0.17; interaction p = 0.005).

Table 3 Summary of cross-sectional associations between identified HIV-associated proteomic signature of left atrial size and indexed left atrial volume by age among people without HIV in MESA (n = 1242)

External validation of protein signatures with incident clinical events in MESA

Incident clinical event analyses did not require CMR data and were performed in a larger subgroup of the MESA cohort to maximize statistical power (Supplementary Fig. S7). Distributions of demographic and clinical characteristics were similar to those in cross-sectional analyses presented above (Table 2).

Among 2185 MESA participants without known AF at exam 5, there were 252 incident AF events over a median [IQR] follow-up of 7.8 [0.9] years. Following multivariable adjustment, only NT-proBNP (of 73 proteins) was associated with time to incident AF (Supplementary Table S13 and Supplementary Fig. S8) (hazard ratio, HR: 1.54 per SD increment in NT-proBNP, 95% CI: 1.33−1.77; FDR = 2.08 × 10−7). Plasma abundance of the Brown protein cluster was also associated with incident AF (HR: 1.18 per SD increment, 95%CI: 1.01 to 1.37; p = 0.036). No interactions with baseline age were detected (all FDR > 0.10).

Among 2273 MESA participants without known HF at exam 5, there were 54 incident adjudicated clinical HF events over a median [IQR] follow-up of 8.9 [0.8] years. Thirty of 73 proteins were associated with incident HF (Fig. 5), the strongest of which was TNFRSF1B (HR: 1.81 per SD increment, 95% CI: 1.41−2.33; FDR = 2.58 × 104). Five of these proteins were also associated with LAVi in MESA when restricting to participants above the median age—CLEC14A, cysteine-rich transmembrane BMP regulator 1 (CRIM1), NT-proBNP, thrombospondin-2 (THBS2), and tissue inhibitor matrix metalloproteinase-1 (TIMP1). See Supplementary Table S14 for complete results of evaluated protein associations with time to incident HF. The hazard of HF was also significantly greater with higher plasma levels of the Brown protein cluster (HR: 1.79 per SD, 95% CI: 1.32 to 2.43; p = 1.82 × 104). No interactions with age were detected (all FDR > 0.10). Notably, 9 of 11 proteins identified in both single- and multi-analyte discovery analyses in SMASH were associated with incident HF in MESA—BTN2A1, CLEC14A, DLL1, ESAM, HAVCR2, JAM2, LAYN, TNFRSF1B, and TNFRSF14.

Fig. 5: Associations between identified HIV-associated proteomic signature of left atrial size and time to incident adjudicated clinical heart failure in the Multi-Ethnic Study of Atherosclerosis (n = 2273).
figure 5

Hazard ratios were estimated using Cox proportional hazards adjusting for study site, age, sex, race, ethnicity, educational attainment, body mass index, systolic blood pressure, anti-hypertensive medication, dyslipidemia, diabetes, smoking, and estimated glomerular filtration rate. N = 54 incident adjudicated heart failure events. Purple indicates p < 0.05, Orange indicates FDR < 0.05. Complete modeling results can be found in Supplementary Table S14, and source data are provided as a Source Data file. SD = standard deviation; FDR = alse discovery rate (Benjamini-Hochberg).

Associations between validated proteomic signature features and clinical variables

Each component of the protein signature validated with clinical HF among MESA participants was positively and strongly associated with older age (all p < 0.001) (Supplementary Fig. S9). All identified proteome features were also strongly associated with lower eGFR, and many with hypertension (28 proteins and Brown cluster), diabetes (25 proteins and Brown cluster), and higher BMI (22 proteins and Brown cluster)—which remained associated following adjustment for eGFR.

Discussion

In this discovery study, we identified a distinct plasma proteomic signature of left atrial remodeling—a marker of subclinical heart disease—that was heightened among PLWH and was also strongly associated with age and higher risk for incident clinical HF among a large independent cohort of PWOH who were over 10 years older. This proteomic signature comprised 73 individual proteins and one agnostically-defined cluster of 42 proteins, largely comprising immune checkpoint proteins (ICPs), cytokine signaling, ephrin signaling, and ECM organization pathways. Our findings support the premise that persistent immune activation, higher systemic inflammation, and fibrosis are related to HIV-associated atrial remodeling, even among virally suppressed PLWH on ART, and some of these mechanisms may parallel those of aging-related HF among the general population.

It is well-known that PLWH exhibits elevated systemic inflammation and immune activation compared to PWOH, characterized by T cell exhaustion, macrophage activation, and elevated levels of circulating pro-inflammatory cytokines that persist even among virally suppressed PLWH on ART. Immunosenescence is a well-described consequence of aging, and maladaptively activated innate and adaptive immune responses have also been implicated in acute and chronic HF21. This is the prevailing hypothesized mechanism through which PLWH experiences excess risk of aging-related diseases, including HF, and our findings support and expand upon those previously put forward.

We identified 30 proteins associated with HIV serostatus and LAVi in SMASH as well as incident clinical HF among older PWOH in MESA. Five of these proteins were associated with LAVi among PWOH in MESA when restricting to participants above median age. Interestingly, this list also included several proteins exhibiting especially high potential for mediating and/or moderating effects on the association between HIV and atrial remodeling and also included 9 proteins identified in both single- and multi-analyte discovery analyses. This overlap in identification is important because these approaches are both agnostic and independent of each other; identification by both approaches therefore strengthens the plausibility of the identified proteins as potential contributors to or biomarkers of both HIV- and aging-related LA remodeling and HF.

Many of these proteins of especially high interest are important ICPs—specifically CD27, TNFRSF8 (CD30), CD79B, CD80, TNFRSF9 (CD137), HAVCR2 (TIM-3), TNFRSF14 (HVEM), TIGIT, TNFRSF1B (TNFR2), and less extensively studied BTN2A1, CD83, IL18BP, and TNFRSF17. Many of the remaining proteins—CLEC14A, CRIM1, DLL1, ESAM, JAM2, LAYN, THBS2, and TIMP1—are involved broadly in cell migration and tissue homeostasis.

Activation of T cells, B cells, and NK cells is regulated by several costimulatory and inhibitory ICPs. Elevated expression of ICPs is a known consequence of HIV infection that persists despite ART-induced viral suppression and contributes to immune dysfunction, HIV disease progression, and viral persistence22. Many ICPs we identified belong to the TNF receptor superfamily, the upregulation of which is both a hallmark of HIV-associated inflammation and immune activation23 and has been extensively observed in HF24. Higher soluble CD27 and CD80 have also recently been identified as correlates of risk for non-AIDS adverse events among PLWH receiving ART25. Therapeutic blockade of ICPs has, however, proven controversial and effects are dependent on whether target molecules serve co-stimulatory or co-inhibitory functions. Taken together with the unique features of HIV-associated immune dysfunction, these findings highlight the need for focused future investigation of ICPs in HIV-associated cardiac remodeling and HF.

In this study, the one agnostically defined protein cluster of interest included four proteins involved in ephrin signaling, one of which was also identified in single-analyte discovery analyses (EPHA2). Ephrin signaling mediates a wide range of cellular processes involved in tissue homeostasis and response to injury, including angiogenesis and activation/migration of T cells, B cells, and macrophages26. Though not well-studied among PLWH, ephrin signaling has been linked to other viral infections27 as well as cardiomyocyte development, systolic and diastolic function, and TGF-β mediated cardiac fibrosis28,29,30.

Several of these higher-interest proteins are related to tissue growth and development, angiogenesis, and ECM remodeling—including CRIM1, THBS2, and TIMP1. CRIM1 modulates the activity of bone morphogenic proteins, members of the TGFβ superfamily that are known to play roles in cardiac development and disease pathophysiology in HF, pulmonary arterial hypertension, and atherosclerosis31. Both THBS2 and TIMP1 are critical regulators of tissue homeostasis with roles in ECM remodeling, angiogenesis, and cell growth/differentiation, and have been extensively implicated in HF pathogenesis32,33, including in recent plasma proteome studies34,35,36.

Other high-interest proteome features have been linked to the migration of various immune cell subsets in tissue homeostasis—specifically DLL1 (a ligand of Notch receptors), ESAM (endothelial cell adhesion molecule), JAM2 (involved in cell migration), and LAYN (involved in focal adhesion). Immune cell recruitment plays an important role in myocardial response to injury21; however, further investigation is needed to clarify the role of these proteins in shared mechanisms of HIV- and aging-related cardiac remodeling and HF.

Few proteins were inversely associated with positive HIV serostatus and higher LAVi in SMASH. One of these four proteins—contactin-3 (CNTN3)—was validated in its inverse association with LAVi among all MESA participants, though it was not associated with time to incident AF or HF. There is evidence from murine models that CNTN3 may be associated with aging, inflammation, and vascular permeability37, and a protective effect of a closely related contactin (CNTN1) was recently identified in a proteomics study of incident HF34.

We identified potential biological contributors to the association between HIV and LA remodeling. The abundance of one protein—CRTAM—reduced that association by 82%, markedly more than any other protein, and was strongly associated with several clinical factors related to HIV disease severity. CRTAM is an immune activation marker expressed by NK cells, CD8 + T cells, and a subset of CD4 + T cells with enhanced cytotoxic activity38. Measures of CRTAM expression have previously been associated with positive HIV serostatus, HIV viral reservoir size, and viral rebound kinetics39,40. CRTAM is also believed critical to shaping the gut microbiome and Th17 responses at the mucosa41, increased permeability of which is a consequence of Th17 HIV-susceptibility and a contributor to chronic inflammation among PLWH42. CRTAM was not associated with LAVi or incident HF among older PWOH in MESA, consistent with the hypothesis that it may be an HIV-specific biomarker of CVD. Taken together, our results and prior work suggest CRTAM may represent an HIV-specific marker of cardiovascular risk worthy of further investigation into its prognostic and therapeutic utility.

There are limitations to this study. Protein levels measured with Olink are semi-quantitative, limiting comparison to other assays. Previously published data demonstrated many proteins assessed by Olink Explore 3072 were associated with cis-protein quantitative trait loci (pQTLs), suggesting specificity of protein identification43. Nonetheless, all proteins of interest require validation using other methods. Type II error may be inflated due to the method used for multiple testing correction, which assumes independence of tests (i.e., biological independence of proteins) and is therefore conservative. For example, given macrophage activation is a hallmark of chronic immune activation among PLWH, we hypothesized it would be detected in our enrichment analyses. It was indeed positively associated with HIV and LAVi in SMASH but not following multiple testing corrections (p = 0.03; FDR = 0.36). Type II error may be of particular concern in analyses among subpopulations or involving multiplicative interactions. Discovery analyses are cross-sectional and lacking incident event data, resulting in temporal ambiguity. There are also limitations of incident event ascertainment in MESA, particularly for AF which relied on hospitalizations and claims data alone, possibly leading to misclassification bias. Finally, though we believe it strengthens the validity of our detected associations, discovery and external study populations substantially differ demographically, sociobehaviorally, and clinically (e.g., prevalence of HIV and HCV infection). We were therefore unable to externally evaluate the identified proteomic signature of HIV serostatus. In addition, though observed associations with LAVi in SMASH discovery analyses were independent of HIV serostatus and other major factors that differed between populations (e.g., substance use, age, sex), it cannot be discounted that unvalidated proteins may simply be more clinically relevant among the source population represented in SMASH.

In conclusion, we discovered a plasma proteomic signature that may in part reflect or contribute to HIV-associated left atrial remodeling and also predicted incident clinical heart failure, as evidenced among a large independent cohort of older people without HIV. This signature was enriched in pathways of immune activation, cytokine signaling, and extracellular matrix organization. Our findings suggest some pathways underlying the risk of heart failure among people with HIV may also contribute to heart failure pathogenesis among older people without HIV. If successfully validated in other external populations, these proteins may help refine current risk prediction models and help identify therapeutic targets for heart failure among both people with and without HIV.

Methods

Discovery study population (SMASH)

Participants in the SMASH study were recruited from the Multicenter AIDS Cohort Study (MACS), the Women’s Interagency HIV Study (WIHS), and the AIDS Linked to the Intravenous Experience (ALIVE) study and underwent CMR with concurrent blood collection and storage between March 2015 and February 2018 under a single, centralized protocol. All three cohorts include PLWH and PWOH. The WIHS and MACS are multicenter observational longitudinal cohort studies composed of women or men who have sex with men, respectively44,45. The ALIVE study is a community-based cohort of men and women with a history of injection drug use46. By leveraging three well-established cohorts with distinct source populations for enrollment, we recruited participants into SMASH with diverse sociodemographic and HIV-related risk factors, highly representative of people with and at risk of HIV in the United States. See the Supplementary Information for additional cohort details. The SMASH study enrolled active participants 40–70 years of age from the Chicago and Baltimore/Washington, DC MACS sites, the Washington, DC WIHS site; and the ALIVE study, which is based in Baltimore. Exclusion criteria included eGFR (Chronic Kidney Disease Epidemiology Collaboration 2021) <45 mL/min/1.73 m2, weight >350 pounds, known claustrophobia or contrast allergy, and contraindications to CMR. Proteomics was performed on a subset of total SMASH participants with complete CMR and adequate stored plasma volume (Supplementary Fig. S1). All participants provided informed consent. This study conforms to the ethical principles of the Declaration of Helsinki and has been approved by the Institutional Review Boards of Johns Hopkins University, Georgetown University, and Northwestern University.

Sociobehavioral and clinical data in SMASH

Participants completed an interviewer-administered structured questionnaire and biological measures concurrent with CMR. Prescribed medications, history of CVD, and substance use during the preceding 5 years were queried. These data were supplemented with data collected through the MACS, WIHS, and ALIVE cohorts, including self-reported demographics and CVD risk factors. See Supplementary Information for a full list and definitions of covariates. Among PLWH, measures of HIV disease activity included current plasma HIV RNA concentrations (Roche ultrasensitive assay), current and nadir CD4 + T cell counts/µL, history of AIDS-defining malignancy or opportunistic infection, and use of ART, including regimen type.

Cardiovascular magnetic resonance in SMASH

CMR was performed at Johns Hopkins Hospital (Baltimore) or Northwestern Memorial Hospital (Chicago) on 1.5 T Siemens Avanto or Aera scanners (Erlangen, Germany) using a standardized protocol that included gadolinium enhancement, detailed in the Supplementary Information. Briefly, short and long-axis cines (30 phases/cardiac cycle) were acquired with a steady-state free precession sequence. Multimodality Tissue Tracking software (MTT; version 6.0, Toshiba, Japan) was used to quantify LA volumes and other standard metrics47, blinded to participant characteristics. Maximum LA volume was measured using the LA volume curve generated by the Simpson’s method from four-chamber and two-chamber views and was indexed to body surface area (LAVi).

Proteomics in SMASH

We performed proteomics on EDTA plasma stored at − 80 °C using the Olink Explore 3072 platform at the Laboratory of Clinical Investigation at the National Institute on Aging, NIH (Baltimore, MD) using a standardized protocol. Olink technology employs DNA oligonucleotide-labeled antibody pairs that bind target antigens in solution and, when bound pairwise, hybridize and are extended by a DNA polymerase. This forms a DNA barcode, which is then amplified and quantified by microfluidic quantitative PCR48. Additional details are outlined in the Supplementary Information alongside data quality control, normalization, and calibration procedures. Values are reported as log2-transformed relative plasma abundances, which were winsorized to 5 SD to minimize the effect of outliers (median: zero observations winsorized per protein; range: 0.00% to 0.13%) and standardized for comparability. Following quality control, no samples were excluded, and 2594 proteins were analyzed with final mean intra- and inter-assay coefficients of variation of 9% and 15%, respectively.

Weighted gene co-expression network analysis

High-dimensional proteomic data often contain informative patterns lost when analyzing proteins individually. To address this, we derived unique clusters of highly correlated proteins using weighted gene co-expression network analysis implementing the R ‘WGCNA’ package (version 1.72). See Supplementary Information for details.

Statistical modeling

A graphical representation of the study design and analytic flow is depicted in Fig. 1. Complete case analyses were performed to estimate cross-sectional associations between plasma protein abundances and both HIV serostatus and continuous LAVi. Proteins were analyzed individually (single-analyte approach) and as clusters agnostically defined using WGCNA (multi-analyte approach). Inference was made using multivariable linear regression with Huber-White robust variance estimators to account for likely heteroscedasticity. Analyses were corrected for multiple comparisons using a step-down false discovery rate procedure defined by Benjamini and Hochberg and a two-sided threshold for significance of FDR < 0.05.

Mean standardized differences in plasma protein abundances by HIV serostatus were first estimated by adjusting for age, sex, race/ethnicity, educational attainment (dichotomized at high school diploma), current hazardous alcohol use, pack-years of smoking in prior 5 years, stimulant use in prior 5 years, opioid use in prior 5 years, HCV, and eGFR. Mean differences in LAVi were then estimated per SD increment in individual plasma protein abundances, restricting to proteins with HIV associations (at FDR < 0.05) and adjusting for covariates above, and for HIV serostatus, BMI, systolic blood pressure (SBP), hypertension medication use, dyslipidemia, and diabetes. The study site was not adjusted for in these analyses due to collinearity with sex. Proteins of interest were defined as those with significant adjusted associations with both HIV serostatus and LAVi. Given known biological differences among PLWH with vs. without viral suppression on ART, we further analyzed proteome features in subgroup analyses excluding PLWH with detectable HIV RNA (> 50 copies/mL) to evaluate their relevance in the context of successful HIV viremic control. We assessed the degree to which they may reflect the association between positive HIV serostatus and higher LAVi. We evaluated differences in their associations with LAVi by HIV serostatus and age using multiplicative interaction terms. We estimated protein associations with basic clinical and HIV-related characteristics.

Functional annotation analysis

Annotation-based overrepresentation analyses were performed using the Gene Ontology (GO): Biological Processes annotation database and the R ‘clusterProfiler’ package (version 4.6)49. This employs a Fisher’s exact test comparing the proportion of significantly associated proteins mapping to a given functional annotation to the proportion of total analyzed proteins mapping to the same annotation. The background used in these analyses was proteins measured on Olink Explore 3072 that passed quality control. Relationships between proteins of interest were further explored using interaction networks generated using a database of known and predicted protein-protein interactions, STRING16.

External validation cohort (older persons without HIV in MESA)

External validation was performed in MESA, a prospective community-based cohort initiated in 2000 to study the characteristics and progression of subclinical CVD among a diverse population in the United States, age 45–84 years without clinical CVD at baseline50. We used data from exam 5 (years 2010–2012) when participants were aged 53–94 years due to the use of the same CMR protocol51 and proteomics platform (Olink Explore 3072) as SMASH at that time point. MESA participants were also, on average, over 10 years older than SMASH participants at exam 5, allowing a more extensive evaluation of the impact of older age on the identified proteomic signature among PWOH. See Supplementary Information for the study population and data acquisition details.

We first performed cross-sectional analyses of LAVi among MESA participants with proteomics and CMR using the same statistical modeling approach as described above, with slight variation in covariates due to differences in data collection or prevalences among study populations—specifically, we adjusted for field center, age, sex, race/ethnicity, education level, smoking, BMI, SBP, blood pressure-lowering therapy, dyslipidemia, diabetes, and eGFR.

We additionally assessed associations of identified proteomic signatures with time to incident AF and incident clinical HF. AF was defined by the presence of ≥1 inpatient or outpatient International Classification of Diseases, Ninth Revision codes for AF. HF criteria included symptomatic, physician-diagnosed HF as well as ≥1 of the following: (i) pulmonary edema/congestion by chest x-ray; (ii) dilated LV or poor LV function by echocardiography or ventriculography; and/or (iii) evidence of LV diastolic dysfunction. Inpatient and/or outpatient records were independently reviewed by two physicians for HF classification and event date assignment. See Supplementary Information for detailed ascertainment and adjudication procedures. Hazard ratios were estimated per SD increment in plasma protein level using Cox proportional hazards models adjusting for the same covariates as cross-sectional analyses. Participants with prevalent disease (AF or HF) at the start of follow-up were excluded from those analyses, and time at risk was calculated as the time from the date of MESA exam 5 to the incident event, death, loss to follow-up, or end of follow-up, whichever occurred first.

Lastly, we estimated associations between proteins validated with at least one outcome and basic clinical characteristics adjusted for field centers. All analyses were conducted using R (version 4.2).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.