Introduction

Alcohol use among college students poses a significant public health concern, contributing to increased risks of injury and chronic health problems, as well as impairments in academic performance and productivity 1,2,3. To mitigate the risks associated with alcohol use, it is important to understand the factors related to drinking behaviors among college students. Multiple studies on alcohol use in college students have found that drinking behaviors are heavily influenced by social relationships 4,5,6,7,8,9, defined by ‘friends’ or ‘peers’ broadly. However, peer relationships can be categorized into various types depending on their underlying social purposes. For instance, you might consider one person your mentor (leader), and go to someone else when you have a hard time and want to talk (emotional supporter); that is, a person’s perceived leadership network and emotional support network might be configured with different ties (though of course, the same individual might serve multiple functions).

Given this diversity of social ties, one may ask: does knowing the structure of a person’s emotional support or leadership network also tell us something about the shape of their perceived drinking network? This question concerns whether certain types of social ties are more strongly associated with alcohol use. Prior evidence suggests that individuals tend to respond more to alcohol-related posts from close connections in their online networks, offering a potential clue in this regard 9. Building on this, a more granular classification of perceived social networks may help identify specific relationship types that are more closely associated with, or reflective of, patterns of alcohol consumption. Such a perspective aligns with theoretical frameworks like Primary Socialization Theory (PST), which, in this context, highlight how alcohol-related behaviors might develop through primary social group interactions 10. While our analysis does not aim to test PST directly, our findings regarding the alignment of certain social layers (e.g., emotional support or leadership) with perceived drinking behaviors may offer structural insights into the peer dynamics underlying socialization processes.

While several studies have examined the structural similarities among various types of social ties–often using node characteristics, centralities, edge overlap, or entropy measures–most of these focus on generalized or online networks 11,12,13,14,15,16,17, such as smartphone proximity data 11 or online gaming activity 13. Consequently, the structural similarities among offline multi-social layers (i.e., networks) have not been thoroughly examined, which has in turn limited our understanding of how specific social relationships are associated with perceived drinking-related networks 18.

To address this gap and understand how different types of social relationships relate to college students’ perceived peer drinking behaviors, we examined the following research questions using a survey dataset on offline social networks among college students. First, do social networks that are structurally similar—based on systematic comparisons of network features across different types of social relationships—also exhibit functional similarity in terms of the roles they play within the group? Second, how are structural and functional similarities among social networks associated with perceived drinking behavior within peer groups? Third, can structural similarities across different types of social relationships be leveraged to predict missing links or infer unobserved attributes associated with peers’ perceived alcohol consumption?

Grounded in these research questions, we analyzed diverse social relationship types surveyed within nine college student groups (e.g., clubs, teams, etc.), including perceived drinking nominations (351 participants across nine student groups started the survey, 336 participants finished the survey [69 males, 266 females, one unknown]; see Methods). To ensure that we captured a broad range of interpersonal dynamics, we included ten peer nomination items reflecting affective, behavioral, and structural ties. These items were informed by prior network studies on emotional cohesion, social support, leadership, and behavioral norms 19,20,21, and have been widely used in peer network research. In the survey, students were asked to nominate group members in response to each of these ten questions—such as “Which group members do you like the most?” (Like), “Who would you say are the leaders of the group?” (Leaders), and questions regarding perceived alcohol consumption—in order to examine how perceived drinking networks are coupled with other layers of social connection (see Table 1 in Methods for all questions).

All peer nominations for each type of social relationship were represented as separate networks (i.e., layers), where links indicate the presence of a nomination between individuals (i.e., nodes). We refer to each of these relationship-specific networks—such as the network formed by nominations from the “Like” question or the network from the “Leaders” question—as a network layer to adopt the language of multilayer network analysis. Each layer corresponds to a single question in our social network survey and captures one distinct aspect of group interactions (see Fig. 1a, b). To answer the first research question, we analyzed the structural characteristics of each layer by calculating network measures at three levels, which were then compared across layers (Fig. 1b). Such measures include evaluating individuals’ positional importance through centrality measures (i.e., node-level), counting common links between layers (link-level), and identifying triadic motifs (triad-level) between different layers. These features were then grouped into structurally similar clusters using unsupervised k-means clustering (Fig. 1d).

In addition to comparing structural similarity across layers, we further examined how these social relationships vary in their underlying group structure by projecting each layer onto a spectrum ranging from hierarchical (vertical) to horizontal (egalitarian) configurations as shown in Fig. 1c, d 22. This distinction reflects how different relationship types are structured to meet different social needs: hierarchical relationships often involve clear directionality and ordering of roles, whereas horizontal relationships are characterized by reciprocity and mutual exchange. Understanding these structural dimensions offers a theoretical basis for interpreting the positioning of different layers within the spectrum and the behavioral dynamics they may represent. The clusters identified through k-means clustering and projection-based classification were subsequently used to evaluate how well each social layer could predict missing links in drinking-related relationships. This analysis, which addresses the second and third research question, was conducted under the assumption that prediction accuracy would improve when more strongly correlated social layers are incorporated.

Collectively, we aim to establish a framework for identifying structural and functional similarities across multiple layers of social relationships and to examine how these structural characteristics relate to perceived alcohol-related networks among college students. By doing so, we assess whether structurally and functionally similar social layers can be leveraged to infer missing information in drinking-related networks, thereby enhancing our understanding of peer influence dynamics and informing future intervention strategies.

Fig. 1
Fig. 1
Full size image

Overview of the structural similarity analysis and clustering of social network layers. (a) Ten peer-nominated social layers, including two layers related to perceived drinking behavior, were collected from college student clubs. (b) Example illustration of three directed network layers (e.g., Like, Drinking Most, and Good News) representing different social relationships within a single student group. Each layer represents a specific type of social network (e.g., Like, Drinking Most, and Good News) and contains directed links between students (nodes), based on peer nominations. Green arrows indicate directed ties that are shared across multiple layers, and yellow highlights mark local triadic structures. The elements used for node-, edge-, and triad-level analysis are shown. (c) Social layers are positioned along a conceptual axis from hierarchical (e.g., Leaders layer) to horizontal (e.g., Affiliation-related layers), reflecting the degree of directionality and reciprocity in each network. (d) Node-, edge-, and triad-level similarity measures are used to perform unsupervised k-means clustering of network layers. Structural similarity scores are projected onto the first two dimensions (Dim 1 and Dim 2) of a low-dimensional embedding (e.g., PCA), where Dim 1 and Dim 2 capture dominant patterns of structural similarity across layers. Each layer is also positioned along a conceptual hierarchical–horizontal axis. The alignment between these two independent perspectives–an unsupervised clustering and a projection onto a conceptual relationship axis–highlights groupings of social layers that are both structurally and functionally similar.

Methods

Multilayer social network data from college student groups

We initially recruited 15 student clubs from the University of Pennsylvania and Columbia University, offering an Amazon gift card as an incentive for participation through the Social Health Impact of Network Effects (SHINE) study 23. To ensure data quality, we included only those groups in which more than 25 participants completed at least 60% of the survey, resulting in a final sample of 351 participants across nine student groups. All research, methods, and study protocols were approved by the Human Subjects Electronic Research Application (HSERA) Institutional Review Board (IRB) at the University of Pennsylvania and were acknowledged by the Human Research Protection Office of the Department of Defense, with reliance agreements and local context review approvals at Columbia University. All research, methods, and study protocols were conducted in accordance with the Human Subjects Institutional Review Board (IRB) at the University of Pennsylvania and the Human Research Protection Office of the Department of Defense. All methods were performed in accordance with relevant guidelines and regulations. All participants provided informed consent before taking part in the study and were financially compensated.

Age and gender distributions are shown in Supplementary Figure S1. For the present analysis, we selected nine student groups with more than 25 members, ensuring that over 60% of their members had completed the survey to maintain the group members’ representativeness. These groups have different foci (e.g., sports teams, music, dance performance), differ in size from 25 to 87 people, and include four woman-only and five mixed-gender groups. Participants were compensated $20 (via Amazon gift card) for completing a social network survey and an additional $20 (via gift card) if more than eighty percent of their student group completed the survey. While our sample was limited to student organizations from two private universities in the northeastern United States, the groups represented a range of club types (e.g., academic, social, athletic), gender compositions, and group sizes. This diversity offers some variation in social contexts, which may support cautious generalization to other college settings with similar organizational structures. However, further research is needed to evaluate whether these findings hold in broader institutional or cultural contexts.

Building on previous research indicating that the social life of a group can be mapped through multiple relational dimensions—as individuals adopt different roles within the group 24,25—we collected participants’ peer nominations in response to ten relational questions designed to capture these various aspects. Prior work in social network analysis highlights the importance of distinguishing among affective (e.g., liking, closeness), behavioral (e.g., interaction, support), and structural (e.g., leadership, influence) ties to better understand the complexity of group interactions 25. For instance, the “Like” and “Closest” items assess affective bonds that are central to emotional cohesion 19, while “Good News” and “Bad News” reflect asymmetrical trust and social support–seeking behaviors 25. Items such as “Leaders” and “Influence” capture power dynamics and perceived centrality in group decision-making 20,26. In addition, “Drinking Most” and “Drinking Least” reflect health-related behavioral norms within peer groups, which are particularly relevant to studies on college socialization and risky behavior. Although previous studies did not use the same set of items, they consistently emphasize the importance of capturing diverse types of peer relationships in adolescent and young adult networks. Motivated by these insights, we asked ten peer nomination questions to assess multiple dimensions of social relationships within peer groups. The ten survey questions are presented in Table 1.

Table 1 Questions and their abbreviations for ten social relationships of college student groups.

Participants were given the option to select as many group members as they felt were appropriate in response to each question, but we excluded from our analyses self-nominations or nominations of members who did not complete the social network survey. We then aggregated participants’ answers to these questions to specify ten social networks for each group. In each network, a directed link from individual A to individual B exists if A nominates B on the corresponding relationship question. For instance, if Aspen nominates Bailey on the question “Which group members are you closest to?”, a link is created from Aspen to Bailey in the Closest social network of their group. The basic network characteristics are summarized in Supplementary Table S1.

Node-level centrality correlations

One can hypothesize that participants’ positions would demonstrate overlap across functionally similar layers 27. To evaluate this, we computed node-level similarity by comparing centrality values across layers while preserving node identities. Specifically, we measured the in-degree, eigenvector, and betweenness centrality for each individual in each social network layer, treating each layer as a separate single-layer network, in order to capture each individual’s relative prominence or positional importance within different types of social ties. These centralities were computed using the python-NetworkX library, and Spearman’s rank correlation coefficient was calculated using python-scipy. The Spearman correlation coefficient quantifies the similarity in rank order of centrality values across layers, with a value of 1.0 indicating perfect rank agreement between two layers.

In this context, in-degree reflects the number of nominations received by each node—indicating how many peers identify a person in a given role (e.g., how many people view someone as a leader or emotional supporter), which can be interpreted as perceived popularity. Eigenvector centrality captures how well-connected a person is to other well-connected individuals, and can be interpreted as their overall influence within the group—for example, someone whom influential peers also tend to nominate. Betweenness centrality quantifies how often an individual lies on the shortest paths between others, serving as a proxy for brokerage or gatekeeping—for instance, a person who connects otherwise separate subgroups within the network.

Link-level similarity

To measure the similarity of the edges between two different layers of a social group (e.g., Like and Closest), we calculate the Jaccard similarity between the pair of layers. Here, \(J(L_{g,\alpha },L_{g,\beta })\) represents the Jaccard index between two layers, \(\alpha\) and \(\beta\), in student group \(g\). As one means of performing this calculation, we use indicators on all ordered node pairs specified by two binary vectors \(E_{g,\alpha }\) and \(E_{g,\beta }\), where a 1 entry indicates the presence of the corresponding directed edge in a given layer while a 0 entry represents its absence. Equivalently, \(E_{g,\alpha }\) indicates the edge list of layer \(\alpha\) for student group \(g\). The Jaccard index is then given by

$$\begin{aligned} J(L_{g,\alpha },L_{g,\beta }) = {\frac{|E_{g,\alpha } \cap E_{g,\beta }|}{|E_{g,\alpha } \cup E_{g,\beta }|}}. \end{aligned}$$
(1)

That is, the Jaccard index measures the number of directed edges that the two layers \(L_{g,\alpha }\) and \(L_{g,\beta }\) share with each other (size of intersection) relative to the total number of directed edges in their union. This value reflects the extent to which the same directed ties—that is, the same student nominating the same peer—occur in both layers (e.g., whether student A nominates student B as both a “Good news” and “Bad news” friend). After calculating the Jaccard similarity indices for all possible pairs of the ten layers for each of the nine college student groups, we averaged the results over student groups to obtain a single Jaccard similarity matrix for a visual representation of the similarities between layers. Finally, we applied standard hierarchical clustering with the cluster map function in the Seaborn package of Python to identify relevant layer groupings, with results presented in Fig. 2b.

Triad census analysis

The triad census is useful for summarizing social relationship properties in directed networks 28. Various other characteristics of the network can be inferred from the triad counts—including network density, transitivity, and dyadic properties—and they also inherently give more involved information characterizing the different transitive patterns of pairwise interactions possible between three actors 29. In our context, this triad census analysis helps distinguish how different types of social ties (e.g., Like vs. Leaders) differ in structural patterns such as reciprocity, hierarchy, or transitivity. A full triad census includes counting the instances of the sixteen possible three-node motifs in a directed graph as shown in Fig. 3a. The label of each triad motif is written with three integers (indicating the number of reciprocal links, unidirectional links, and non-connected node pairs in the motif) and sometimes with an additional letter among \(\{C, D, U, T\}\) to indicate Cyclic, Down, Up, and Transitive patterns, respectively (again, see Fig. 3a). We then applied the Triad Significance Profile (TSP) methodology suggested by Milo et al. 30 to obtain a normalized vector of Z-scores which can be written as

$$\begin{aligned} \text {TSP}_j = \frac{Z_j}{\sqrt{\sum {Z_j}^2}}, \end{aligned}$$
(2)

for each layer in a student group, where \(Z_j\) is the Z-score of triad motif j (in layer \(L_{g,\alpha }\)) relative to the counts of that motif in randomly rewired versions of the network layer. That is, \(Z_j = (N_j-\langle R_j \rangle ) / \sigma (R_j)\), where \(N_j\) denotes the count of triad motif j in the empirical network, \(R_j\) is a set of random realizations of this count (in our case, from an ensemble of 50 randomly rewired networks with the same degree sequence), angled brackets indicate the average and \(\sigma (\cdot )\) the standard deviation of the set. The denominator in Eq. (2) ensures the collection of TSP scores for a given empirical network is normalized to length 1 (in a sum of squares sense).

Unsupervised k-means clustering based on multi-level similarity features

We average the node, link, and triad similarities of a student group by layers to define a collective feature associated with each social network layer \(\alpha\) for the student group \(g\). Specifically, we averaged together each group’s structural similarities corresponding to the three matrices in Fig. 2—for node-level similarity based on the Spearman correlation (in-degree, eigenvector, betweenness centrality similarity), link Jaccard similarity, and TSP similarity, so each student group g has \(10\times 10\) matrix of the averaged similarity. We then applied Principal Component Analysis (PCA) to this \(9 \times 10\times 10\) matrix (10 layers for each of the 9 groups, each with 10 features). When we check the scree plot by varying the number of PCA components (see Fig. 4a), we confirm that the first two components together explain around \(69\%\) of the variance, while the first three components together explain around \(77\%\) of the variance. Reducing the similarity matrix to these three PCA directions, we subsequently applied k-means unsupervised clustering to further visualize how multi-level similarity can capture the major differences between the Leadership, Affiliation, and Drinking layers. We used five clusters for the visualization in Fig. 4c–e.

Cosine similarity along the hierarchical-to-horizontal axis

We select the closest layers of the nine student groups to represent the most ‘horizontal’ of our observed network layers (notated here as \(\mathcal {X}^+ = \{{\bf v}_1^+,{\bf v}_2^+, \dots , {\bf v}_{8}^+,{\bf v}_{9}^+ \}\)), and similarly select the Leaders layers to represent the ‘hierarchical’ aspect (\(\mathcal {X}^- = \{{\bf v}_1^-,{\bf v}_2^-, \dots , {\bf v}_{8}^-,{\bf v}_{9}^- \}\)). We then measure the centroid position of each by averaging the corresponding representation vectors over the nine student groups: \(\bar{{\bf V}}^{+} = \frac{1}{9} \sum _{i=1}^{9} {\bf v}_i^{+}\) and \(\bar{{\bf V}}^{-} = \frac{1}{9} \sum _{i=1}^{9} {\bf v}_i^{-}\). We then define the Hierarchical-to-Horizontal axis vector by subtracting the centroid of the hierarchical (Leaders) vectors from the centroid of the horizontal (Closest) positions: \({\bf v}_{\textrm{axis}} = {\bar{\bf{V}}^+} - {\bar{\bf{V}}^-}\). To examine the composition along this axis, we calculate the cosine similarity between the axis vector and a vector \({\bf v}_\alpha\) of a social network layer as \(\mathcal {S}_\alpha = {{\bf v}_\alpha \cdot {\bf v}_{\textrm{axis}}}/({|{\bf v}_\alpha || {{\bf v}_{\textrm{axis}}}|})\), where \(\alpha\) denotes a social layer of the Closest, Interact, Like, Good News, Bad News, Offer Support, Influence, Leaders, Drink Most, or Drink Least question in a selected student group.

Link prediction with functional clusters

Our link prediction calculations employ a stochastic block model method developed by DeBacco et al. 31 that uses expectation maximization to compute the likelihood of an edge existing in a particular layer. Specifically,

$$\begin{aligned} M_{ij}^{\alpha } = \sum ^{K}_{{k}, l = 1} u_{i{k}} v_{jl} w_{{k}l}^{\alpha }, \end{aligned}$$
(3)

where \(M_{ij}^{\alpha }\) represents the likelihood of the existence of a directed edge between individuals i and j in layer \(\alpha\), obtained by a sum over blocks \({k}\) and l with mixing matrix \(w_{{k}l}^{\alpha }\) for layer \(\alpha\), and the membership vectors for node i in block \({k}\) and node j in block l given by \(u_{i{k}}\) and \(v_{jl}\), respectively (which can be different for edges in and out of the nodes; an undirected network would have \(u=v\)). This model requires knowledge of or other means of specifying the number of blocks, K, to be used in the link prediction. Because we are focused on the potential for accurate link prediction, we explore a range \(5\le K\le 11\) and in each case select the K with the highest AUC (as in 31) under 5-fold cross-validation (hiding 20% of the edges in the adjacency matrix at a time). In particular, K is selected case by case, specific to each prediction performed; that is, K could (and we expect it to) be different for different target-support combinations of layers. In this study, link prediction estimates how well the structure of one layer can inform or recover the missing links in another, such as using emotional support nominations to infer perceived drinking ties.

Results

We collected social network data on nine college student groups across ten questions listed in Table 1 (see Methods). We represented each student group as a multilayer network, with each node representing an individual member and with directed links between nodes in a layer encoding a specific social relationship according to the nominations on the corresponding survey question (e.g., friendship, leadership, etc.; see Methods). To compare and contrast layers (questions), we consider three different structural levels: (i) how individuals’ positions differ across layers (node-level similarity), (ii) whether relationships between pairs of individuals co-occur in different layers (link-level similarity), and (iii) how measures on node triples correlate across layers (triad-level similarity). Across all three levels, we find consistent functional clusters of layers based on their structural similarities corresponding to identifiable three-level structural similarities. The consistency of these layer/question clusters is further supported by unsupervised clustering on a principal component representation of the full set of similarity comparisons, as well as by the projection-based clusters along the hierarchical-horizontal relationship axis. Finally, we show how they impact the potential for accurate link prediction, with a special focus on predicting missing links in the alcohol-related layers of our survey data.

Node-level and link-level similarity

Fig. 2
Fig. 2
Full size image

Averaged layer-to-layer matrices for node-, link-, and triad-level similarities. (a) Spearman rank correlation coefficients of node in-degree between each pair of layers. (b) Jaccard similarity coefficients of overlapped edges between pairs of layers (i.e., pair counting directed edges that appear in both, one, or the other). (c) Spearman rank correlation coefficients of the 16 parts of the Triad Significance Profile (TSP) between each pair of layers. All coefficients were computed separately within each student group and then averaged across the nine student groups. Dendrograms were computed using clustermap in the Seaborn Python library. In all panels, colors closer to 1.0 indicate stronger similarity or correlation between layers. For example, in (a), a lighter shade between the “Like” and “Closest” layers indicates that students who receive many nominations in the “Like” layer also tend to receive many in the “Closest” layer.

Using the node-level similarity (see Methods), we hierarchically clustered the average in-degree correlations (averaged across student groups) between layers, revealing two primary clusters: an Affiliation cluster—comprising Closest, Like, Interact, Good News, Bad News, and Offer Support—and a Leadership cluster, consisting of Leaders and Influence layers, as shown in Fig. 2a. Within the Affiliation cluster, we further observed a subdivision into two sub-clusters: one composed of Good News, Bad News, and Offer Support (which we refer to as the Emotional Support cluster), and the other including Closest, Like, and Interact (the Friendship cluster in this study). Although the specific hierarchical structure slightly varies depending on the centrality measure used, the clustering into Emotional Support, Friendship, and Leadership groups remains consistent (see Fig. S2). These results suggest that individuals occupy clearly differentiated roles within Affiliation and Leadership clusters in the social networks of the surveyed college student groups. Moreover, even within the broader Affiliation category, we observe distinct structural node-level characteristics between layers associated with everyday social interaction—such as Closest and Like—and those reflecting emotionally supportive relationships, such as Good News, Bad News, and Offer Support. This finding highlights a meaningful divergence in node-level roles between friendship-oriented and emotional support-oriented ties, even though both stem from perceived peer relationships.

In terms of the relationships with other social networks, the Drinking Least layer (nominates related to the question of “Which group members drink the least alcohol?”) has very little node-level similarity with any other layer (see Fig. 2a and Figure S2), whereas the Drinking Most layer (nominates related to the question of “Which group members drink the most alcohol?”) is closely grouped with the Leadership cluster (the Leaders and Influence layers). These findings suggest a potential alignment between social status and perceived drinking behavior, with individuals seen as leaders or influential figures also more frequently nominated as heavy drinkers.

Building on the patterns observed at the node-level similarity, we found that similar clustering structures also emerged at the link-level similarity. As shown in Fig. 2b, hierarchical clustering of the average Jaccard coefficients again revealed a Leadership cluster and an Affiliation cluster, with the latter further subdividing into Friendship and Emotional Support layers—mirroring the structure observed in the node-level analysis. Within the Leadership cluster, the strong overlap between the Leaders and Influence layers is reflected in a high Jaccard score of 0.49, indicating that if a participant nominates someone as a leader, they are also likely to nominate the same person as influential. At the same time, we observe notable variability in edge overlap across other functional clusters, with Jaccard scores ranging from 0.5 to 0.57 within the Emotional Support cluster, and from 0.37 to 0.49 within the Friendship cluster. These patterns highlight both the internal coherence of functional clusters and the nuanced differences in tie formation within and across them.

In addition to the main functional clusters, we also observe notable characteristics regarding the placement of the Drinking Most layer. Although its average Jaccard similarity with the Leaders and Influence layers is relatively low—0.15 and 0.17, respectively—it still clusters with them, likely due to its even lower similarity with other layers. This shows that while Drinking Most does not strongly overlap with any one layer, its structural position is more closely aligned with leadership-related ties than with affiliative ones. To further explore these relational patterns beyond dyadic overlap, we next turn to triadic motif analysis, which captures higher-order structural features between layers.

Triad-level similarity

Fig. 3
Fig. 3
Full size image

Structural comparison of social layers using triad significance profiles. Triad Significance Profiles (TSP) of a layer in each student group are constructed from counts of the 16 possible directed triad motifs visualized in (a). TSPs of each layer are plotted here and grouped into the (b) friendship cluster (closest, interact, like), (c) emotional support cluster (bad news, good news, offer support), (d) drinking nominations (drinking most and drinking least), and (e) leadership cluster (influence and leaders layers). Each colored line represents the TSP of a single student group for the corresponding social network layer, capturing the triadic structure unique to that group’s social context..

While node- and link-level analyses revealed consistent clustering across social layers, they primarily capture pairwise relationships or individual-level prominence. To further investigate higher-order structural patterns, particularly those that may distinguish nuanced differences in group dynamics, we applied triadic motif analysis using the Triad Significance Profile (TSP) 30 (see Methods).

Triadic motifs offer insight into the underlying relational dynamics of each network layer, particularly in terms of reciprocity and local transitivity. For example, we hypothesized that layers in the Affiliation cluster would exhibit greater reciprocity and triangle closure, as individuals in these layers are more likely to form mutual or transitive ties (i.e., friends of friends tend to be friends). This expectation is confirmed in the TSPs of the Friendship and Emotional Support clusters (see Fig. 3b, c), which show increased prevalence of motifs such as ‘201’, ‘210’, ‘102’, and ‘300’, all of which reflect mutual nominations and closed triads. In particular, the Friendship cluster demonstrates strong overrepresentation of these motifs, consistent with the hypothesized highly reciprocal structure.

While Emotional Support layers show similar patterns, they also exhibit relatively higher proportions of motifs such as ‘210’ and ‘120D’, which indicate hierarchical structuring in triads 32. This suggests that emotional support ties may involve an implicit sense of directionality or social differentiation, distinguishing them from more egalitarian friendship networks. Notably, these motif patterns are consistently observed across nearly all student groups, as shown by the tightly aligned TSP profiles in both the Friendship and Emotional Support clusters (see Fig. 3b, c). This cross-group consistency further supports the structural distinctiveness and functional coherence of these relationship layers.

In contrast to the consistent TSP patterns observed in the Friendship and Emotional Support clusters, no clear motif-level regularities were found in the layers of the Leadership cluster. As shown in Fig. 3e, the Leaders and Influence layers display considerable variability in their triad profiles across student groups, with no dominant motifs emerging. A similarly scattered pattern is observed in the Drinking Most and Drinking Least layers (Fig. 3d), suggesting a lack of coherent triadic structure in how students perceive these roles. One possible explanation is that these layers are shaped by highly localized or asymmetric nomination behaviors—often centered around a few prominent individuals—making them less likely to exhibit consistent motif signatures across groups. Additionally, the TSP method may be less effective in detecting triadic patterns in sparse or hub-dominated layers.

Unsupervised k-means clustering of structural similarities

To gain a more integrated view of how social layers relate to one another, we applied unsupervised k-means clustering to the combined node-, link-, and triad-level similarity measures for each layer within each student group (see Methods). As shown in Fig. 4c–e, the clustering based on the combined multi-level similarity data closely aligns with the hierarchical clustering results observed at each separate level of structural analysis. Specifically, we assigned each social layer in a student group to one of five clusters identified through k-means clustering, which clearly captures the separation between Leadership (Clusters 3 and 4) and Affiliation (Clusters 1 and 5) layers (see Fig. 4). Furthermore, the earlier subdivision of the Affiliation cluster into Friendship and Emotional Support sub-clusters is reflected in the separation between Clusters 1 and 5, which appear close to each other in the low-dimensional projection. In contrast, the two Drinking layers form their own distinct group, constituting Cluster 2.

Fig. 4
Fig. 4
Full size image

Clustering of social network layers based on multi-level structural similarity using PCA and k-means. Unsupervised k-means clustering is applied to classify 10 different social network layers in each of the nine student groups. Node-, link-, and triad-level similarities are averaged for each pair of layers within a student group, separately. Principal Component Analysis (PCA) is used to reduce the 10 resulting similarity dimensions to three components. The scree plot in (a) visualizes the amount of variance explained over different numbers of components. Unsupervised k-means clustering scores on these three PCA components with different numbers of clusters are plotted in (b). The results of clustering into five groups are shown in (ce), projected onto different combinations of the first three PCA axes.

While the first principal component (“PC1”) distinguishes Affiliation from Drinking and Leadership clusters, “PC2” and “PC3” help further differentiate between the subtypes of Leadership and Drinking. Nevertheless, the Drinking layers remain structurally closer to the Leadership cluster, with an average Euclidean distance of 3.41 in the three-dimensional PC space, compared to a greater distance of 4.17 from the layers in the Affiliation cluster. This structural proximity between drinking and leadership layers, alongside their relative distance from affiliation layers, suggests that perceptions of drinking–particularly heavy drinking–may not merely reflect social affinity, but may also serve as a signal of social status or visibility within peer groups, rather than indicating everyday closeness.

Projection to the two general relationship axes of social relations: hierarchical and horizontal

Fig. 5
Fig. 5
Full size image

Projections of social layers onto the hierarchical and horizontal axes of relational types. (a) Each of the 10 layers from the nine student groups is projected onto a hierarchical–horizontal axis, defined by the line connecting the centroid of the Leaders layers and the centroid of the Closest layers, using cosine similarity. A single selected student group’s projected social layers are indicated here by the arrows and annotations as an example. (b) The projection spectrum of each layer type is shown separately, with colored lines indicating the position for each group and black dashed lines representing the average projection across all groups. By comparing this result with that depicted in Fig. 4, we note that qualitatively similar spectra would also be obtained along PC1 (although in the opposite direction).

To complement the clustering of layers based on three levels of structural similarity, we next examine how social relationships differ in their underlying organizational patterns–specifically, whether they are more hierarchical or horizontal in nature. To this end, we project each layer into a reduced structural feature space to determine its position along the hierarchical–horizontal relationship axis (see Methods). In Fig. 5a, the horizontal end of the axis is dominated by social layers characterized by mutual relationships, while the other end of the axis includes hierarchical relationships primarily with asymmetric interdependent patterns. The spectrum for a randomly selected student group in Fig. 5a identifies most of the Affiliation-related layers in the horizontal part of the spectrum, whereas the Leadership and Drinking clusters are placed in the hierarchical part of the spectrum.

Figure 5b displays each social layer from all student groups positioned individually along this spectrum. These differences in projection suggest that layers involving support or emotional disclosure—such as offering support (i.e., Offer Support layer) or sharing bad news (i.e., Bad News layer)—tend to be structurally more hierarchical than layers reflecting general social affinity or casual interaction, such as liking (i.e., Like layer) or sharing good news (i.e., Good News layer). This pattern suggests that not all forms of emotional exchange are structurally equivalent, even when categorized under support-related interactions. We also observe that the Drinking Most layers consistently appear near the hierarchical end of the spectrum, similar to Leadership and Influence, while Drinking Least layers are more variably positioned near the center. This distinction may indicate that heavy drinking behaviors are more commonly observed among students occupying structurally prominent or visible positions, which could potentially contribute to the reinforcement of social norms around alcohol use. In contrast, non-drinking behaviors may be less tied to structural positions and therefore less prominent in the social landscape.

Link prediction for drinking nominations using structurally similar social layers

Having identified structurally and functionally coherent clusters of social layers, we next evaluated whether these groupings can enhance link prediction, particularly for alcohol-related relationships with missing data. Specifically, we tested whether incorporating information from other layers, especially those within the same cluster, improves the accuracy of predicting missing links (see Methods).

Figure 6a shows the average area under the ROC curve (AUC) for link prediction across all target layers (vertical axis), using complete information from a second layer (horizontal axis) within the same student group as input. Diagonal blocks in the figure contain two different AUC scores: the score obtained using only the remaining 80% of the target layer (upper half), and the score obtained using all 10 layers of information (lower half, continuing only to use 80% of the target layer). We observed that AUC scores were comparably high when information from a layer within the same functional cluster as the target layer was provided, approaching the performance achieved using all layers combined. For instance, the Emotional Support layers (Offer Support, Bad News, Good News) yielded high scores when another layer from the same Emotional Support cluster was used as input.

Fig. 6
Fig. 6
Full size image

AUC scores of link prediction with the functionally clustered social networks. (a) Average of 5-fold cross-validated AUC scores of link prediction using two layers (10 runs). The vertical axis labels indicate the target layer being predicted (with 20% of edges held out at a time), while the horizontal axis labels indicate the source of additional information for the link prediction. The diagonal blocks contain two different AUC scores: the upper half value is obtained using only the remaining 80% of the target layer; the lower half value indicates the AUC when this 80% of the target layer is supplemented with full information from all nine other layers. (b) AUC scores using functionally clustered layers to predict missing links in a target layer (horizontal). In the legend, each item describes the source of additional information used for link prediction: ‘Self’ refers to using only the remaining 80% of the target layer, corresponding to the upper half of the diagonal in (a). ‘All’ (brown triangle) indicates the use of the remaining 80% of the target layer plus full information from all other nine layers, corresponding to the lower half of the diagonal in (a). ‘Affiliation’ (green circle), ‘Leadership’ (blue triangle), ‘Emotional Support’ (maroon square), and ‘Friendship’ (yellow-orange pentagon) indicate the use of additional information from the corresponding layers within each functionally similar cluster. Error bars represent the standard deviation across the nine student groups.

Notably, despite their structural similarity, the Friendship and Emotional Support clusters exhibit an asymmetric influence on link prediction, with one direction providing more predictive value than the other. For instance, while layers in the Affiliation cluster (including Friendship) enhance the prediction of Emotional Support ties, the reverse direction yields only marginal improvement. These results suggest that knowing Friendship information helps narrow down which node pairs might and might not be in an Emotional Support layer but not vice versa. This asymmetry indicates that Friendship networks may contain broader structural cues that encompass but go beyond the more intense support-related connections.

In addition to the asymmetric improvements in prediction performance observed across layers, our analysis highlights that structurally and functionally coherent clusters can effectively substitute for the full multilayer network in link prediction tasks. As shown in Fig. 6a, incorporating a single additional layer from the same functional cluster provides predictive performance that, although somewhat lower than when all layers are used, remains relatively close-demonstrating the utility of structurally coherent layers even in reduced settings.

However, combining multiple layers within the same cluster substantially enhances performance, achieving results comparable to those using the entire multilayer network (Fig. 6b). Specifically, average AUC scores achieved by using cluster-based layer sets fall within the range of 0.77 to 0.85, nearly matching the full-information condition. These findings suggest that most of the predictive gain can be captured by leveraging structurally and functionally coherent subsets of layers, offering a practical and efficient alternative to using all layers—particularly in contexts with missing or incomplete network data.

We next examine whether structurally coherent clusters can improve link prediction performance in drinking-related layers. As shown in Fig. 7, incorporating the Emotional Support layers substantially improves prediction performance for both Drinking Most and Drinking Least networks, with accuracy approaching that of using all available layers. However, despite their structural proximity, the Leadership layers yield only marginal improvements in prediction. It is worth noting that this effect remains robust even when a large proportion of links in the target layer is held out.

This contrast raises the question of why Emotional Support layers contribute more to prediction accuracy than Leadership layers, despite their structural similarity. A plausible explanation is that these layers are not only structurally more aligned with the drinking-related layers—closer even than Affiliation as a whole—but also exhibit a more even distribution of nominations across individuals. The interplay between this structural proximity and the broad coverage of ties may together enhance their effectiveness in predicting missing links. In contrast, Leadership-related nominations tend to concentrate on a small number of peers, which may limit their generalizability and predictive utility. These results suggest that structural and functional alignment across specific types of social relationship layers can serve as an efficient basis for recovering missing information in alcohol-related networks, offering a data-efficient approach to understanding behavioral dynamics within peer groups.

Fig. 7
Fig. 7
Full size image

AUC of link prediction on alcohol-related networks using other social layers. AUC of link prediction after removing information from the drinking least (a) and drinking most (b) layers, with varying levels of missing information (from 20% to 80%; x-axis). Each prediction is performed by first randomly removing a specified proportion \(p\)% (shown on the x-axis) of links from the target layer (i.e., drinking least in panel (a), drinking most in (b)), and then using the remaining \((100 - p)\%\) of links in the target layer together with all links from the designated source layers indicated in the legend. The source layers refer either to individual layers within a functional cluster (e.g., the Friendship cluster) or to composite layers constructed by merging the relevant layers using a logical OR operation, depending on the analysis condition (e.g., “OR Friendship” as shown in the legend). Specifically, OR denotes the logical OR operation over links in the listed layers. That is, a link exists in the “OR Friendship” layer if it appears in any of the friendship-related layers. ‘Single Layer’ uses only the remaining portion of the target layer for prediction. ‘All Layers’ adds full information from all other layers. Error bars represent the standard deviation across the nine student groups.

Discussion

In this study, we examined multilayer social networks formed by breaking down peer relationships into ten distinct types, based on peer nomination survey data. Our goal was to examine the structural alignment among distinct types of peer relationships and their utility in recovering missing links–particularly in perceived drinking-related networks. Through node-, link-, and triad-level analyses, we identified clusters of structurally and functionally similar social layers, which were further differentiated along a spectrum from hierarchical to horizontal relationship types. These observed patterns offer conceptual and methodological insights into the ways individuals differentiate among social relationship types–such as affiliative, leadership, and drinking-related ties–by revealing underlying structural similarities in their network patterns.

In particular, the results demonstrate that structural information from one type of social relationship cluster—such as Emotional Support or Friendship—can be used to infer the structure of other layers, including those related to perceived drinking behavior. This finding suggests that different types of social relationships are not structurally independent, but instead organized in patterned and functionally meaningful ways, with implications for theories of social role differentiation and influence. Building on these insights, our results show that providing additional information from layers within the same functional cluster consistently improves link prediction performance. This improvement offers a practical pathway for recovering missing ties in peer networks by leveraging structurally coherent and socially meaningful layers.

While the preceding results highlight the benefits of leveraging structurally coherent layers within the same cluster, our analysis also suggests that structurally adjacent yet distinct clusters may exhibit asymmetric predictive relationships. A salient example of this is the relationship between the Friendship and Emotional Support clusters. While these clusters are functionally aligned and often grouped under the broader category of “peer networks,” our analysis reveals subtle but consistent structural distinctions that shape their predictive roles. Specifically, Friendship layers are characterized by high reciprocity and broad connectivity across all groups, whereas Emotional Support layers are more selective and hierarchically structured. These differences help explain the asymmetric link prediction results: Friendship layers are more effective in predicting Emotional Support ties than the reverse. This asymmetry highlights how nuanced variations in social structure–even among closely related relationship types–can have significant implications for network inference and intervention design.

Theoretically, prior work on social structure and exchange suggests that providing support or sharing news can create implicit role asymmetries, especially when such exchanges involve sensitive or trust-dependent information 33,34. These theoretical perspectives offer a possible explanation for the structural differences we observed between Friendship and Emotional Support layers. Future research could investigate how such structural asymmetries influence practical outcomes like link prediction, potentially guiding more effective intervention strategies in health-related behaviors.

We also confirmed an improvement in link prediction accuracy on perceived alcohol-behavior nominations when using data from layers in the Emotional Support cluster. While these layers are not the most structurally similar to drinking-related layers in terms of triadic or link-level similarity, k-means and projection-based analysis indicate that they are among the closest in low-dimensional space—second only to Leadership layers. This proximity, combined with the distributed structure of Emotional Support nominations, may help explain their effectiveness in predicting links. By contrast, although Leadership layers exhibit strong structural similarity to Drinking layers, their nomination patterns tend to be highly concentrated around a few individuals. This may limit their predictive value, as such concentrated structures might not generalize across the network. Further investigation is needed to clarify how the combination of structural similarity and the distribution of nominations influences link prediction performance, particularly for the Drinking and Leadership layers.

A better understanding of how these features contribute to inferring missing links in drinking-related networks may inform the design of more efficient intervention strategies that draw on selected social layers, such as Emotional Support or Leadership, rather than relying on broadly defined peer networks. In addition to informing interventions, our results may align with theoretical perspectives on social influence, such as Primary Socialization Theory (PST) 10, which highlights the formative role of peer networks in shaping deviant or health-related behaviors. Building on this perspective, our multilayer framework offers a complementary structural lens for further examining how such socialization processes unfold within differentiated peer networks.

We also note some limitations of our study. To characterize the social lives within student groups, we asked participants to nominate other members in response to ten prompts. These prompts were selected to cover the most important dimensions of people’s social lives as identified in previous research 27,35,36,37. Despite our efforts to include diverse aspects of students’ social lives, the set of nomination items may not fully encompass the entire spectrum of meaningful relationships in the college context. Future work might address this limitation by asking participants to nominate other peers in response to a more comprehensive list of social network prompts and then applying dimension reduction methods to this large dataset 38,39.

We also note that the Drinking Most and Drinking Least links in our data are only nominations made by group members about others in the group. In particular, these perceptions can vary from the actual drinking behaviors of those nominated individuals. At the same time, a given nomination of an individual as someone who is typically Drinking Most or Drinking Least in the social group is undoubtedly the result of a complex interplay of different social elements, including the relative visibility of the person in the group as well as potential public/group demonstrations related to alcohol use or attitudes by the nominated target of the response.

Even considering these limitations, understanding the structural similarities and predictive effectiveness of social ties can inform the development of socially focused health interventions. In particular, these insights hold potential for predicting and preventing crisis-related behaviors, such as alcohol use, where direct data collection is often challenging. Specifically, based on a study showing that individuals distribute their emotion-regulation needs across multiple, discrete emotion-specific relationships 40, one could test the intervention effect of diverse peer models to promote healthy behaviors and buffer negative peer influence effects on unhealthy behaviors, such as drinking 41.

Future work could also explore similar situations where different social relationship layers interact with missing information about group or individual behaviors, as seen with the two drinking layers considered in this study. To do these tasks accurately in various settings, extensive exploration of performance across group sizes and different levels of missing information is undoubtedly needed. Such investigations may be particularly useful for designing efficient sampling procedures that accurately capture such comparative behavior nominations even with lower total response rates from the group. Extending these investigations may provide the foundation for developing strategies for uncovering discrepancies within behavioral nominations so that such data can be better used to target positive interventions within social groups.