Uncovering migration systems through spatio-temporal tensor co-clustering

Almquist, Zack W.; Nguyen, Tri Duc; Sorensen, Mikael; Fu, Xiao; Sidiropoulos, Nicholas D.

doi:10.1038/s41598-024-78112-z

Download PDF

Article
Open access
Published: 06 November 2024

Uncovering migration systems through spatio-temporal tensor co-clustering

Zack W. Almquist¹,
Tri Duc Nguyen²,
Mikael Sorensen³,
Xiao Fu² &
…
Nicholas D. Sidiropoulos³

Scientific Reports volume 14, Article number: 26861 (2024) Cite this article

2578 Accesses
1 Citations
3 Altmetric
Metrics details

Subjects

Abstract

A central problem in the study of human mobility is that of migration systems. Typically, migration systems are defined as a set of relatively stable movements of people between two or more locations over time. While these emergent systems are expected to vary over time, they ideally contain a stable underlying structure that could be discovered empirically. There have been some notable attempts to formally or informally define migration systems. However, they have been limited by being hard to operationalize and defining migration systems in ways that ignore origin/destination aspects and fail to account for migration dynamics over time. In this work, we propose to employ spatio-temporal tensor co-clustering—that stems from signal processing and machine learning theory—as a novel migration system analysis tool. Tensor co-clustering is designed to cluster entities exhibiting similar patterns across multiple modalities and thus suits our purpose of analyzing spatial migration activities across time. To demonstrate its effectiveness in describing stable migration systems, we first focus on domestic migration between counties in the US from 1990 to 2018. We conduct three case studies on domestic migration, namely, (i) US Metropolitan Areas, (ii) the state of California, and (iii) Louisiana, in which the last focuses on detecting exogenous events such as Hurricane Katrina in 2005. In addition, we also examine a case study at a larger scale, using worldwide international migration data from 200 countries between 1990 and 2015. Finally, we conclude with a discussion of this approach and its limitations.

Search-and-rescue in the Central Mediterranean Route does not induce migration: Predictive modeling to answer causal queries in migration research

Article Open access 03 August 2023

A highly granular temporary migration dataset derived from mobile phone data in Senegal

Article Open access 20 June 2025

Forecasting asylum-related migration flows with machine learning and data at scale

Article Open access 27 January 2022

Introduction

A central problem to the study of migration is how to define and detect migration systems^{1,2,3,4,5,6,7}. Migration systems represent an “emergent social entity,” continually evolving and exchanging people over varying levels of spatial and temporal scales⁸. There have been some notable attempts to formally or informally define migration systems^8,9. Still, they have been limited by either being hard to operationalize¹⁰ or defining migration systems as symmetric (rather than directed origin/destination of the migrant), static, or both symmetric and static. Most recently, the work by Abel et al.⁸ has employed a clustering algorithm—this is a major area of study in computational social science, social network, and network science, see for example^11,12,13—which allow for directed networks¹⁴ to detect international migration systems over five-year aggregates of migration data; however this work only considers clustering on static snapshots which are then strung together for analysis. Using clustering methods statically focuses on differences in migration clusters rather than on finding a harmonized set of clusters over time and space.

Like previous research in the area, this article leverages the idea that one can represent migration flows as a weighted graph or network^15,16. Here, we center on the “raw” migration data, i.e., the counts of individuals or households between two geographical units (e.g., United States and Mexico or Los Angeles County, CA and King County, WA). By representing the migration flows between such spatial units, one can employ tools from social networks¹⁷, network science^18,19, and other computational social sciences²⁰ to analyze this data.

Computational social science and its allied fields—Social Network Analysis²¹, and Network Science²²—have a long history of studying network clustering and Community detection problems. Classic Community detection methods^23,24 look for clusters of nodes in a graph or network. Primary Community detection methods in the literature include optimal modularity^25,26,27, edge-betweenness⁷, leading eigenvector²⁸, fast-greedy^19,29, multi-level³⁰, walktrap³¹, label propagation³², and infoMap³³. However, these approaches typically do not consider the temporal modality; incorporating the time domain information in Community detection is of interest to both social and physical sciences; for example, such techniques have been used to explain Biology mechanisms^7,34 and the group dynamics of Windsurfers³⁵.

There is a well-established mathematical literature on migration dynamics, drawing on methods inspired by statistical physics. Knopoff et al.³⁶ have utilized kinetic theory to model crowd behavior and migration processes, demonstrating how individual-level interactions can aggregate into large-scale patterns^37,38,39. Compartment models, such as those developed by Rogers⁴⁰ and others for multiregional demography and SIR models often solved using differential equations or stochastic methods, also play a pivotal role in understanding the distribution of populations and their transitions across different states. In the context of consensus dynamics, agent-based models have been used to explore rural-urban migration dynamics, highlighting how individual choices aggregate to influence population distributions⁴¹. Unlike these models that study migration from a dynamical system viewpoint, our study adopts a tensor model based on multilinear algebra. The tensor model captures the migration data’s cross-domain dependence over space and time, offering a distinct perspective.

Many network problems, such as international or domestic migration, are dynamic in nature, and a method that considers this property is preferred. There has been growing interest in holistically applying Community detection methods to dynamically evolving networks. Finding Communities in dynamically changing networks has primarily been done by using the classic Community detection methods to network “snapshots” or panel data and analyzing how the system has changed. For example⁴², studies the change of node associations in graphs that are collected sequentially, and⁴³ studies the computational aspects of adapting new Community structures quickly based on previously estimated Communities. One can find a brief review of dynamic Community detection methods in this book chapter⁴⁴.

Notably, dynamic Community detection^42,43,44 centers on change in Community structure. Instead, our interest lies in discovering the consistency of Community structure over time. Within migration systems analysis, there has been one attempt at applying Community detection methods to international migration. This includes work using compartment models⁴⁰ and other attempts to classify movement between geographies, such as gravity models⁴⁵. More recently, network-based approaches have been applied in the field. Specifically, Abel et al.⁸ used the infoMap Community detection method over five-year migration flows and subsequently analyzed the change in Community structure over the observed periods. This article introduces a technique for holistically measuring the Community structure over time, focusing on stable Communities rather than differences. At the end of our results section, we compare the international migration system in Abel et al.⁸ to our method. Further, we compare the walktrap method applied to pre- and post-Hurricane Katrina to our method, where we find local clustering compared to a limited set of non-local clustering, and our method allows for overlapping clustering and a measure of significance for the Community/migration system.

In terms of methodology, we propose to employ a spatio-temporal (ST) tensor co-clustering method from the signal processing and machine learning literature^46,47. Tensors are a natural format to store data having multiple modalities (e.g., the migration counts indexed by origin, destination, and time). Tensors also encode the cross-modality dependencies using the notion of tensor rank, a high-order generalization of the matrix rank. The ST tensor co-clustering method allows for a low tensor rank representation of a weighted spatial-temporal graph, e.g., origin-destination counts or other migration measures acquired over time. At each time point, the weighted graph is defined (in this case) by an origin-destination directed adjacency matrix where an edge represents the number of migrants from one spatial unit to another (e.g., Los Angeles County to New York County). This representation results in a data-driven migration system that meets the concept of a migration system in the literature (e.g., Massey et al.^{1, p. 61]}). Every rank-one tensor extracted from the ST tensor co-clustering model represents a migration “Community” (e.g., collection of counties) whose members maintain a spatial interaction pattern with each other and share a similar temporal profile.

The ST tensor co-clustering—under this data definition—identifies the stable temporal clusters of the weighted graph (e.g., migrant counts from the United States and Mexico) and its temporal intensity over time (e.g., the Mexican-born population peaking in 2007 and decreasing post-2011; for attempts to estimate world migration rates, see^48,49). To demonstrate the effectiveness of this approach, we consider two datasets and several case studies. First, we apply it to domestic migration data within the United States (US) from 1990 to 2018 and international migration data at five-year intervals between 1990 and 2015. The US Internal Revenue Service (IRS) makes publicly and freely available migration data at the state and county levels^50,51,52. These data are built from address information in year-to-year tax returns, covering approximately 87% of all US households⁵³. The IRS migration data represents a particularly unique and valuable set of migration data for the US⁵⁰. The US Census Bureau uses the IRS migration data to produce state and county net migration estimates as part of its Population Estimate Program⁵⁰. This data set is ideally suited for testing ST co-clustering methods. Our final case study is based on international migration. Specifically, we apply the ST tensor co-clustering approach to the international migration data constructed by Azose and Raftery^48,49, and updated by Abel et al.⁸. International migration has a long history of theory on migration systems with a strong interest in empirically finding stable country clusters over time but with limited actual methods and applications. The ST tensor co-clustering method is again uniquely suited for this task.

Migration Systems: The attempt to capture the persistent interchange of people between places over time has been referred to in the literature as migration systems^{1, p. 61}; according to Massey et al.^{1, p. 61}: “[t]he end result is a set of relatively stable exchanges of people between [places ... yielding an identifiable geographic structure that persists across space and time.” In particular, these systems are expected to be sustained over time, emergent, and vary by spatial and temporal scales², making them naturally representable by mathematical graphs or networks³. These systems should be expected to exist at the international and local levels⁸ as a hierarchical process. In this work, we will look to operationalize this concept of a migration system. Through the ST tensor co-clustering algorithm, we aim to find stable spatiotemporal “systems” in the United States’ internal migration and the international migration estimates from 1990 to 2015⁴⁸.

Spatial-temporal tensor co-clustering for migration systems: The ST tensor co-clustering approach takes a three-way array as its input. The tensor has a size of $I \times I \times K$, where I is the number of geographical entities (e.g., counties, cities, and countries) and K is the number of temporal samples (e.g., years or months). The tensor is represented using the notation $\mathcal{X}\in \mathbb {R}^{I\times I\times K}$. Every entry of $\mathcal{X}$ has three coordinates. For example, in the IRS migration data we analyze in this work, the entry $\mathcal{X}(i,j,k)$ represents the number of migrants moving from county i to county j in year k. It can be regarded as a natural extension of a matrix whose entries only have two coordinates. When fixing k, the matrix (or the kth “tensor slab”) $\mathcal{X}(:,:,k)\in \mathbb {R}^{I\times I}$ is the weighted graph (e.g., origin to destination counts) collected in the kth year. The diagonal entries of every such matrix are ignored during data analysis using an incomplete tensor decomposition technique. The reason is that the diagonal elements do not have meaning in this migration flow analysis (i.e., we do not have measurements on within-county or within-country mobility patterns). The tensor co-clustering method decomposes $\mathcal{X}$ into the summation of F rank-one tensors, where F is pre-specified (we chose this based on information decay in the model fitting process). After the co-clustering optimization algorithm converges, F migration Communities will be discovered (note that we use migration system and migration Community interchangeably in this work). Each Community is represented by a tuple of vectors $({\textbf{a}}_f\in \mathbb {R}^I, {\textbf{b}}_f\in \mathbb {R}^I, {\textbf{c}}_f\in \mathbb {R}^K)$. The ${\textbf{a}}_f$ vector is an origin entity indicator, where ${\textbf{a}}_f(i)$ indicates the level of involvement of entity i in the fth migration system. The ${\textbf{b}}_f$ vector is defined similarly for destination entities. The vector ${\textbf{c}}_f$ represents the temporal profile of the fth migration system, i.e., how active this system is each year. Furthermore, the matrix ${\textbf{a}}_f {\textbf{b}}_f^T$ represents the spatial association of origin and destination entities in the Community, while $\varvec{c}_f$ encodes temporal intensity of the association. The tuple forms a rank-one tensor $\mathcal {C}_f$ by the outer product operation, i.e.,

$$\begin{aligned}&\mathcal {C}_f = {\textbf{a}}_f \circ {\textbf{b}}_f \circ {\textbf{c}}_f =({\textbf{a}}_f{\textbf{b}}_f^T)\circ {\textbf{c}}_f^T \\&\mathcal {C}_f(i, i, k) = 0, \quad \text {for all }1 \le i \le I, 1\le k \le K, \end{aligned}$$

where $\circ$ denotes the outer product, i.e., The above can also be expressed as

$$\mathcal{C}_f(i,j,k)={\textbf{a}}_f(i){\textbf{b}}_f(j){\textbf{c}}_f(k).$$

The readers are referred to more detailed definitions of tensor operators in Ref⁴⁶.

This rank-one representation is exactly a stable migration system with time-varying activity levels. The rank-one tensor representation of a spatio-temporal migration system is illustrated in Fig. 1. In this system, the origins are San Francisco and Santa Clara. Hence, ${\textbf{a}}(1)$ (San Francisco) and ${\textbf{a}}(2)$ (Santa Clara) are nonzero. The destinations are Alameda, San Mateo, and Marin, and thus, the corresponding ${\textbf{b}}(j)$’s ($j=3,4,5$) are nonzero—as shown in the lower subfigure. In addition, the top table shows ${\textbf{a}} {\textbf{b}}^T$, i.e., the spatial association of transmitters and receivers. The migration intensity is the ${\textbf{c}}$ vector, which reflects how this system’s activity level varies over the years. When multiple migration systems are simultaneously present, the associated data tensor is described by a sum of spatio-temporal rank-one terms.

Results

To illustrate the viability of using the ST tensor co-clustering method for understanding migration systems, we employ two datasets and four case studies: (i) US Metropolitan Areas, (ii) California, (iii) Louisiana with a focus on detecting exogenous events such as Hurricane Katrina in 2005 and (iv) international migration from 1990 to 2015 at five-year intervals. Case studies (i)-(iii) are conducted with IRS migration data⁵⁰, and case study (iv) is based on international migration data from⁴⁹.

IRS data—US census metropolitan statistical areas

Migration scholars have often focused on the economic, social, and political impact of internal migration in the United States^54,55. Internal migrants, unlike international migrants, are attracted to destinations other than traditional port-of-entry. The origins and destinations can respond to “pushes” and “pulls” related to environmental, political, and economic changes⁵⁶. Frey⁵⁷ notes that metropolitan areas are more closely aligned with the labor market or Community concept and are potentially the most appropriate geographic units for examining internal migration patterns. Here, we can employ the ST tensor co-clustering algorithm to “uncover” the stable migration systems over the last twenty or so years, and our method also allows us to observe the temporal change in migration intensity due to fluctuations (e.g., labor market) over this period.

The US Census Bureau defines 384 metropolitan statistical areas (MSAs), representing one or more counties with at least one urbanized area of 50,000 or more inhabitants. The 384 MSAs range from 20 Million (New York City–Newark–Jersey City) to 58,000 (Carson City, NV MSA). These 384 metropolitan areas represent 86% of the US population in 2020⁵³. In this vein, the important question is, can we find migration systems between these major economic regions from 1990 to 2018? We follow up on this question by asking how variable these Communities are over time regarding the intensity of their activities. Based on our sensitivity analysis (see the Online Appendix), we center our analysis on six major migration systems.

The ST tensor co-clustering method provides (i) a set of migration systems, (ii) a ranking of the core migration systems, and (iii) a temporal profile of the intensity of each migration system. At its crudest, the ST tensor co-clustering method provides indicators of the associations of each US county with the six migration systems (as described in Fig. 1). The number of systems (i.e., F in the model) is picked by observing the residuals between the low-rank representation and the complete tensor data—see more discussions in the Appendix. The low-rank decomposition aims to find the following representation of the spatial-temporal data $\mathcal{X}$:

$$\begin{aligned} \mathcal {X} \approx \sum _{f=1}^F {\textbf{a}}_f\circ {{\textbf{b}}}_f\circ {{\textbf{c}}}_f,~ \textbf{A}\ge \varvec{0}, \textbf{B}\ge \varvec{0}, \textbf{C} \ge {\varvec{0}}, \end{aligned}$$

where $\textbf{A}=[{\textbf{a}}_1,\ldots ,{\textbf{a}}_F]$ and $\textbf{B}$ and $\textbf{C}$ are defined identically. The nonnegativity constraints are added to the factor matrices to reflect their physical meaning (i.e., the level of involvement in different migration systems for $\textbf{A}$ and $\textbf{B}$ and the activity intensity for $\textbf{C}$). The columns of $\textbf{A}, \textbf{B}$ and $\textbf{C}$ are normalized to have unit Euclidean norms (details in the Online Appendix).

We focus on the top five to ten counties in each migration system as the probability of a county being in the system quickly approaches approximately zero in almost all cases below this threshold (details in the Online Appendix).

Metropolitan migration hubs: the case of Los Angeles–Long Beach MSA

Focusing on one of the six major migration systems, the Los Angeles–Long Beach MSA, we can observe some key regional relationships that extend beyond the LA metro. See Fig. 2 where the core migration system is Los Angeles–Long Beach MSA, Orange County MSA, San Diego MSA, and Riverside–San Bernardino MSA, all in California with a secondary set of MSAs in Arizona (Phoenix-Mesa), Illinois (Chicago), and Nevada (Las Vegas).

The pre-housing collapse and the great migration slowdown

By engaging with the temporal intensity measures pulled out of our method, we can see major economic events like the housing slowdown and its resulting impact on migration systems in the US. Looking at Fig. 2:(c) time-series plot we see that this method pulls out (in an entirely data-driven way) the same qualitative story as⁵⁸ which found that California lost most migrants to Arizona and Nevada in 2004–2005 and pre-housing collapse in 2010 a gain in migrants to Riverside–San Bernadino MSA from 2007 to 2009. We find a bump in migration intensity as US housing recovers starting in 2013/2014 (see⁵⁹) and a decline as the housing market began to heat up in 2016. Similar to Frey⁵⁷, and others⁶⁰, we can observe the “great slowdown” where internal migration declines over the whole US. Notice that we can find local variation in the system, such as that of the Los Angeles–Long Beach MSA, due to the impact of the housing market. Later, we can see a rebound in the early 2000s, followed by the most recent decline.

IRS data—California

California has been one of the most studied states for domestic migration⁶¹. Further, Frey⁶¹ Huang and Butts⁶², and others have shown that California is a high migration state with intense internal and cross-state effects. Here, we zoom into California from 1990 to 2018 and look at county-to-county migration within the state. We observe two core Communities that Southern California and Northern California define. This finding aligns with a colloquial notion of the North/South divide in California popular culture (see Fig. 3). A fundamental question in the migration systems framework is which systems represent the “core migration systems.” To understand how this method can illicit such information, we center our analysis on California because it is the largest state in the U.S., with approximately 40 million residents, and has two of the best-known regions within a state: Northern California (centered in San Francisco/Bay Area) and Southern California (centered in Los Angeles) to validate our method with.

We find a clear set of migration systems dominated by California’s Northern and Southern counties.

Our two major systems are: (i) Southern California—Communities 1 and 4 in Fig. 3a,d; and (ii) Northern California—Communities 4 and 6 in Fig. 3d,e) with Los Angeles County as the link between the two systems, Communities (b) and (e) in Fig. 3b,e.

Southern California migration system

The Southern California system has three distinct regions: Los Angeles, the Inland Empire (Riverside and San Bernardino counties), and San Diego (Fig. 3b,d). We can see in Community 1 that the Inland Empire and San Diego are receiving migrants from the Los Angeles area—we can interpret this as people moving from higher home prices (Los Angeles County) to lower housing costs (Inland Empire and San Diego). In Community 4, reciprocal migrations occur where people move around the greater Southern California region. Further, if we look at the temporal profiles (Appendix Fig. E.11), we can see that Community 4 has been largely deactivated recently, with Community 1 being the most active. This aligns with the recent rise in housing costs and correlates with the current housing crisis and growth in homelessness⁶³.

Northern California migration system

We can see two distinct migration systems for Northern California: one dominated by San Francisco, representing Silicon Valley (Fig. 3c), and a second one dominated by Sacramento (the capital of CA), representing the political capital of California (Fig. 3f). Next, we again look at the migration systems’ temporal profiles (Appendix Fig. E.11). We discover that in Community 3 (Appendix Fig. E.11c), the temporal intensity matches the crest of unemployment and subsequent decline in unemployment (see⁶⁴), which reinforces the idea of the importance of labor markets on internal migration. Focusing on the temporal profiles of these migration systems, we see that the core migration system (Community 1; Fig. 3) shows the general trend known as the “Great American Migration Slowdown” (coined by Frey^{58, p.1}). It is generally established that there has been a decline in internal migration since about the 1970s, with the slowdown picking up in the 1990s⁵³. From the figure, we also pick up the decline in unemployment from around 2010 to 2018 (see⁶⁴).

Linking northern and southern California

When we look at Communities 2 and 5 (Fig. 3b,e), we can see the link between Northern and Southern California centering around a suburb of Silicon Valley (Santa Clara County) which receives migrants from Los Angeles primarily, further, when we look at the temporal profiles, we see a big increase in movement from Southern California to the Bay Area, which correlates with the more recent tech boom.

Classifying the state into northern and southern California regions

Given these two systems, we might be interested in classifying the whole state as three Community systems using our method combined with a clustering algorithm (in this case, k-nearest neighbors;⁶⁵). In Fig. 4, we can see the Northern California versus Southern California split, with Los Angeles being Southern California’s core origin/destination system. A noticeable implication is that Santa Barbara County is classified as part of the Southern California system. There is active research on where people divide Southern and Northern California cognitively⁶⁶ with Santa Barbara typically being the dividing line of Northern and Southern California in regional identification tasks^66,67. This places further evidence of the importance of the Santa Barbara divide and whether it should be placed in Northern or Southern California.

IRS data—Hurricane Katrina, 2005

One particularly compelling aspect of the ST tensor co-clustering method is the ability to detect the activity intensity changes in the migration system in response to external shocks. In 2005, Hurricane Katrina’s effect on the City of New Orleans provided an extreme example of how severe weather events can change the demographics of a major city^68,69. In this section, we look at the migration system between New Orleans Parish and all other counties within Louisiana, as well as a node representing all the combined counties outside the state. Pre-disaster is defined as before 2004, and recovery as 2007–2009⁶⁹. In Fig. 5 and Appendix Fig. E.12, we can distinctly see the effects of Hurricane Katrina on the migration systems in Louisiana and the City of New Orleans specifically.

Migration system activated by Hurricane Katrina

In Fig. 5 (Community 1), we can see the migration out of New Orleans Parish. In this case, the system engaged after the natural disaster (Community 1) differs from the one engaged for recovery (Community 2, though they share some counties in common). The key finding here is that we can see exactly which migration systems are activated for the displacement event (Hurricane Katrina) and which are activated for return migration (recovery). Further, we can see that while there is overlap in the counties, it is not the same Communities involved in the recovery—suggesting that some of the recovery is driven by new migration to the area. This can be seen in Fig. 5.

Migration recovery system from Hurricane Katrina

In Appendix Fig. E.12 (Community 2), we can see the recovery of the migration systems, which has been defined as 2009⁶⁸. This method allows us to see this change in migration system activation due to exogenous shock and to see how people activate different migration systems depending on a particular event like Hurricane Katrina.

Comparison with alternative methods: case of Hurricane Katrina

A typical strategy for dynamic graph clustering is to apply a classic ‘static’ Community detection technique designed for graphs without the temporal dimension (such as the walktrap method³¹) to the graphs collected at different time-points separately and look at how the resulting system changes. Here, we use a walktrap algorithm on the Louisiana domestic migration network split into pre- and post-Hurricane Katrina. We then compare the ST co-clustering method with the walktrap method (see details in Materials and Methods). The walktrap method is a random walk-based Community detection algorithm. It provides hard, nonoverlapping clustering of the counties based on their migration patterns. However, the method does not tell which migration system exhibits higher activity levels as revealed in our method; see (Fig. 6). To better compare with the ST co-clustering method, we observe the cluster (i.e., a migration system) containing New Orleans, the prominent city and the one hit heavily by Katrina (Fig. 6). In Fig. 7, we look at the top 5 origin (sender) and destination (receiver) counties found by the ST tensor co-clustering method. Note that walktrap does not offer such sender/receiver information. Next, we observe differences in cluster patterns, with Communities 1 and 2 producing the closest to the walktrap solution (see Fig. 6). According to the walktrap method, the primary system in New Orleans shrinks by half between pre- and post-Hurricane Katrina, representing the changes in the system. However, in the ST co-clustering method, we see the local cluster around New Orleans with only East Baton Rouge Parish being in the system, suggesting that the evacuation was much closer to the disaster center than the walktrap method would suggest.

International migration data—global migration systems

Identifying migration systems in the international context remains an open problem in the field. Several works have posited that there should exist international migration systems^1,2,3,4 and have provided a set of “general principles” of such systems rather than analytic approaches⁸. Recent work by Abel et al.⁸ has applied static Community detection methods to the international migration data provided by Azose and Raftery⁴⁸ and updated by Abel et al.⁸ to demonstrate the change in migration systems over time. In⁸, the migration systems were found year-by-year by repeatedly applying the Community detection method to each year’s data.

Here, we apply the ST tensor co-clustering method to the same data. This clustering results in essential differences in output and understanding of the migration systems. First, ST tensor co-clustering produces a set of migration systems that exist over the entire period (though the intensity of their importance varies over time) and thus represent a cleaner set of migration systems than those in⁸. By fitting Community detection methods year by year, Abel et al.⁸ cannot guarantee the persistence of a given system. This means that the method does not ensure the discovery of clusters (networks) of countries with similar spatial interactions and varying activity intensity over time. Thus, the ST co-clustering model produces a better representation of the migration system, especially under what Kritz and Zlotnik³ described as “network[s] consisting of sets of the concept of dynamic stability” (see also³).

We explore six major Communities consisting of the top 10 origin and destination locations from 1990 to 2015 (Fig. 8 and Appendix Fig. E.13). This set was chosen based on the least square fit of the data (see the Online Appendix for details). The first migration system (Appendix Fig. E.13; Community 1) is dominated by the relationship between Mexico (origin) and the United States (destination) and is characterized by countries sending migrants to the United States. The second migration system (Appendix Fig. E.13; Community 2) is characterized by Eastern European migration, with Russia dominating both the origin and destination of the system. The third migration system (Appendix Fig. E.13; Community 3) is characterized by India, Bangladesh, and other Southern and South Eastern Asian countries. The fourth migration system (Appendix Fig. E.13; Community 4) is characterized by the United States, China, and India as the largest origin countries with destination countries Mexico, South America, Asia, and Western Europe, and Russia as the primary set. The fifth migration system (Appendix Fig. E.13; Community 5) is dominated by the Middle East, with Syria being the largest origin country. Last, the sixth migration system (Appendix Fig. E.13; Community 6) is also in the Middle East and comprises Iran, Pakistan, and Afghanistan.

We have visualized these Communities with world maps in Fig. 8. In Appendix Fig. E.14, we provide examples of the temporal profile and the spatial interaction matrix of the top 10 countries in migration systems 1–6. The matrix is produced by instantiating $\widetilde{\textbf{a}}_f(i)\widetilde{\textbf{b}}_f(j)$ as its (i, j)th element, where $\widetilde{\textbf{a}}_f \in \mathbb{R}^{10}$ and $\widetilde{\textbf{b}}_f\in \mathbb {R}^{10}$ are the sub-vectors of ${\textbf{a}}_f$ and ${\textbf{b}}_f$ holding the top-10 strongest elements, respectively. The temporal profile allows us to see major events, such as the end of the Syrian occupation of Lebanon in 2005 (Appendix Fig. E.14; Community 4). Altogether, the results are similar to those of⁸ with major migration systems centering around the United States with distinct European and Eastern European systems and Middle Eastern and Southern Asia systems. We also find evidence of change in the importance of the United States-dominated migration system from migration system 1 (Community 1; Appendix Fig. E.14) to migration system 3 (Community 3; Appendix Fig. E.14) in 2005–2010, which is similar to what was described in⁸. However, we can see that this change started at the beginning of the period (Community 3; Appendix Fig. E.14), with the peak change occurring in 2005–2010. Further, because we have a stable set of countries in our system, we can see precisely how and when one system versus another becomes dominant from the temporal profile change of two US-dominated systems.

Comparison with prior works: world migration systems

The work in Abel et al.⁸ employed infoMap¹⁴ to demonstrate the efficacy of finding migration systems using static network clustering techniques. Both methods generate a distinct North American cluster, European and Asian cluster; however, the technique of in⁸ does not provide distinct origin and destination clusters or highlight which Communities are more significant over a given time period. We have recreated these clusters in Appendix Fig. E.9 and isolated the Communities that include either the United States or China (Appendix Fig. E.10). Again, one distinct difference is that the classic methods provide a complete partitioning and focus on how these Communities evolve. In contrast, our method provides a quantitative measure and order of how “important” a Community is and how stable the Community is over time. Both methods detect the US–Canada–Mexico cluster, but our method also detects separately the China–US origin-destination cluster, where we can distinguish between whether the origin or destination is driving the relationship (see Appendix Fig. E.10 in comparison with Appendix Fig. 8). Altogether, the ST Co-Clustering method provides a distinct and valuable solution that differs from the current focus on change. Instead, ST Co-Clustering aims to uncover stable migration systems over time, as hypothesized in the literature (see Figs. 9, 10 demonstrating this difference). It further allows the ability to rank/prioritize these systems and distinguish between origin and destination clusters.

Syrian occupation of Lebanon Last, another key difference is our ability to spot major changes (e.g., the end of the Syrian occupation of Lebanon in 2005, which can be seen clearly in Appendix Fig. E.13e)—which does not appear in the work of Abel et al.⁸.

Discussion

Unlike historical methods^{70,71,72,73,74,75}, we provide a holistic framework for dealing with space and time in migration patterns for understanding migration systems. Our method also allows for improvements on Pandit’s⁷⁶ migration system. Pandit sets out a “subsystems” and migration “typologies” framework, where subsystems represent high interconnectivity and low levels of interchange with other subsystems, and migration typologies represent clustering by the origin and/or destination of migrants. Thus, the relationship is not the interconnectedness between areas but rather the relative similarity of their flows within spatial units. This work provides a novel way to differentiate subsystems and typologies. Specifically, our method allows for measuring the change in typologies over a given spatial region (e.g., the effects of Hurricane Katrina) and a general measure of subsystems that is responsive to the temporal nature of migration systems, unlike historical methods like⁷⁶ that rely on simple principal component analysis (PCA). In general, we can view this as the natural extension of what has been done historically in the empirical system, finding literature in the social sciences but accurately taking into account time and space in a way that is not currently done in the field. Further, it allows researchers interested in migration systems to go beyond simple changes in static snapshots of migration systems to holistic models of temporal stability and large-scale shifts in migration typologies.

Limitations: This method definitionally pulls out the spatial/temporal pattern and does not quantify “change” per se in the system. For example, if you want new systems over a range of years, you need to discretize the use of this method so that it would act more like the infomap or walktrap solutions we discuss as comparisons; however, even in that context, this method performs quite well (see our IRS Data—Hurricane Katrina: Louisiana comparison with walktrap algorithm or International Migration Data—Global Migration Systems comparison with the infomap algorithm). Overall, this method performs very well in terms of the intuitive definition of the migration system discussed in the literature. This method generally requires complete network data with multiple relations (time being the one focused on in this article). Future work will look to applying this method to sampled network data, but it is an ongoing research problem.

In summary, our findings have five crucial implications: (i) empirically derived migration systems can be established as stable over time and space (domestic and international migration); (ii) we can correctly derive expected migration systems across the US (e.g., Northern and Southern California) and in international contexts; (iii) we can detect exogenous shocks to the migration system (e.g., Hurricane Katrina or the end of Syrian occupation of Lebanon); (iv) we can establish changes in migration systems over time (e.g. Syrian refugee crisis); and (v) we provide a novel approach to dynamic community detection that focuses on stable clusters over time rather than change in clusters over time. Beyond the purely descriptive, these quantitative data-driven methods have the potential to improve population forecasting⁷⁷, as Andris et al.⁷⁸ demonstrated the usefulness of migration clusters in predicting future migration flows or potentially could be integrated in formal demographic models such as those used in Massey et al.⁷⁹. Further, these methods could be employed to build complex statistics for exponential random graph models, which have been applied successfully to migration networks (see, for example,⁶⁰). Tensor co-clustering is powerful for finding spatio-temporal clusters in networks like migration systems, and it also has strong potential for broader impact in the social sciences beyond migration systems in areas such as partisan politics or alliance formation.

Further, this method has the potential to change how we think about community detection in the larger social network and network science literature—allowing us to consider not just two dimensions (a typical social network) but k-dimensions for many multiplex⁸⁰ relations (e.g., Friendship, Acquaintanceship, Kinship, Job Leads, homelessness over a single set of actors), not just time. This larger scientific endeavor under multiplexity is of general interest to social networks and the social science community.

Methods

Datasets

IRS data The IRS migration data are created in the following manner: (1) taxpayer identification numbers (TINs) are used to match tax returns in consecutive years; (2) matched tax returns where migrant returns are defined as those that do not match the state or county of residence in consecutive years; (3) total counts of tax returns and tax exemptions—effectively households and individuals respectively—and the total adjusted gross income or AGI contained in the migrant and non-migrant returns are aggregated up to the state and county levels^50,51,52. Four major limitations⁵⁰ of this data are discussed in the, which include: (1) by definition, these data do not include those who do not file a tax return. This group is disproportionately elderly and/or poor⁵⁰. (2) The data is limited to aggregate counts of the county (or state) data on three variables: (i) total counts of migrant and non-migrant returns (i.e., households), (ii) exemptions (i.e., individuals), and (iii) AGI. (3) There is a methodological change between the 1990–2011 data and the 2012–2018 data⁸¹. In the 2011–2012 tax year, the data preparation shifted from the US Census Bureau to the IRS, which expanded the window for returns included in the estimates to go through December rather than September. (4) The last criticism is that for privacy reasons, the county-to-county migration flows involving less than ten households are obscured⁸¹. This was increased to 20 in the 2011–2018 periods⁸¹.

Dewaard et al.⁸¹ describe the limitations of using the combination of the 1990–2018 period due to processing standards change from US Census Bureau to IRS workers. Nonetheless, because the ST tensor co-clustering method works in a low-rank approximation manner analogous to the principal component analysis (PCA) for matrix data, such measurement error-induced noise is not expected to cause visible issues. Further, to evaluate the robustness of this assumption, we have done a series of tests (available in the Online Appendix), and no major red flags have appeared. So while⁸¹ does caution against this practice, we find our procedure robust to the issues discussed.

Alternative migration data in the US lacks temporal and spatial resolution for such an analysis. For example, there exists one-year migration estimates from the 2000 US Census long form (1 year of data) and five-year interval estimates from the American Community Survey (ACS) from 2010 to 2019 (i.e. 2–3 years of data). However, neither of these estimates provides temporal resolution of the IRS data. Many county estimates are suspect because the ACS surveys do not have a large enough sample in any given year to make estimates below the state level.

International migration data Estimation of international migration is a complex and important area of research. Recently, Abel and Cohen⁸² updated the migration estimates from Azose and Raftery⁴⁸, which cover 200 countries every five years from 1990 to 2015. This is the same data used in Abel et al.⁸ and the estimates we use in this article. The statistical method used to develop these estimates is built on the work by Azose and Raftery⁴⁸, which builds on the work by Abel^83,84.

This data is developed by first gathering data on country-level migration stocks, desegregated by country of birth based on administrative and United Nations records. Next, the researchers employ demographic balancing equations to harmonize the data between two periods. The idea is that any change in the migration stock must align with the component changes in fertility, mortality, and migration in a given country. These models use the fertility and mortality information from the United Nations World Populations Prospects to estimate the country-to-country migration flow data at five-year intervals. It is worth noting that Abel’s⁸³ original construction more closely follows report data, and Azose and Raftery’s⁴⁸ employ a Bayesian model to produce the final estimates. Abel et al.⁸ describe the following major limitations to these migration data in that statistical models do not reconcile migration reports of sending and receiving countries, which are often not in agreement. However, these harmonized and estimated migration data are generally considered the best migration data under current standards⁸².

Spatial-temporal co-clustering model

We consider three-way data $\mathcal{X}\in \mathbb {R}^{I\times J\times K}$, where $\mathcal{X}(i,j,k)$ represents the number of individuals who moved from location i to location j in year k. We expect that migration occurs organically in systems largely driven by sociological, economic, and demographic factors, as well as major local events such as a natural disaster. The migration patterns are grouped through the “origin locations” (i.e., locations where people move from) and “destination locations” (i.e., where people move to) and the temporal pattern of this movement. Such a data set can be represented as in a multi-way tensor co-clustering framework⁴⁷. Specifically, we model $\mathcal{X}$ (the origin by destination by time array) as the following decomposition:

$$\begin{aligned} \mathcal{X} \approx&\sum _{f=1}^F {\textbf{a}}_f\circ {\textbf{b}}_f \circ {\textbf{c}}_f, \end{aligned}$$

(1)

where $\circ$ denotes the outer product, i.e.,

$$\begin{aligned} \left[{\textbf{a}}_f\circ {\textbf{b}}_f\right]_{i,j} ={\textbf{a}}_f(i){\textbf{b}}_f(j),~ [ {\textbf{a}}_f\circ {\textbf{b}}_f \circ {\textbf{c}}_f]_{i,j,k} ={\textbf{a}}_f(i){\textbf{b}}_f(j){\textbf{c}}_f(k). \end{aligned}$$

Here, ${\textbf{a}}_f\circ {\textbf{b}}_f\circ {\textbf{c}}_f$ represents the fth co-cluster – a migration system over time in our context. The vector ${\textbf{a}}_f$ indicates the membership (or the degree of association) of the I origin locations with co-cluster f. For example, ${\textbf{a}}_f(i)=0$ means that county i is not in the fth migration system. The ${\textbf{b}}_f$ vector is defined similarly for the destination locations. Note that ${\textbf{a}}_f\circ {\textbf{b}}_f={\textbf{a}}_f{\textbf{b}}_f^T$ is a rank-one matrix and defines a bipartite clique (i.e., fully connected bipartite sub-network) over the origin-destination network. The vector ${\textbf{c}}_f$ scales the clique over time, which can be regarded as the clique’s temporal signature. Intuitively, ${\textbf{c}}_f(k)$ being a large value means that the fth migration system has intense migration activities at the kth year.

The model in (1) is the so-called canonical polyadic decomposition (CPD) of third-order tensors if F is the smallest integer that makes (1) hold exactly. In such cases, F is referred to as the tensor rank. Under our hypothesis, finding the CPD expression of the migration data can reveal major migration co-clusters and their activity levels over time.

The key advantage of CP decomposition is that its rank-one components are unique and can thus be better interpreted. This is to be contrasted with bilinear (matrix) factor analysis methods, which do not produce unique rank-one components. Taking SVD, for example, and absorbing the singular values into the left and right matrix factors, we can obtain another equivalent decomposition of the given low-rank matrix. The reason that SVD itself is unique is that we insist on the orthogonality of the singular vectors. However, the ‘true’ underlying components we seek in applications are rarely orthogonal; thus, SVD fails to unravel them.

Owing to the inherent uniqueness of CP decomposition, we cannot enforce its components to be orthogonal, as the true generating latent factor matrices are not orthogonal. The result is that the variance explained by the sum of CP components is not the sum explained by the individual components, so we cannot talk about the variance explained by a single component in isolation, as in SVD. However, we can extract a set of F principal CP components, which best explain the given data. Because they are unique, there is no ambiguity in visualizing them, as is the case with the matrix.

To be more precise, the co-clustering algorithm tackles the following optimization problem:

$$\begin{aligned} \mathop {\textrm{minimize}}\limits _{\textbf{A}, \textbf{B}, \textbf{C}}&\quad \left\Vert \mathcal{W} \circledast \left( \mathcal {X} - \sum _{f=1}^F {\textbf{a}}_f\circ {\textbf{b}}_f\circ {\textbf{c}}_f \right) \right\Vert _F^2, \nonumber \\ \text {subject to}\quad&~\textbf{A}\ge \textbf{0}, \textbf{B}\ge \textbf{0}, \textbf{C} \ge \textbf{0}, \end{aligned}$$

(2)

where $\textbf{A}=[{\textbf{a}}_1,\ldots ,{\textbf{a}}_F]$, $\textbf{B}=[{\textbf{b}}_1,\ldots ,{\textbf{b}}_F]$, $\textbf{C}=[{\textbf{c}}_1,\ldots ,{\textbf{c}}_F]$, $\mathcal{W}\in \mathbb {R}^{I\times I\times K}$ is a weight tensor such that

$$\begin{aligned} \mathcal{W}(i,i,k)=0,~\forall i,~\forall k,~~~\mathcal{W}(i,j,k)=1,~\forall k,~\forall i \ne j, \end{aligned}$$

and $\circledast$ denotes the Hadamard product. The nonnegativity constraints imposed on $\textbf{A}$, $\textbf{B}$ and $\textbf{C}$ reflect their physical interpretations. The weighting discards the diagonal entries in each slab of the data tensor since entries like $\mathcal{X}(i,i,k)$ always dominate in magnitude. Still, they represent static residents in county i and do not encode movements.

Algorithm, software, and hyperparameter selection The formulation in (2) entails a special tensor completion problem. Many off-the-shelf algorithms have been designed to handle this problem and its variants; see^47,85,86. In this work, we employ the well-optimized and freely available Tensorlab software toolbox⁸⁷ to solve the formulated problem in (2). Tensorlab is a Matlab toolbox that is widely used in the signal and data analytics Community. The software has a suite of flexible functions that can deal with plain-vanilla tensor decomposition and tensor decomposition with multiple constraints, e.g., nonnegativity, sparsity, and smoothness. The software can also easily handle missing values. In a nutshell, tensorlab treats a wide range of tensor decomposition problems as a nonlinear least squares problem and recasts these problems into a form that can be dealt with using a Gauss-Newton (GN) nonlinear programming framework. The subproblems in the GN framework are handled using conjugate gradient, which can effectively exploit the multilinear structure of tensor problems to develop lightweight updates. A tutorial of tensorlab’s basic framework and updating rules can be found in Ref⁸⁵. Users unfamiliar with tensors and nonlinear programming may also use tensorlab as a black box.

The proposed method selects only one hyperparameter, the model’s tensor rank, corresponding to the number of migration systems. The tensor rank is analogous to the number of principal components in the matrix principal component analysis (PCA) case. For real-life data, due to noise and modeling error, the data tensors tend to have high (or full) rank. However, the tensor’s “useful signal part” is believed to have a low rank due to the high correlations across different modes. Unlike matrix PCA, incrementally extracting F components from the tensor one by one does not ensure that one will extract the F best (most significant) rank-one components from the data—due to the lack of orthogonality of the latent factors. Furthermore, extracting the principal CP component is NP-hard in general; see⁴⁶ and references therein. Nevertheless, we do have good software tools such as tensorlab that work very well in practice, and when the latent factors $\textbf{A}$, $\textbf{B}$ and $\textbf{C}$ are nonnegative and sparse. Even incremental extraction often produces the most prominent F components, as observed in Ref⁴⁷. In our case, the latent factors are indeed nonnegative and sparse, and thus, we have good reason to believe that the $F=6$ migration systems extracted from both datasets are the most prominent ones. In the Online Appendix, we present evidence supporting our choice of this single hyperparameter, i.e., setting $F=6$. It turns out that further increasing F does not change the first six Communities significantly, which validates our postulate.

Related works

The co-clustering idea was first introduced in Ref⁴⁷ for discovering Communities from email networks over time. Tensor-based co-clustering was also found useful in analytical chemistry⁸⁸. Variations of tensor co-clustering were recently used for football team clustering, Wikipedia user clustering, and autonomous systems analysis⁸⁹. In terms of migration data analysis, a short workshop paper presented preliminary results of using tensor models to discover the most significant migration clique spatio-temporal migration data. There, instead of using optimization-based low-rank decomposition as in our work, a Bayesian inference framework was used, where the migration counts were modeled as drawn from Poisson distributions and the factor matrices were given Gamma priors⁹⁰. The Bayesian nature of the work in Ref⁹⁰ may make the method heavily dependent on priors, which are not known for real-world data. Non-parametric approaches and those that use as few assumptions and parameters as possible are preferable for exploratory analysis.

Comparison with dynamic approaches to community detection

Comparison methods: Walktrap To detect any changes in the migration pattern before and after Hurricane Katrina, we construct an aggregated pre-Katrina migration matrix and an aggregated post-Katrina migration matrix. More precisely, let $\textbf{W}_{\text {pre}} =\sum _{i=1990}^{2004}\textbf{W}_i-\text {diag}\left( \sum _{i=1990}^{2004}\textbf{W}_i\right)$ denote the aggregated pre-Katrina weight matrix, where $\textbf{W}_i$ denotes the weight matrix associated with the i-th observation period and $\text {diag}\left( \sum _{n=1990}^{2004}\textbf{W}_i\right)$ is a diagonal matrix that holds the diagonal elements of $\sum _{i=1990}^{2004}\textbf{W}_i$ on its diagonal. Note that we do not have details on migration patterns within the county; thus, the diagonal elements of $\textbf{W}_{\text {pre}}$ are set to zero. We construct the aggregated post-Katrina migration matrix ($\textbf{W}_{\text {post}} =\sum _{i=2006}^{2018}\textbf{W}_i-\text {diag}\left( \sum _{i=2006}^{2018}\textbf{W}_i\right)$) analogously. The clustering method Walk Trap proposed in Ref³¹ and applied on $\textbf{W}_{\text {pre}}$ and $\textbf{W}_{\text {post}}$ yields the Communities depicted in Fig. 6. The Community containing New Orleans is depicted in Fig. 6.

Comparison method: InfoMAP The InfoMAP¹⁴ Community detection method is an information theoretic-based Community detection technique. It receives an adjacency matrix representing a directed and weighted network (e.g., our international migration data). It produces a list of hard-clustering (i.e., full partitioning) Communities with the goal of optimally compressing information flow. InfoMAP is directly applied to the international migration data for each period. The corresponding results for the five different periods are shown in Fig. E.9. In addition, to reflect changes in community structure over time, we focus on communities, including the US and China, as these are likely the most important members. The results are depicted in Fig. E.10.

Data availability

All data and code is available through the Harvard Dataverse at https://doi.org/10.7910/DVN/EGFDU3.

References

Massey, D. S., Arango, J., Hugo, G., Kouaouci, A. & Pellegrino, A. Worlds in Motion: Understanding International Migration at the End of the Millennium: Understanding International Migration at the End of the Millennium (Clarendon Press, 1999).
Book Google Scholar
Bakewell, O. Relaunching migration systems. Migr. Stud. 2, 300–318 (2014).
Article Google Scholar
Kritz, M. M. et al. International migration systems: A global approach (Oxford University Press, OXford, 1992).
Google Scholar
Mabogunje, A. L. Systems approach to a theory of rural-urban migration. Geogr. Anal. 2, 1–18 (1970).
Article Google Scholar
Massey, D. S. et al. A missing element in migration theories. Migr. Lett. 12, 279–299 (2015).
Article Google Scholar
Yang, Z., Algesheimer, R. & Tessone, C. J. A comparative analysis of community detection algorithms on artificial networks. Sci. Rep. 6, 1–18 (2016).
Google Scholar
Girvan, M. & Newman, M. E. Community structure in social and biological networks. Proc. Natl. Acad. Sci. 99, 7821–7826 (2002).
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Abel, G. J., DeWaard, J., Ha, J. T. & Almquist, Z. W. The form and evolution of international migration networks, 1990–2015. Popul. Sp. Place 27, e2432 (2021).
Article Google Scholar
Expert, P., Evans, T. S., Blondel, V. D. & Lambiotte, R. Uncovering space-independent communities in spatial networks. Proc. Natl. Acad. Sci. 108, 7663–7668 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
DeWaard, J., Kim, K. & Raymer, J. Migration systems in Europe: Evidence from harmonized flow data. Demography 49, 1307–1333 (2012).
Article PubMed Google Scholar
Clauset, A. Finding local community structure in networks. Phys. Rev. E 72, 026132 (2005).
Article ADS Google Scholar
Boccaletti, S., Ivanchenko, M., Latora, V., Pluchino, A. & Rapisarda, A. Detecting complex network modularity by dynamical clustering. Phys. Rev. E 75, 045102 (2007).
Article ADS CAS Google Scholar
Ahajjam, S., El Haddad, M. & Badir, H. A new scalable leader-community detection approach for community detection in social networks. Soc. Netw. 54, 41–49 (2018).
Article Google Scholar
Rosvall, M. & Bergstrom, C. T. Maps of random walks on complex networks reveal community structure. Proc. Natl. Acad. Sci. 105, 1118–1123 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Slater, P. B. A two-stage algorithm for extracting the multiscale backbone of complex weighted networks. Proc. Natl. Acad. Sci. 106, E66–E66 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Sorichetta, A. et al. Mapping internal connectivity through human migration in malaria endemic countries. Sci. Data 3, 1–16 (2016).
Article Google Scholar
Butts, C. T. Revisiting the foundations of network analysis. Science 325, 414–416 (2009).
Article ADS MathSciNet CAS PubMed Google Scholar
Vespignani, A. Twenty years of network science. Nature 558, 528–529 (2018).
Article ADS CAS PubMed Google Scholar
Clauset, A., Newman, M. E. & Moore, C. Finding community structure in very large networks. Phys. Rev. E 70, 066111 (2004).
Article ADS Google Scholar
Lazer, D. M. et al. Computational social science: Obstacles and opportunities. Science 369, 1060–1062 (2020).
Article ADS CAS PubMed Google Scholar
Wasserman, S. et al. Social Network Analysis: Methods and Applications (Cambridge University Press, 1994).
Book Google Scholar
Barabási, A.-L. Network science. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 371, 20120375 (2013).
Article ADS Google Scholar
Airoldi, E. M., Blei, D., Fienberg, S. & Xing, E. Mixed membership stochastic blockmodels. Adv. Neural Inf. Process. Syst. 21 (2008).
Karrer, B. & Newman, M. E. Stochastic blockmodels and community structure in networks. Phys. Rev. E 83, 016107 (2011).
Article ADS MathSciNet Google Scholar
Good, B. H., De Montjoye, Y.-A. & Clauset, A. Performance of modularity maximization in practical contexts. Phys. Rev. E 81, 046106 (2010).
Article ADS MathSciNet Google Scholar
Fortunato, S. & Barthelemy, M. Resolution limit in community detection. Proc. Natl. Acad. Sci. 104, 36–41 (2007).
Article ADS CAS PubMed Google Scholar
Lancichinetti, A. & Fortunato, S. Limits of modularity maximization in community detection. Phys. Rev. E 84, 066122 (2011).
Article ADS Google Scholar
Newman, M. E. Finding community structure in networks using the eigenvectors of matrices. Phys. Rev. E 74, 036104 (2006).
Article ADS MathSciNet CAS Google Scholar
Wakita, K. & Tsurumi, T. Finding community structure in mega-scale social networks, 1275–1276 (2007).
Blondel, V. D., Guillaume, J.-L., Lambiotte, R. & Lefebvre, E. Fast unfolding of communities in large networks. J. Stat. Mech. Theory Exp. 2008, P10008 (2008).
Article Google Scholar
Pons, P. & Latapy, M. Computing Communities in Large Networks Using Random Walks 284–293. (Springer, 2005).
Google Scholar
Raghavan, U. N., Albert, R. & Kumara, S. Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E 76, 036106 (2007).
Article ADS Google Scholar
Rosvall, M., Axelsson, D. & Bergstrom, C. T. The map equation. Eur. Phys. J. Spec. Top. 178, 13–23 (2009).
Article Google Scholar
Bonchev, D. D. & Rouvray, D. Complexity in Chemistry, Biology, and Ecology (Springer, 2007).
Google Scholar
Almquist, Z. W. & Butts, C. T. Logistic network regression for scalable analysis of networks with joint edge/vertex dynamics. Sociol. Methodol. 44, 273–321 (2014).
Article PubMed PubMed Central Google Scholar
Aylaj, B., Bellomo, N., Gibelli, L. & Knopoff, D. Complexity of Human Crowds and Modeling Strategy 1–15 (Springer, 2021).
Google Scholar
Aguiar, M., Dosi, G., Knopoff, D. A. & Virgillito, M. E. A multiscale network-based model of contagion dynamics: Heterogeneity, spatial distancing and vaccination. Math. Models Methods Appl. Sci. 31, 2425–2454 (2021).
Article MathSciNet CAS Google Scholar
Bellomo, N., Dosi, G., Knopoff, D. A. & Virgillito, M. E. From particles to firms: On the kinetic theory of climbing up evolutionary landscapes. Math. Models Methods Appl. Sci. 30, 1441–1460 (2020).
Article MathSciNet Google Scholar
Degond, P., Appert-Rolland, C., Moussaid, M., Pettré, J. & Theraulaz, G. A hierarchy of heuristic-based models of crowd dynamics. J. Stat. Phys. 152, 1033–1068 (2013).
Article ADS MathSciNet Google Scholar
Rogers, A. Applied Multiregional Demography: Migration and Population Redistribution (Springer, 2015).
Book Google Scholar
Cai, N., Ma, H.-Y. & Khan, M. J. Agent-based model for rural-urban migration: A dynamic consideration. Phys. A Stat. Mech. Appl. 436, 806–813 (2015).
Article MathSciNet Google Scholar
Martinet, L.-E. et al. Robust dynamic community detection with applications to human brain functional networks. Nat. Commun. 11, 1–13 (2020).
Article Google Scholar
Nguyen, N. P., Dinh, T. N., Shen, Y. & Thai, M. T. Dynamic social community detection and its applications. PLoS ONE 9, e91431 (2014).
Article ADS PubMed PubMed Central Google Scholar
Cazabet, R., Rossetti, G. & Amblard, F. Dynamic community detection (2017).
Haynes, K. E. & Fotheringham, A. S. Gravity and Spatial Interaction Models (Regional Research Institute, West Virginia University, 2020).
Sidiropoulos, N. D. et al. Tensor decomposition for signal processing and machine learning. IEEE Trans. Signal Process. 65, 3551–3582 (2017).
Article ADS MathSciNet Google Scholar
Papalexakis, E. E., Sidiropoulos, N. D. & Bro, R. From k-means to higher-way co-clustering: Multilinear decomposition with sparse latent factors. IEEE Trans. Signal Process. 61, 493–506 (2013).
Article ADS Google Scholar
Azose, J. J. & Raftery, A. E. Estimation of emigration, return migration, and transit migration between all pairs of countries. Proc. Natl. Acad. Sci. 116, 116–122 (2019).
Article ADS MathSciNet CAS PubMed Google Scholar
Abel, G. J. & Sander, N. Quantifying global international migration flows. Science 343, 1520–1522 (2014).
Article ADS CAS PubMed Google Scholar
Hauer, M. & Byars, J. IRS county-to-county migration data, 1990–2010. Demogr. Res. 40, 1153–1166 (2019).
Article Google Scholar
Gross, E. Internal revenue service area-to-area migration data: Strengths, limitations, and current uses. Stat. Income SOI Bull. 25, 159–160 (2005).
Google Scholar
Pierce, K. SOI migration data: A new approach: Methodological improvements for SOI’s United States population migration data, calendar years 2011-2012. Statistics of Income. SOI Bulletin 35 (2015).
Molloy, R., Smith, C. L. & Wozniak, A. Internal migration in the United States. J. Econ. Perspect. 25, 173–96 (2011).
Article Google Scholar
Frey, W. H. Internal Migration: What Does the Future Hold? 265–271 (Routledge, 2017).
Google Scholar
Greenwood, M. J. & Sweetland, D. The determinants of migration between standard metropolitan statistical areas. Demography 9, 665–681 (1972).
Article CAS PubMed Google Scholar
Plane, D. A., Henrie, C. J. & Perry, M. J. Migration up and down the urban hierarchy and across the life course. Proc. Natl. Acad. Sci. 102, 15313–15318 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Frey, W. H. Immigration, domestic migration, and demographic balkanization in America: New evidence for the 1990s. Population and Development Review 741–763 (1996).
Frey, W. The Great American Migration Slowdown (Brookings Institution, 2009).
Google Scholar
Schuetz, J. & Crump, S. The Housing Market and the COVID-19 Pandemic: Implications for Las Vegas, Phoenix, Riverside, Los Angeles, Orlando, and New Orleans (Brookings Mountain West, 2021).
Huang, P. & Butts, C. T. Rooted America: Immobility and segregation of the intercounty migration network. Am. Sociol. Rev. 88, 1031–1065 (2023).
Article Google Scholar
Frey, W. H. Immigration and internal migration “flight’’: A california case study. Popul. Environ. 16, 353–375 (1995).
Article Google Scholar
Huang, P. & Butts, C. T. California exodus? a network model of population redistribution in the united states. J. Math. Sociol. 48, 311–339 (2024).
Article MathSciNet PubMed Google Scholar
Almquist, Z. W., Helwig, N. E. & You, Y. Connecting continuum of care point-in-time homeless counts to United States census areal units. Math. Popul. Stud. 27, 46–58 (2020).
Article MathSciNet Google Scholar
Tan, D. California’s Safety Net in Recession and Recovery (Public Policy Institute of California, 2021).
Fukunaga, K. & Narendra, P. M. A branch and bound algorithm for computing k-nearest neighbors. IEEE Trans. Comput. 100, 750–753 (1975).
Article Google Scholar
Montello, D. R., Friedman, A. & Phillips, D. W. Vague cognitive regions in geography and geographic information science. Int. J. Geogr. Inf. Sci. 28, 1802–1820 (2014).
Article Google Scholar
Almquist, Z. W. & Butts, C. T. Predicting regional self-identification from spatial network models. Geogr. Anal. 47, 50–72 (2015).
Article PubMed Google Scholar
Fussell, E., Curtis, K. J. & DeWaard, J. Recovery migration to the city of New Orleans after Hurricane Katrina: A migration systems approach. Popul. Environ. 35, 305–322 (2014).
Article PubMed PubMed Central Google Scholar
DeWaard, J., Curtis, K. J. & Fussell, E. Population recovery in new Orleans after Hurricane Katrina: Exploring the potential role of stage migration in migration systems. Popul. Environ. 37, 449–463 (2016).
Article PubMed Google Scholar
Wilson, A. G. The use of the concept of entropy in system modelling. J. Oper. Res. Soc. 21, 247–265 (1970).
Article ADS Google Scholar
Snickars, F. & Weibull, J. W. A minimum information principle: Theory and practice. Reg. Sci. Urban Econ. 7, 137–168 (1977).
Article Google Scholar
Weidlich, W. & Haag, G. A dynamic phase transition model for spatial agglomeration processes. J. Reg. Sci. 27, 529–569 (1987).
Article CAS PubMed Google Scholar
Rogerson, P. & MacKinnon, R. D. Interregional migration models with source and interaction information. Environ. Plan. A 14, 445–454 (1982).
Article CAS PubMed Google Scholar
He, J. The regional concentration of china’s interprovincial migration flows, 1982–90. Popul. Environ. 24, 149–182 (2002).
Article Google Scholar
Plane, D. A. A systemic demographic efficiency analysis of us interstate population exchange, 1935–1980. Econ. Geogr. 60, 294–312 (1984).
Article CAS PubMed Google Scholar
Pandit, K. Differentiating between subsystems and typologies in the analysis of migration regions: A us example. Prof. Geogr. 46, 331–345 (1994).
Article Google Scholar
Raftery, A. E., Li, N., Ševčíková, H., Gerland, P. & Heilig, G. K. Bayesian probabilistic population projections for all countries. Proc. Natl. Acad. Sci. 109, 13915–13921 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Andris, C., Halverson, S. & Hardisty, F. Predicting migration system dynamics with conditional and posterior probabilities, 192–197 (IEEE, 2011).
Massey, D. S. & Zenteno, R. M. The dynamics of mass migration. Proc. Natl. Acad. Sci. 96, 5328–5335 (1999).
Article ADS CAS PubMed PubMed Central Google Scholar
Lee, K.-M., Kim, J. Y., Lee, S. & Goh, K.-I. Multiplex networks. Networks of networks: The last frontier of complexity 53–72 (2014).
DeWaard, J. et al. User beware: Concerning findings from the post 2011–2012 us internal revenue service migration data. Popul. Res. Policy Rev. 41(2), 437–48 (2021).
Article PubMed PubMed Central Google Scholar
Abel, G. J. & Cohen, J. E. Bilateral international migration flow estimates for 200 countries. Sci. Data 6, 1–13 (2019).
Article Google Scholar
Abel, G. J. Estimating global migration flow tables using place of birth data. Demogr. Res. 28, 505–546 (2013).
Article Google Scholar
Abel, G. J. Estimates of global bilateral migration flows by gender between 1960 and 20151. Int. Migr. Rev. 52, 809–852 (2018).
Article Google Scholar
Fu, X., Vervliet, N., De Lathauwer, L., Huang, K. & Gillis, N. Computing large-scale matrix and tensor decomposition with structured factors: A unified nonconvex optimization perspective. IEEE Signal Process. Mag. 37, 78–94 (2020).
Article Google Scholar
Acar, E., Dunlavy, D. M., Kolda, T. G. & Mørup, M. Scalable tensor factorizations for incomplete data. Chemom. Intell. Lab. Syst. 106, 41–56 (2011).
Article CAS Google Scholar
Vervliet, N., Debals, O., Sorber, L., Van Barel, M. & De Lathauwer, L. Tensorlab 3.0 (2016). URL https://www.tensorlab.net.
Bro, R., Papalexakis, E. E., Acar, E. & Sidiropoulos, N. D. Coclustering-a useful tool for chemometrics. J. Chemom. 26, 256–263 (2012).
Article CAS Google Scholar
Gujral, E., Pasricha, R. & Papalexakis, E. Beyond rank-1: Discovering rich community structure in multi-aspect graphs, 452–462 (2020).
Nguyen, H. & Garimella, K. Understanding international migration using tensor factorization, 829–830 (2017).

Download references

Funding

Zack W. Almquist, Tri Duc Nguyen, Mikael Sorensen, Xiao Fu, and Nicholas D. Sidiropoulos received partial support for this research through the ARO Award #W911NF-19-1-0407. Almquist also received partial support for this research from an NSF CAREER AWARD (# SES-214296), UW Population Health Initiative Tier 2 and 3 Grant, and a Eunice Kennedy Shriver National Institute of Child Health and Human Development research infrastructure grant, P2C HD042828, to the Center for Studies in Demography & Ecology at the University of Washington.

Author information

Authors and Affiliations

Departments of Sociology and Statistics, University of Washington, Seattle, WA, 98195, USA
Zack W. Almquist
Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, 97331, USA
Tri Duc Nguyen & Xiao Fu
Electrical and Computer Engineering, University of Virginia, Charlottesville, VA, 22904, USA
Mikael Sorensen & Nicholas D. Sidiropoulos

Authors

Zack W. Almquist
View author publications
Search author on:PubMed Google Scholar
Tri Duc Nguyen
View author publications
Search author on:PubMed Google Scholar
Mikael Sorensen
View author publications
Search author on:PubMed Google Scholar
Xiao Fu
View author publications
Search author on:PubMed Google Scholar
Nicholas D. Sidiropoulos
View author publications
Search author on:PubMed Google Scholar

Contributions

Z.W.A. designed the experiment, contributed to the analysis, wrote the manuscript, led the editing and revising process, and served as Co-Principal Investigator (Co-PI) for the funding. T.D.N. led the analysis and data cleaning. X.F. contributed to the experiment design and data analysis, assisted with the writing and editing process, and served as Co-PI for the funding. M.S. assisted with the analysis and data management and contributed to editing the manuscript. N.S. contributed to the experiment design, participated in manuscript writing, led the editing and revising process, and served as the Principal Investigator (PI) for the funding.

Corresponding author

Correspondence to Zack W. Almquist.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Almquist, Z.W., Nguyen, T.D., Sorensen, M. et al. Uncovering migration systems through spatio-temporal tensor co-clustering. Sci Rep 14, 26861 (2024). https://doi.org/10.1038/s41598-024-78112-z

Download citation

Received: 04 September 2024
Accepted: 28 October 2024
Published: 06 November 2024
Version of record: 06 November 2024
DOI: https://doi.org/10.1038/s41598-024-78112-z

Keywords

This article is cited by

Unveiling the social fabric through a temporal, nation-scale social network and its characteristics
- Jolien Cremers
- Benjamin Kohler
- Andreas Bjerre-Nielsen
Scientific Reports (2025)