Introduction

Most phytoplankton species near the ocean’s surface have a doubling time on the order of a day1. This rapid doubling time means phytoplankton quickly respond to changes in external forcings2. Coupled with the dynamic physical nature of the marine environment and grazing pressures with resulting loss rates similar to growth rates for phytoplankton3,4, the ocean has considerable spatial heterogeneity, or patchiness, at all scales5,6,7,8. This spatial heterogeneity supports increased diversity by creating different environmental conditions within close proximity, ecosystem stability through diversity and maintenance of seed populations, and efficient transfer of carbon up trophic levels9. The link between patchiness and the efficient transfer of carbon in the marine ecosystem is largely due to dense aggregations that make it energy efficient for zooplankton to graze, and patches of zooplankton make it feasible for planktivorous fish to meet their own energy requirements10,11,12. This spatial variability of phytoplankton and zooplankton, and the generally increasing patchiness as one moves up the trophic chain, has important implications for food availability and productivity of the entire ocean food web13,14,15.

The spatial patterns of the plankton ecosystem and their drivers have long been investigated. An early outline by Hutchinson discusses nutrient and temperature gradients, stochastic events such as storms, intra-species signaling, competition, and predation among processes that likely control these plankton patterns16. An updated synthesis by Levin described pattern and scale as the central problem in ecology, arguing for cross-scale investigations to understand the mechanisms and consequences of marine patchiness5. The scale-dependent spatial heterogeneity exhibited by both biology and physics has been a central focus of studies into pattern and scale in the ocean6, with much of this work attempting to divide spatial scales into domains of scale-independence and then investigating the dominant processes influencing biology in those domains17,18. Power spectral analysis, a Fourier decomposition where variance is partitioned into the contribution of bands of specific frequencies or wavenumbers (see19 for an early review on the implementation of the technique in ecology), is often used to describe phytoplankton patchiness and quantify this scale-dependence. The scaling behavior of a geophysical variable is typically represented as a log-log plot of variance against length scale. If that relationship is approximately linear (in log space) this represents a power law relation between variance and spatial scale where the exponent of that power law relationship is referred to as ‘the spectral slope’20. A flatter spectral slope indicates less of a decrease of variance with scale, i.e. more variability at smaller scales relative to larger scales, when compared to a steeper slope. Spectral slope has often been used synonymously with patchiness, but can also be interpreted as the cascade of variance across scales. While there are a range of issues with this analysis, such as discarding of phase information9,21, it is commonly used as an indicator of the distribution of spatial variance.

There is over a century of inquiry into the spatial heterogeneity of ocean physics. Energy in the ocean generally cascades down spatial scales. Mesoscale eddies spin off of large-scale circulation features. Energy then transfers from these major eddies to smaller turbulent whorls and finally to a small enough scale that energy is dissipated by viscosity into heat22,23,24. This turbulent cascade generally has a spectral slope of -5/3 in the inertial subrange (3D and a spatial range smaller than the major energy containing eddies, but larger than viscous eddies). While Kolomogrov initially proposed this cascade of variance for fully developed 3D turbulence in the ocean’s inertial subrange, O(10-0.001) meters depending on flow characteristics25, the -5/3 spectral slope has been shown to hold generally true for the inertial subrange of 2D ocean turbulence O(100-1) km as well26. Temperature generally follows the same cascade as turbulent kinetic energy. It has a source of variance at large scales, a dissipation of variance at small scales, and it is typically being mixed by similar turbulent processes27, though could have a variance source at smaller scales such as spatially heterogenous upwelling of cooler water.

The patchiness of phytoplankton was initially thought to be linked to turbulent stirring simply by physical processes28, with a general conclusion of consistent scaling behavior between physical and biological features18. This conclusion was drawn largely from its spectral slope that appeared similar to the -5/3 of 2D turbulence28,29. Modeling studies indicated that the spectral slope of phytoplankton was also influenced by the phytoplankton reproductive rate, diagnosed by a scaling break at larger spatial scales where the spectral slope became flatter than that of a passive tracer30,31. This led to a substantial amount of work, with many conflicting results, investigating the time and length scales where the contribution of biologically generated spatial variability (growth rates, grazing, etc) to the total planktonic spatial variability is significant vs where physics dominates6,7,9,25,32.

Abrahams7 suggested, and others have come to a similar conclusion via modeling33, that the spatial patterns in both plankton and physical tracers are a product of the timescales of their response to physically driven changes. For example, the equilibration of temperature to a heat flux is considerably slower than the response of phytoplankton to a nutrient flux. A small burst of cold nutrient-rich water thus leads to increased spatial heterogeneity in chlorophyll-a (chl-a) vs temperature, but as nutrients are exhausted and the temperature equilibrates over time the patchiness in chl-a and temperature would return to covarying32. Modeling work shows a steeper spectral slope in temperature compared to chl-a, attributing some of the slope variability to low frequency physical variations and the correlation of physical and biological variables to “the dominant role of physics in the spatial variability of plankton distribution“34.

Despite these theoretical advances, interpretations from observational studies remain inconsistent. Some work concludes that at <1 km scales biological processes are important for patch dynamics10,35, while other studies assert that at this scale spatial distributions of phytoplankton are dominated by turbulent diffusion. For example, this latter conclusion is advanced by an investigation that found a strong correlation between chl-a and temperature patchiness at scales below 5 km yet no correlation at scales from 5 km to 80km19. Instances spring up where phytoplankton are patchier than zooplankton36, conflicting findings that the patchiness of phytoplankton is between that of zooplankton and physical factors7,15,28. Work assessing the spectral slope of chl-a in the same region for multiple years found variable slopes from month to month25,37. After decades of work, it is not clear at which spatiotemporal scales biological processes are important, or even which processes are dominant in controlling patchiness in the ocean.

While biophysical ocean patchiness has been relatively well documented on local scales, its characteristics across the globe have not yet been analyzed. Therefore, there is little data to test new theories, constrain impacts on productivity and diversity, and simply describe the patchiness itself. Importantly, this limits our ability to parameterize global biogeochemical models that are computationally limited from explicitly modeling these scales in our global models of the carbon cycle.

Here we address this gap, and explore the global characteristics of biophysical patchiness using a unique dataset of in situ surface plankton-related optical properties from the S/V Tara program. All data comes from the Tara Pacific38 and Tara Microbiome missions which represent five years of observations spanning the Atlantic, Pacific, and Southern Oceans. This data was collected and processed consistently. After conservative quality control this dataset amounts to N = 661,552 (dt = 1 minute, dx ~ 200 m) measurements used in this analysis (Fig. 1). As we aim to discern between patterns of physical and biological patchiness, and specifically to test if patchiness in biology is explained by patchiness in physics, the measurements are divided into two corresponding groups of variables, with the physical domain being represented by temperature, salinity and density, and the biological domain by chl-a, particulate attenuation of light at 443 nm (cp(443)), and an optical proxy for mean particle size (γ)39. The latter two, because they represent other living (e.g. bacteria and micro-grazers) and non-living particles in addition to phytoplankton, are referred to as biogeochemical properties.

Fig. 1: The spatial extent and statistical distribution of the data used in this study.
figure 1

Maps show the geographic distribution of chlorophyll-a (chla, a) and sea surface temperature (temp, b). The data includes 661,552 minute binned measurements. Insets show the histograms of both variables. The four numbers in panel (a) correspond to the example data shown in Fig. 2.

Results and discussion

While the Fourier-based spectral slope is a useful and commonly used tool for characterization of patchiness, it is not suitable for analyzing unevenly sampled datasets. To overcome this limitation here we quantify patchiness via the dependence of the variance, V, on length scale, \(L\) following33. As found in previous work, V varies mostly linearly with \(L\) in log space (Fig. 2), indicating a power law relationship of the form: (1) V=LΓ, where Γ describes the slope (exponent) of the relationship in log space. In this work we focus on Γ, the “variance slope”. As with a power spectral density slope, a transect with a lower Γ would have relatively more variance retained at smaller length scales relative to larger scales compared to a transect with higher Γ, i.e. a lower Γ is patchier. This metric can be thought of qualitatively as a ratio of large scale to small-scale variance (Figure S1), with Γ increasing as the large-scale variance increasingly dominates the small. This parameter is similar and strongly correlated with the conventional Fourier-based spectral slope exponent (Figures S2 and S3), but our approach (from33) has the advantage of being applicable to data with gaps, as long as they are smaller than 20% of a transect. This parameter is non-parametric and the value is the same for a transect before and after a log-transformation, allowing robust comparison between values like chlorophyll-a, which is often log-distributed40, and values like temperature and salinity. Based on the observed relationship between Γ and the power spectral density slope (Figure S3) the classic -5/3 slope found in previous studies for the inertial subrange would correspond to a Γ value of 0.84.

Fig. 2: Chlorophyll-a (chla) and temperature (temp) from four different transects and the resulting cascades of variance.
figure 2

Examples of chla (green, solid) and temp (red, dashed) values along a leg (left panel in each pair) and the scale-dependent variance used to calculate the variance slope (Γ) derived from these legs (right panel in each pair). The variance slope can be seen in the legend for each panel on the right. The intercept indicated in each caption is a product of both the total amount of variance and the variance slope. The numbers of each panel (14) correspond to the locations in Fig. 1a.

Specifically in this work, variance was calculated on subsets of 500 samples over a set of 11 log-distributed windows from 3 to 500 samples, spanning ~0.6 km to 100 km. Both variance and window size were log-transformed, and we ran a least-squares fit to find the slope of this line, yielding the Γ of V=LΓ. In our analysis of global patterns of oceanic patchiness, we distinguish between Γ of the biological and biogeochemical variables chl-a, cp(443) and γ (Γchl-a, Γcp(443) and Γγ, respectively), and of the physical variables temperature, salinity and density (Γtemperature, Γsalinity and Γdensity, respectively).

Comparison between the geographic distributions of Γchla and Γtemperature reveals distinct differences between global patterns of physical and biological patchiness (Fig. 3). Notably, the geographic distributions of Γtemperature are more random with lower autocorrelation compared to Γchla at all spatial lags, while Γchla is consistently lower in oligotrophic regions, particularly the subtropical gyres, and higher in physically energetic regions such as western boundary currents, eastern boundary upwelling zones, and equatorial upwelling zones. In agreement with most previous observations, Γchla is, on average, lower than Γtemperature, indicating chl-a is patchier than temperature, though there are exceptions, particularly in some coastal regions (Figure S4). Γtemperature is fairly normally distributed around 1, while Γchla is slightly right-skewed with a mean around 0.5 (see inserts in Fig. 3). The -5/3 slope found in previous studies for the inertial subrange corresponds to a Γ value of 0.84 which is near, but slightly lower than the mean of our global Γtemperature calculations, 0.97.

Fig. 3: Geographic distributions of variance slope (Γ).
figure 3

Maps show the Γ of chlorophyll-a (Γchla, a) and temperature (Γtemperature, b). Insets in both panels show the histograms for these variables.

Interestingly, running the same analysis on satellite-based chl-a and temperature reveals contradicting results. Although satellite imagery has been successfully used to track individual chl-a patches41,42, and to quantify patchiness at local scales28, in the satellite-based analysis we find Γtemperature is lower than Γchla (i.e. the opposite of our in situ results), and the geographic patterns for both chl-a and temperature are different (Figures S5 and S6, respectively). We attribute this discrepancy primarily to issues of measurement sensitivity and atmospheric correction of ocean color remote sensing (from which chl-a is derived) which is applied as a pixel level correction, artificially injecting variance at small scales. This effect is likely to have the strongest impact in oligotrophic regions, where chlorophyll-a levels are particularly low, resulting in minimal absorption and reduced particulate scattering, and thus a low signal to noise ratio in the satellite retrieval. This could cause issues in both spatial and temporal analyzes43. This corroborates results from previous work that used a semivariogram approach to analyze variance in global satellite chl-a patterns from SeaWIFS44 which showed that unresolved variance was negatively correlated with mean chl-a and the authors suggested the high fraction of unresolved variance in the subtropical gyres might stem from atmospheric correction, though they did not differentiate between instrument noise and submesoscale variability. A more recent, but similar study compared the unresolved variance between MODIS-Aqua and SeaWIFS and found it was reduced, but not eliminated with MODIS45.

The global patterns we observe point to a difference between the governing processes underlying the formation of physical and biological ocean patchiness. This is emphasized when plotting the correlation between the different types of patchiness (Fig. 4), which we show both for all individual legs and averaged by Longhurst provinces, a partition of the global ocean into biogeochemical regions46 (shown in Fig. 5). The Γ of biological variables are correlated globally, even between chl-a and γ, the mean particle size (Fig. 4a, b). Similarly, Γ of physical variables are well correlated with each other (Fig. 4c, d). Yet, there is little to no correlation between the Γ of biological variables and physical variables (Fig. 4e, f). This suggests that plankton patchiness is an emergent property due to biological processes that modulate the spatial patterns across scales sufficiently such that it is not correlated to the concurrent physical patchiness. Emergent is defined here as a property that emerges from the ensemble of processes in the ecosystem and environment.

Fig. 4: Variance slope (Γ) comparisons between biogeochemical, physical, and biogeochemical vs physical variables.
figure 4

Γchla correlates with the Γ of other biogeochemical parameters such as particulate attenuation (Γcp(443), a) and mean particle size (Γγ, b), and Γtemperature correlates with the Γ of other physical parameters such as salinity (Γsalinity, c) and (Γdensity, d), yet the Γ of biogeochemical vs physical variables don’t have a notable correlation (e, f). Larger colored markers correspond with the Longhurst provinces from Fig. 5. Contours partition the data’s probability mass function into five equal levels. N.b. correlations are shown for all data, not the Longhurst province means, and all plots have a p-value < 0.001.

Fig. 5: Longhurst provinces with S/V Tara legs overlaid in black.
figure 5

Longhurst provinces that contain N = > 25 legs are shown geographically along with the S/V Tara legs within each province.

If we decrease the top-end length scale that is used for the calculation of Γ, for example from 100 km to 5 km, Γchla and Γtemperature are still uncorrelated (R2 = 0.04 for 0.6 km to 5 km vs R2 = 0.02 for 0.6 km to 100 km). This is in contrast with older work which showed that chl-a and temperature patchiness were correlated below 5 km yet not at scales from 5 km to 80 km19.

Similarly to the differences between geographic distributions patterns (Figs. 1 and 3) the correlations between Γ of different physical variables differ substantially from the correlations between absolute values (Figure S7). We show chl-a and γ have minimal correlation in absolute value yet a correlation in Γ (R2 of 0.03 and 0.42 respectively), chl-a and temperature have a strong correlation in absolute value yet minimal in Γ (R2 of 0.36 and 0.02 respectively), and temperature and salinity have minimal correlation in absolute value yet a correlation in Γ (R2 of 0.04 and 0.25 respectively).

Another fundamental difference between patterns of physical and biological ocean patchiness emerges when looking at the data averaged by biogeochemical provinces and plotting the relationship between Γ and the absolute value of each variable (Fig. 6). The biological variables that are concentration dependent (i.e. chl-a and cp(443)) have a positive relationship between their Γ and their absolute value, such that high biomass provinces are less patchy (i.e. associated with higher Γ) than low biomass provinces, (Fig. 6a–c). In contrast, there is no relationship between Γ and the absolute value of the physical variables (Fig. 6d, e). In other words, biomass and biological patchiness are linked (inversely) while heat and salt content does not correlate with their patchiness.

Fig. 6: Absolute value vs variance slope (Γ) for the variables under analysis.
figure 6

The top rows (ac) represent the biogeochemical variables: chlorophyll-a (chla), particulate attenuation at 443 nm (cp(443)), and mean particle size (γ) and the and bottom rows (df) represent the physical variables: temperature, density (σ), and salinity. The average of each province is shown in the larger markers and the colors correspond to the provinces on the map in Fig. 5. Contours partition the data’s probability mass function into five equal levels. N.b. correlations are shown for all data, not the Longhurst province means, and all plots have a p-value < 0.001.

The linkage between biological patchiness and biological productivity is emphasized when taking into account nutrient availability and oxygen saturation averaged by Longhurst Province (Fig. 7). When mapped onto the absolute value of chl-a and Γchla, a range of nutrients have a clear increasing relationship with both (Fig. 7), with the highest nutrient values also having high chl-a and Γchla. Oxygen saturation has a slight decreasing relationship with more considerable outliers.

Fig. 7: Chlorophyll-a (chla) concentration vs variance slope of chla (Γchla) colored by biogeochemical parameters at the province level.
figure 7

These are average nutrient concentration (a, b, c) and oxygen saturation (d) in each Longhurst province. Nutrient and oxygen saturation data is from the World Ocean Atlas 2018.

The nature of the small-scale variability in biological signals, and the way they differ from the physical signals, is emphasized when zooming in on segments of the transects (Fig. 8). Careful examination reveals that our underway observations of biological variables are characterized by high frequency changes that do not appear in the physical signals. While these high frequency changes may appear to be associated with instrumental noise, the individual measurements are robust, and the changes reflect small scale (< 1 km) biological variability patterns (Fig. 8). This robustness can be seen both in the covariation between different sensors (Fig. 8a) and the raw but consistent absorption and attenuation spectra (Fig. 8b, c). This small-scale variability lowers Γ and can make the variance spectra nearly flat in regions without low frequency variation, such as the Sargasso Sea and Subtropical South Pacific.

Fig. 8: High frequency environmental variability is evident in the raw data.
figure 8

Here we show an example of variables along a single leg (a), showing data “spikes” that may initially be considered instrumental noise, but when inspecting the individual spectra from 408 nm to 730 nm (b and c), which are used to derive the various proxies, they are consistent with expectations for particulate absorption (ap) and attenuation (cp) data. The two bottom rows show a 30 minute subset of ap spectra (b) and cp spectra (c) representing the period between the vertical black lines in panel (a). Each spectra is the average of one minute of sampling, is colored by time (from brown to blue), and spans wavelengths from 410 nm to 750 nm. This transect is from the Coral Sea centered on 20.23 S, 152.90 E.

Our work does not tie the variance slope to any particular process. Possibly Γ represents the integrated impact of physical and biogeochemical characteristics within a region. The lower Γchla in oligotrophic regions may be attributed to a sustained injection of variance from ecosystem interactions (e.g. growth, grazing). To investigate this line of thinking we quantified the patchiness of chromophoric dissolved organic matter (CDOM), which can be thought of as a more passive tracer than chl-a and a slow integrator of biological growth products. ΓCDOM appears to be intermediate with a small amount of variance explained by Γchla and Γtemperature (R2 of 0.17 and 0.06 respectively) and a statistical distribution closer to that of Γtemperature (Figure S8).

Recent work investigating zooplankton patchiness found coherent small-scale patches across many taxa dominated by length scales of 10-30 m43 and even highly idealized models have been shown to generate patchiness via grazing47 which could help explain aspects of our results. While previous work using this same metric suggested that the response time of phytoplankton (i.e. growth rate) is faster than the response time of temperature (i.e. equilibration time) as the reason chl-a is patchier33, this doesn’t agree with our results, where the regions with higher assumed growth rates are less patchy (e.g. eastern upwelling zones). This may suggest that certain elements of the ecosystem are responding faster in these highly patchy oligotrophic regions, possibly grazers or growth following fine-scale vertical nutrient fluxes. Stratification may play a role with a more stratified water column leading to more intermittent linkages between the surface and the nutrient-rich deep ocean, resulting in turn in a patchier expression in the highly stratified subtropical regions. While the drivers are not clear from our work, previous studies have noted that an increasingly patchy distribution requires finer scale sampling or grid spacing33, and if these processes are important, our work indicates fine-scale grid spacing is in fact still important across broad swaths of the ocean generally considered “homogenous”.

While the focus of this work is on scaling relationships, the fractal-like patchiness of phytoplankton is not infinite. At a fundamental level, patches are formed by individual phytoplankton living and dying48. Currently it is unclear how to connect the scales being investigated here with the individual organisms at the heart of the matter; Franks25 notes that it is not likely we can diagnose the dynamics underlying the observations from spectral slopes alone. Towards mechanistic understanding of the processes underlying the spatial patterns we expect Lagrangian approaches will be key, particularly to parse out spatial and temporal dynamics. Future work should examine joint spatial patterning across trophic levels, combining the methods here, rapid methods for a wide size range of plankton49, and methods for quantifying zooplankton patchiness10. Additionally, it may be that improved atmosphere correction of satellite products could enable a spatial analyzes across the global ocean. Higher signal to noise regions, such as coastal waters or higher latitudes during the spring bloom, may be more amenable to satellite analyzes of spatial patterns under existing atmospheric correction schemes.

We identified two fundamental differences between patterns of physical and biological patchiness: globally plankton patchiness at the scales analyzed here (~0.6 km–100 km) is spatially organized while physical patchiness is less coherent and organized differently, and patchiness of concentration-dependent biological variables is correlated with the absolute values of these variables, while no such correlation is found for physical variables. The statistical distributions of Γchla and Γtemperature agree with previous work, where Γchla has a generally lower value, i.e. a patchier spatial pattern, compared to Γtemperature. We find global patterns of biological patchiness to coincide with biogeochemical provinces and with nutrients levels. While our analysis shows the patchiness of biological variables are intercorrelated, and patchiness of physical variables are intercorrelated, contrary to common view, we find no correlation between physical and biological patchiness. This suggests that biological processes modulate the physically-driven advective baseline sufficiently to manifest as a different organization of patchiness. We emphasize here that the absolute value of variance itself is correlated at all scales between physical and biological variables (Figure S9). Moreover, while it is well known that there is a relationship between temperature and chl-a, largely attributed to nutrient fluxes50, our findings show this does not extend to variance slope. Our results contradict results from model studies (e.g34.) and from satellite analyzes (e.g51.), including our own (Figure S5 and S6). This discrepancy holds even when comparing to other work on the same range of scales as those examined here, which had concluded that biological patchiness is controlled by mesoscale mixing28. This suggests that these two important tools for understanding ocean ecosystems (models and satellite observations), fail to capture fundamental characteristics of the marine ecosystem. Our results may also help explain the disagreement shown across decades of patchiness research including many in situ studies. It may be that there is no generalizable patchiness relationship between physics and biology in the ocean and no consistent spatiotemporal scale where biological or physical processes dominate the spatial patterns of phytoplankton. Accordingly, basing general theories on the patchiness of a given tracer type from a specific site and over short time scales could be misguided. Instead, the ocean appears to consist of myriad physical and biogeochemical processes manifested as diverse spatial patterns. Thus, while no consistent relationships emerge these observable spatial patterns may provide a window into the processes creating them.

The results presented here support the use of low dimensional spatial encodings such as Γ to study these upper-ocean processes. We speculate these spatial metrics may be sensitive to biogeochemical parameters not represented by the absolute value of chl-a. Importantly, we cannot resolve marine ecosystems at the mesoscale and below in basin-scale biogeochemical models, thus we must increase our ability to observe, invert, and understand what our models are missing52. Spatial patchiness patterns may help drive our models closer to reality and serve as an underutilized source of insight into the underlying processes generating the observed patchiness.

Methods

All in situ measurements are from the S/V Tara underway sampling system which pulls seawater from 2 m depth. Temperature and salinity data is from a Seabird Scientific thermosalinograph (SBE 38 and SBE 45) and all optical data is derived from particulate absorption and attenuation measured with a Seabird Scientific ACs.

We quantify patchiness using variance slope, Γ, following33 which is similar to spectral analysis, but able to ingest vectors (i.e. transects) with small gaps within them. Γ is calculated on groups of 500 samples using a set of 11 log-distributed windows from 3 to 500. The median distance traveled between individual samples is 0.21 km (Figure S10) thus patchiness is calculated for scales from ~0.6 km to 100 km. Variance and window size were log-transformed, and we ran a least-squares fit to find the slope of this line, yielding the Γ of V=LΓ. It is worth noting a few characteristics of Γ to gain some intuition into the metric. The variance slope of a randomly distributed variable is centered around 0. The calculation is not sensitive to whether a variable is log-normally distributed (such as chl-a). For example, Γchla is identical whether raw chl-a values are input or log-transformed chl-a values. The S/V Tara is a sailing vessel and thus does not always travel in a straight line or with the same speed so here we have used time (minute bins of data) instead of distance to calculate the window, but filtered out legs that have a maximum distance traveled under 30 km or over 150 km. We have required at least 80% of the transect to contain data and filtered out transects where the maximum time elapsed is greater than 16 hours leading to a total of 820 transects used in this analysis.

Hyperspectral absorption and attenuation (400 to 735 nm at ~4 nm spectral resolution, AC-s, Seabird Sci.) were measured continuously. A 0.2 μm filter cartridge was connected to the system and we automatically redirected the flow to measure the properties of filtered seawater for 10 minutes every hour. Total (“normal”) seawater was flowing the rest of the time. Absorption and attenuation measurements recorded during the filtered periods were interpolated across the transect and subtracted from the total seawater measurements to obtain an estimate of the particulate absorption (ap), attenuation (cp) spectra. During the Tara Microbiome transect, the filtered periods were interpolated using the variation in CDOM fluorescence (fCDOM) also measured continuously with a SeaPoint ultraviolet fluorometer (SUVF). fCDOM data was recorded with a Seabird Scientific WSCD CDOM fluorometer during the Tara Pacific transect. The signal to noise ratio of the WSCD is lower than the SUVF’s which introduced noise in the fCDOM interpolation and the product derivation. Therefore, the filtered periods were interpolated linearly across the Tara Pacific transect. This approach permits retrieval of particulate optical properties independently from the instrument drift and biofouling53. For each minute of total seawater measurements (sampled at 4 Hz) the signal between the 2.5th and 97.5th percentiles were averaged, and their standard deviation was used to quantify uncertainty. Dropping the 2.5th to 97.5th percentiles filters out noisy spikes from bubbles which can be a major problem in optical measurements. All spectra were manually checked and quality controlled for obviously bad measurements (e.g. bubbles, bad filtered seawater measurements). These inherent optical properties were used as proxies for a range of particulate properties, primarily chl-a line height, a chl-a estimate derived from the absorption peak at 676 nm54,55, and γ, a proxy for mean particle size39.

Underway data was collected using Inlinino56 an open-source logging and visualization program, and processed using InlineAnalysis (https://github.com/OceanOptics/InLineAnalysis) following best practices57.

Longhurst provinces46 (downloaded from https://www.marineregions.org/sources.php#longhurst) were used as approximate delineations of the global ocean into biogeochemical regions. Nutrient data was from the World Ocean Atlas 2018 (downloaded from https://www.ncei.noaa.gov/data/oceans/woa/WOA18).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.