CIGaRS I: combined simulation-based inference from type Ia supernovae and host photometry

Karchev, Konstantin; Trotta, Roberto; Jiménez, Raúl

doi:10.1038/s41550-026-02842-5

Download PDF

Article
Open access
Published: 06 May 2026

CIGaRS I: combined simulation-based inference from type Ia supernovae and host photometry

Nature Astronomy (2026) Cite this article

2922 Accesses
123 Altmetric
Metrics details

Subjects

Abstract

Using type Ia supernovae as cosmological probes requires empirical corrections that are correlated with their host environment. Here we present a unified Bayesian hierarchical model designed to infer, from purely photometric observations, the intrinsic dependence of the brightness of type Ia supernovae on progenitor properties (metallicity and age), the delay-time distribution that governs their rate as a function of age and cosmology, as well as the redshifts of all hosts. The model incorporates physics-based prescriptions for star formation and chemical evolution from Prospector-β, dust extinction of both galaxy and supernova light, and observational selection effects. We show with simulations that intrinsic dependences on metallicity and age have distinct observational signatures, with metallicity mimicking the well-known step of magnitudes of type Ia supernovae across a host stellar mass of ~10¹⁰ M_⊙. We then demonstrate neural-simulation-based inference of all model parameters from mock observations of ~16,000 type Ia supernovae and their hosts up to redshift 0.9. Our joint physics-based approach delivers robust and precise photometric redshifts (~0.01 median scatter) and improves cosmological constraints by a factor of ~4 over analyses of the small fraction of objects with spectroscopic follow-up. This approach unlocks the full power of photometric data and paves the way for an end-to-end simulation-based analysis pipeline in the LSST era.

A gravitationally lensed supernova with an observable two-decade time delay

Article 13 September 2021

Binary progenitor systems for Type Ic supernovae

Article Open access 03 September 2024

A spectroscopic quadruple as a possible progenitor of sub-Chandrasekhar type Ia supernovae

Article 12 May 2022

Main

It has long been known that the brightnesses, stretches and colours of type Ia supernovae are correlated with the global properties of their host galaxies, for example morphology, stellar mass, star-formation rate (SFR), metallicity and stellar age^{1,2,3,4,5,6,7,8,9,10,11,12}. Similar dependences on the local environment of the supernovae have also been found^{13,14,15,16,17,18,19,20,21,22,23,24,25,26,27}. In current cosmological analyses (for example, refs. ^28,29,30), which use type Ia supernovae as standardizable candles, it is common practice to (attempt to) correct for all these effects en masse via an ad hoc ‘mass step’: a magnitude offset between type Ia supernovae hosted in galaxies with stellar masses above and below a given threshold, usually set to the sample median or fixed around 10¹⁰ M_⊙. The rationale is that stellar mass is a simple-to-estimate proxy for nearly all host properties. Unveiling the physical processes underlying host–type Ia supernova connections would, thus, not only enhance our understanding of type Ia supernovae and their formation but also deliver more accurate and precise constraints on dark energy by improving the standardization procedure.

The environment around a supernova also has extrinsic effects on its light, which is dimmed and reddened by dust along the line of sight, both in the immediate surroundings of the progenitor system and in the interstellar space within the host. It can also be scattered into the line of sight, which noticeably affects finite-resolution photometry³¹. Accounting for dust requires careful hierarchical modelling³², as (1) extinction has a similar effect (redder is dimmer) to intrinsic colour-related standardization (bluer is brighter) and (2) the amount and properties of the dust in a galaxy are related to its SFR and stellar mass (for example, refs. ^33,34) and, hence, to the ages and metallicities of the progenitors of the type Ia supernovae. Different studies have come to conflicting conclusions regarding this interplay. Brout and Scolnic³⁵ attributed the mass step entirely to differences in the dust in low- and high-mass hosts rather than any intrinsic, physical or causal effect, whereas the analysis of Thorp and Mandel³⁶ prefers a single dust law for all hosts and a residual mass step of ~0.05 mag, with more massive galaxies hosting intrinsically brighter type Ia supernovae. By contrast, a unified Bayesian model comparison using the same data and models³⁷ disfavoured a non-zero mass step (posterior odds of 2:1), whereas Grayling et al.³⁸ and Grayling and Popovic³⁹ found evidence in more recent datasets both for intrinsic differences in the brightness and stretch distributions of type Ia supernovae as well as different dust properties (wavelength dependence and optical depths) in low- and high-mass hosts. Moreover, Wiseman et al.^40,41 argued in favour of a split in host age rather than mass, considering that the two are connected by galactic evolution through the process known as ‘downsizing’ (see, for example, ref. ⁴²).

Resolving these degeneracies and arriving at a definite conclusion regarding host–type Ia supernova connections require tackling all interrelated effects simultaneously, including the apparent dimming due to cosmological distance and selection biases. Although Bayesian hierarchical modelling provides the principled framework for doing this, traditional Markov chain Monte Carlo methods require a joint sampling of the vast number of latent parameters (tens per supernova) and explicit computation—at every step—of the selection probability, which is intractable in realistic scenarios. To date, analyses either resort to ad hoc approximations^29,43,44 or are split in two separate steps (see, for example, refs. ^36,45): first, the distance moduli of supernovae (that is intrinsic brightnesses) are estimated and corrected for selection effects using a fiducial, and possibly misspecified, model⁴⁶; then, they are correlated with galaxy properties obtained in a separate analysis (rarely considering the associated uncertainties⁴⁷) or used in a cosmological fit.

A two-step approach cannot account for many important statistical effects. For instance, the ages of progenitors of type Ia supernova are not representative of the stellar populations of their hosts, and so regressing using the latter may be misleading. Moreover, standardization implicitly gives higher weight to more massive hosts, where a larger number of supernovae occur, an instance of Eddington bias⁴⁸^,⁴⁹. Likewise, a preference for detecting (apparently) brighter objects biases the selected sample towards intrinsically brighter supernovae to a greater extent in dustier hosts, which are also more massive. Last, an apparent correlation—non-existent intrinsically—between supernova brightnesses and the properties of the hosts may be introduced by the use of photometric redshifts in the standardization process due to the mass–age–redshift degeneracy. Similarly, an apparent redshift dependence of supernova properties^50,51 might arise due to the evolution of their hosts, for example their dust content^52,53, metallicity or age⁵⁴.

In this paper, we present a framework for combined inference and galaxy-related standardization (CIGaRS) of type Ia supernovae and their hosts, which addresses all the above conceptual and methodological issues and can unequivocally disentangle intrinsic and extrinsic effects through physics-based forward modelling. We formulate our model in the context of simulation-based inference (SBI) (for overviews, see refs. ^55,56), a modern suite of Bayesian inference techniques that leverage the flexibility of neural networks (NNs) to obtain posterior distributions, given only training examples from a forward simulator. SBI is seeing rapid adoption across disciplines and has previously been applied in studies of type Ia supernovae to analyse dust distributions from collections of light curves (LCs)^37,57 and to perform cosmological inference from future-sized datasets (with up to 10⁵ type Ia supernovae) in the presence of photometric-like redshift uncertainties⁵⁸ and arbitrary selection effects⁴⁹.

Our simulator (depicted graphically in Fig. 1 and fully elaborated in ‘Unified forward modelling of galaxies and type Ia supernovae’) adopts the prescription for galaxy evolution from Prospector-β^59,60 and the Simple-BayeSN hierarchical model for LC summaries of type Ia supernovae³². It then applies dust extinction, cosmological distance, realistic measurement noise and sample selection to both. We model and infer the intrinsic correlations between (true) host metallicity, (true) progenitor age and (true intrinsic) supernova absolute magnitude, which are latent variables in the hierarchical model. In addition, we allow for a residual mass step and infer its size (in magnitudes) and location (host mass). Last, we include a self-consistent formulation of the occurrence of type Ia supernovae within galaxies through the delay-time distribution (DTD): the rate at which type Ia supernovae arise from the stellar population of their hosts based on their ages, which can be calculated from first principles for plausible progenitor scenarios for type Ia supernovae⁶¹. Our framework seamlessly integrates inference of the DTD under a suitable parameterization, thus offering a tool that can shed light on the much-debated formation mechanism of type Ia supernovae^62,63.

**Fig. 1: Unified Bayesian hierarchical modelling of type Ia supernovae and host galaxies.**

While insights into the DTD and the host–type Ia supernova connections are an important output of our analysis, we envisage its primary future application to be cosmological inference from the Legacy Survey of Space and Time (LSST), soon to commence at the Vera Rubin Observatory. Owing to the large spatial coverage and depth (18,000 deg² reaching redshift ~1) of its primary wide–fast–deep survey component, only ~1% of the ~10⁶ supernova candidates expected over 10 years will have spectroscopic typing and redshift, and even with dedicated campaigns like 4MOST/TiDES⁶⁴, only up to 10% of the host galaxies will be followed up.

In view of this predominantly photometric scenario, we do not consider any spectroscopic observables or explicit redshift estimates, and we provide the inference network only with broadband photometry of the hosts and LC summaries for the type Ia supernovae (we do assume a perfect transient classification, noting that non-Ia contamination can be straightforwardly included in our framework). Owing to the physical modelling of galaxy evolution and the pooling of strength introduced by the joint analysis, our framework delivers state-of-the-art photometric redshift estimates as a by-product as well as considerably more stringent cosmological constraints than using supernova data alone (even with spectroscopic redshifts). We explore more thoroughly the implications of these findings for supernova cosmology with future datasets in a dedicated companion paper, focusing here on host–type Ia supernova connections and the DTD.

Results

To examine the phenomenology and demonstrate inference with our unified model, we generate a mock dataset D₀, with global parameters as listed in Extended Data Table 1 and object-specific (latent) quantities sampled from their hierarchical priors. We scale the counts to be representative of the current flagship Dark Energy Survey (DES), which contains 1,635 photometrically classified supernovae⁶⁵. In our particular D₀, the number of selected objects N_sel,0 = 1,578.

Predictions from the unified forward model

Physical host–type Ia supernova correlations and induced stellar-mass dependence

Our forward model includes an explicit dependence of the magnitude offsets of type Ia supernovae (δM^s) on the true progenitor metallicity (${Z}_{* }^{s}$) and age (${t}_{* }^{s}$): see equation (10). We plot those (separately) in Fig. 2 (with correlation parameters ${\gamma }_{{Z}_{* }}$ and ${\gamma }_{{t}_{* }}$ chosen consistently with observations^8,12 and indicated in the respective panels) against the stellar mass of the hosts. Owing to well-known trends in galaxy evolution (more massive galaxies are, in general, older and more metal-rich), the brightness of type Ia supernovae appears to depend on stellar mass. These are incorporated in Prospector-β through the use—with appropriate scatter—of an empirical star-formation history (SFH)⁶⁶ and mass–metallicity relation⁶⁷, and they give rise to a crucial difference: whereas a metallicity correlation (${\gamma }_{{Z}_{* }}\ne 0$) induces a (smoothed) step across M_* ≈ 10¹⁰, a progenitor-age correlation (${\gamma }_{{t}_{* }}\ne 0$) manifests as an approximately linear trend with $\log \,{M}_{* }$. Moreover, the distributions of magnitude offsets at a given stellar mass, p(δM ∣ M_*), also differ: although the metallicity shows a Gaussian spread, progenitor ages are highly skewed towards younger populations due to the fast-decaying DTD (we adopt a power law with slope b = −1.34; ref. ⁶⁸).

**Fig. 2: Metallicity and age dependences as apparent ‘mass steps’.**

Owing to these differences, in principle, the two scenarios can be distinguished using stellar-mass estimates alone, which are easy to obtain: see our results in section ‘Object-specific parameters’. However, as we show next, this task is complicated by the presence of colour- and stretch-related variations and residual scatter (~0.3 mag in total), as well as uncertainties both in the dependent (δM^s) and independent variables (${M}_{* }^{s}$, ${Z}_{* }^{s}$ and ${t}_{* }^{s}$). Finally, note that splitting at the median mass—as customary—results in a diminished step (dashed lines) as both subsets are ‘pulled’ towards the peak of the M_* distribution (~10¹¹ M_⊙). This is an instance of Eddington bias^48,49.

Redshift distribution and Hubble diagram

A key outcome of our unified approach is the predicted redshift distribution of type Ia supernovae, which results from the combination of galaxy evolution (according to Prospector-β) and a DTD-based occurrence model. At low cosmological redshifts (up to z_c ≈ 0.3), it is qualitatively consistent with a power-law volumetric rate with exponent 1 to 1.5, as inferred by Dilday et al.⁶⁹ from Sloan Digital Sky Survey data and widely used in the literature (for example, in PLAsTiCC⁷⁰). Above z_c ≈ 0.4, the rate starts decreasing due to the decline in the galaxy population. Finally, at z_c ≳ 0.6, corresponding to a Universe of age ≲8 Gyr, the frequency of type Ia supernova occurrence within hosts decreases, as potential hosts are smaller (as there is less time for star formation) and younger (thus, reducing the range of the DTD). This effect is compounded with the drop in detection efficiency, and the result is that—for our selection function, which mimics the LSST wide–fast–deep survey (‘Detection and selection of type Ia supernovae’)—no type Ia supernovae are detected above z_c ≈ 0.8.

In the bottom panel of Fig. 3, we show the different absolute magnitude offsets of the type Ia supernovae included in our forward model. These would be the residuals from a perfectly specified cosmological model and standard magnitude M₀ in a traditional analysis. The dominant source of scatter is standardizable correlations with stretch and colour (~0.26 mag). At high redshift, the sample exhibits a typical Malmquist bias^71,72 towards brighter supernovae (downwards trend) with a reduction of observed scatter due to the removal of dim objects. The effect of dust is also systematic. It dims (shifts upwards) the extrinsic magnitudes ${M}_{{\rm{ext}}}^{s}$ (red-shaded band) with respect to ${M}_{{\rm{int}}}^{s}$ (green-shaded band) by ~0.13 mag. Finally, we show the intrinsic host-related magnitude offset (δM^s, purple-shaded band), which includes metallicity and age correlations, as visualized in Fig. 2, and a residual mass step of 0.05 mag across 10¹⁰ M_⊙. These effects lead to an offset due to the dominance of high-mass (and hence, higher-metallicity and older) galaxies, but their sizes are small in comparison with the above (on the order of the irreducible random scatter: σ₀ = 0.1 mag), making the host–type Ia supernova connections difficult to infer.

Traditional analyses need to incorporate all these effects in an intricate likelihood that attempts to disentangle intrinsic and extrinsic influences, scatter from measurement uncertainty, trends from selection biases to derive accurate constraints on the cosmological parameters, host–type Ia supernova connections, and the DTD. In our SBI framework, on the other hand, it is sufficient to include them in the forward model, and the NN learns from training data how to solve the full inverse problem.

Simultaneous SBI

In this section, we demonstrate the ability of CIGaRS to obtain—simultaneously—marginal posteriors for all parameters in the hierarchical model, given only LC summaries of type Ia supernovae (stretch, apparent brightness, colour at peak and the associated fitting uncertainties) and LSST-like ugrizy photometry of the hosts. We use truncated marginal neural ratio estimation (TMNRE)^73,74 to train a conditioned deep set++ network^49,75,76 (Fig. 4) and apply it to the simulated dataset presented above. With minimal fine-tuning, we then scale to a tenfold larger mock catalogue of ~16,000 type Ia supernovae and their hosts (expected to be discovered in a month’s worth of wide–fast–deep survey observations). Details of the network architecture and the iterative training procedure are presented in section ‘Set-based TMNRE for hierarchical models’.

Global and population parameters

Figure 5 shows the final posteriors for all global and population parameters. All groups are significantly constrained with respect to their initial priors, and the results are fully consistent with the values used to simulate the mock data.

**Fig. 5: Inference results from the two mock datasets.**

We constrain the power-law slope of the DTD, b, to within ±0.05, comparable with the uncertainty in the DES analysis⁷⁷ of 809 objects, which used spectroscopic redshifts. We also recover the parameters of all three kinds of intrinsic host–type Ia supernova dependences that we consider: correlations with the progenitor metallicity or age and the size and location of an additional mass step. Importantly, we can disentangle a metallicity from an age dependence even when both are present (as evident from the uncorrelated posterior in the ${\gamma }_{{Z}_{* }}$–${\gamma }_{{t}_{* }}$ plane). By contrast, a metallicity dependence is harder to distinguish from a mass step (hence the strong a posteriori correlation in the ΔM–${\gamma }_{{Z}_{* }}$ plane), as could be anticipated from our exploration of the model predictions (Fig. 2).

Finally, we present marginal posteriors for the cosmological parameters (dark matter and dark energy densities, Ω_m0 and Ω_Λ0) and compare them with an idealized analysis in which all the latent parameters (redshifts, stretches, intrinsic colours, dust and host-related magnitude offsets) are exactly known, and the only sources of uncertainty are the irreducible magnitude scatter of σ₀ = 0.1 mag and measurement noise. Even this unrealistic scenario, which would at minimum require expensive spectroscopic observations, delivers cosmological constraints (in terms of area in the Ω_m0–Ω_Λ0 plane) that are only 3 times stronger.

However, the main advantage of our fully general and realistic approach is that it can be applied to much larger photometric-only datasets. To illustrate the improvement we expect from including all discovered type Ia supernovae rather than only those with spectroscopic follow-up (about 10% at most with LSST and 4MOST/TiDES), we also show the posteriors obtained from our ×10 larger mock dataset. We verify that the precision of our principled and self-consistent analysis scales as expected with the survey size. The one-dimensional credible intervals are $\sqrt{10}\approx 3$ times narrower, and the areas of two-dimensional regions shrink 10 times. Thus, CIGaRS will allow robust conclusions to be drawn about the progenitors of type Ia supernovae and their environmental dependencies, which has never been achieved before, while also improving cosmological constraints ~4 times over current methodologies that rely on spectroscopy.

Object-specific parameters

Marginal posteriors for the parameters of all 1,578 hosts and type Ia supernovae—derived simultaneously with the global results—are presented in Fig. 6 and Extended Data Fig. 1. The true values are recovered well across the full ranges, and although our results for the stretch and colour parameters are largely driven by the (noisy) direct ‘measurements’ (${\widehat{x}}^{s}\pm {\widehat{\sigma }}_{x}^{s}$ and ${\widehat{c}}^{s}\pm {\widehat{\sigma }}_{c}^{s}$) included in the data, they are also informed by the respective population distributions, which reduces the uncertainties when the observational errors are large. This is an instance of Bayesian shrinkage. For the host stellar mass and metallicity, our uncertainties match those from previous applications of Prospector^59,60. When the ‘signal’ is intrinsically low, for example for low metallicities, our results revert to the prior, which ensures proper Bayesian uncertainty propagation. Moreover, photometric redshifts are inferred with exceptionally high precision: a median posterior standard deviation ~0.01, no outliers with ∣Δz∣ > 0.1 and no signs of a systematic bias, owing to (1) the Prospector-β priors informed by galaxy evolution, which break the age–mass–redshift degeneracy⁶⁰ and (2) the extraction of redshift information from supernova (and host) brightnesses, given the globally inferred cosmological model. This is on par with analyses of ultraviolet–optical–infrared data with much wider coverage⁷⁸ and represents a considerable improvement over the expected scatter of ~0.04 from LSST photometry alone⁷⁹. Despite using only summarized supernova observables, our results already surpass the state of the art with full LC and host photometry. See Chen et al.⁸⁰, who achieved a photo-z scatter of ~0.02 for DES.

**Fig. 6: Inference results for host-related parameters.**

Conclusion and outlook

We have presented CIGaRS, a unified and fully self-consistent framework for the rigorous Bayesian analysis of summary statistics of type Ia supernovae and the photometry of their hosts. It is based on a forward simulator that combines two hierarchical models, Prospector-β and Simple-BayeSN, via a prescription for the occurrence of type Ia supernovae (according to a DTD convolved with non-trivial SFHs) and physical connections between their brightnesses and progenitor metallicity or age. CIGaRS qualitatively reproduces the observed type Ia supernova rate with redshift and the distribution of host stellar masses and makes detailed predictions about the apparent stellar-mass dependence induced by the underlying physical relations. A metallicity correlation resembles a smoothed step function across 10¹⁰ M_⊙ with symmetric Gaussian scatter, whereas an age correlation leads to a linear trend with stellar mass and highly skewed residuals.

We have also demonstrated simultaneous neural SBI of the DTD parameters, brightness versus metallicity and age correlations, a residual mass step, and cosmology purely from LSST-like host photometry and summaries of type Ia supernovae (brightness, stretch and colour) of up to ~16,000 objects. Our approach implicitly handles redshift estimation, host-related standardization and selection effects while propagating all uncertainties, and thus, it remains accurate, precise and scalable. Moreover, due to the unified hierarchical modelling and combination of supernova and host data, we achieve robust photometric redshift estimates with state-of-the-art scatter and cosmological results that rival spectroscopic supernova-only analyses. In a companion work, we investigate the ability of CIGaRS to also standardize the host photometry and the effect that this has on the cosmological constraints.

We note, finally, that the present analysis is performed within a constrained—albeit fully self-consistent—modelling framework, which may not exactly represent reality. For example, we have fixed (based on previous studies) foundational details of the Prospector model, like the stellar initial mass function, stellar isochrones and the intergalactic dust-extinction law. The systematic effect of such choices can be easily examined a posteriori with SBI by quickly analysing various simulated datasets. We show in Extended Data Figs. 2 and 3 one example of replacing the KC13 law⁸¹ with F99⁸². The resulting bias is imperceptible at the individual-object level but significantly affects the pooled global-parameter inference, especially as the data grow larger and more constraining. Nevertheless, with SBI, we can properly account for such ‘known unknowns’ by treating them as nuisance variables, like the (non-)presence of the various host–type Ia supernova connections we already consider. Concretely, by training the network on mock data with a variety of initial mass functions, isochrones, dust laws and so, it can infer the correct models if they are clearly discernible from the data and automatically include the appropriate ‘systematic’ uncertainty associated with the modelling choice.

Further extensions of the model that we envisage include the spatial localization of the stellar populations (and, hence, of the supernovae) and of interstellar dust, the peculiar velocities of the hosts and of the supernovae within them (which will allow inference of cosmological parameters controlling large-scale structure), the presence of non-Ia transients, their detection and characterization directly from a time series of telescope images, and the incorporation of spectroscopic data, where available. Notably, our inference framework can be applied to an arbitrarily complicated simulator and collections of observables without major modifications, while retaining its scalability, and we expect it to be instrumental in maximizing the scientific insight extracted from upcoming vast and detailed LSST observations.

Methods

Unified forward modelling of galaxies and type Ia supernovae

CIGaRS is a unified forward model for type Ia supernova cosmology. It is a diptych (Fig. 1) of two state-of-the-art Bayesian hierarchical simulators for photometric galaxy observations (d^h) and summary statistics of type Ia supernovae (d^s), joined on three levels:

1.
by the probability of occurrence (equivalently, number ${N}_{{\rm{SN}}}^{h}$) of type Ia supernovae in a given host h, based on the intrinsic properties of the latter, g^h;
2.
by the distribution of the intrinsic properties λ^s of the supernovae, which also depend on the g^h(s) of their respective hosts;
3.
by extrinsic effects of the host environment (for example, dust) and other properties (for example, cosmological redshift and peculiar velocity) on supernova light as it travels towards the observer.

The former two, which affect λ, can be considered causal connections, whereas the latter, which affects only the observables and data d, are coincidental. One last connection can arise during data reduction (for example, subtraction of the host light). We plan to implement this in a future extension that also implements the image analysis procedure of transient surveys. In the following, we describe in detail all components of CIGaRS (see also the full hierarchy in Fig. 1 and the list of parameters in Extended Data Table 1) and discuss the planned improvements, which are mainly related to the intrahost localization of the supernovae.

Host model

A comprehensive treatment of the photo-spectral evolution of galaxies—see the pioneering studies using the MOPED method^83,84,85 and, for example, Mo et al.⁸⁶ for a review—is beyond the scope of this work. At a high level, the intrinsic properties of a galaxy h, such as its SFH, metallicity (${Z}_{* }^{h}$) and interstellar dust content, are largely determined by the total stellar mass of the galaxy (${M}_{* }^{h}$) and its cosmological redshift (${z}_{{\rm{c}}}^{h}$), the latter being a proxy (given a cosmological model) for the age of the Universe (T^h) at the time the galaxy’s light was emitted. To represent the intrinsic correlations inherent in galaxy formation, we adopt an off-the-shelf Bayesian framework for galaxy photometry: Prospector⁸⁷, as updated in Prospector-β⁶⁰.

SFH and chemical enrichment: Prospector-β

Prospector encodes a non-trivial SFH by discretizing it into seven time bins (the optimal number for describing moderate signal-to-noise observations⁸⁸) spanning [0; T^h]: the first two are fixed to [0 Myr; 30 Myr] and [30 Myr; 100 Myr], the next four logarithmically spaced within [100 Myr; 0.85T^h], and the last covering [0.85T^h; T^h]. The total mass of a galaxy is then distributed into seven stellar populations that represent the stars born during each history bin:

$${{\bf{SFH}}}^{h}\equiv {\left[{\mathrm{SFH}}^{h,j}\right]}_{j=1}^{7},\,\,\,\,\,\,\mathrm{with}\,\,\,\,\,\,\mathop{\sum }\limits_{i=1}^{7}{\mathrm{SFH}}^{h,j}={M}_{* }^{h}.$$

(1)

In each forward realization, SFH^h is sampled from the prior $p({{\bf{SFH}}}^{h}| {M}_{* }^{h},{T}^{h})$ introduced in Prospector-β (Section 3.3 of ref. ⁶⁰). The model adopts a non-parametric continuity prescription⁸⁹ whereby the ratio of SFRs in adjacent bins exhibits a random scatter around the value dictated by the average (‘cosmic’) SFR density⁶⁶, taking into account also the empirical observation that high-mass galaxies form earlier.

For a given SFH^h, and assuming that all stars in a given population were born at approximately the same time (${t}_{* }^{h,j}$, taken at the (linear) centre of the respective bin), the process of stellar population synthesis (SPS) produces the integrated light (before extinction) from all stars in a galaxy by convolving the emission spectra of single stellar populations with the SFH.

SPS also requires a prescription for the galactic chemical enrichment history: the metallicities of all its stellar populations. Prospector assumes a single metallicity within a galaxy (${Z}_{* }^{h}$), determining it (with appropriate scatter) from the total stellar mass, as in Gallazzi et al.⁶⁷. This has also been shown to be appropriate for modelling the progenitors of type Ia supernovae⁹⁰, which is our ultimate goal.

Host dust extinction

The light emitted by stars is then extinguished (and reddened) by dust within the host. Prospector considers separate birth-cloud and diffuse dust components. The former affects only the youngest stellar population (acting on the emission only from the first time bin) and has a fixed wavelength dependence proportional to λ⁻¹ and optical depth ${\tau }_{1}^{h}$. Extinction due to diffuse (interstellar) dust is described by the KC13 law (ref. ⁸¹, following refs. ^91,92) with shape parameter δ^h and optical depth ${\tau }_{2}^{h}$ related to the column density of dust along the line of sight (as it represents the total extinction in the (rest-frame) V-band, we will label it ${A}_{{{\rm{V}}}}^{h}$ instead).

In Prospector-β, the dust parameters are a priori independent from other galaxy properties; however, many studies have found a posteriori relations from large galaxy surveys. We enforce the empirical relations of Alsing et al. (equations (14) and (15) in ref. ⁹³):

$${\tau}_{2}^{h}\equiv {A}_{{\mathrm{V}}}^{h} \sim {\mathcal{N}}\left(0.2+0.5\,{\mathrm{ReLU}}\left({\log}_{10}\,\left[{\mathrm{SFR}}^{h}\,\left({M}_{\odot}\,{\mathrm{yr}}^{-1}\right.\right)\right],0.{2}^{2}\right),$$

(2)

$${\delta }^{h} \sim {\mathcal{N}}\left(-0.095+0.111{A}_{{\mathrm{V}}}^{h}-0.0066{({A}_{{\mathrm{V}}}^{h})}^{2},0.{4}^{2}\right),$$

(3)

where the SFR is derived from the most recent (current) component of SFH^h: SFR^h ≡ SFH^h,1/30 Myr. Prospector then sets the optical depth of the birth-cloud dust according to ${\tau }_{1}^{h}/{\tau }_{2}^{h}\approx {\mathcal{N}}(1,{0.3}^{2})$.

Importantly, the above description of dust extinction is global, that is averaged over the spatial light distribution of the galaxy, as the host photometry is usually integrated. It may, thus, not faithfully represent the local extinction law or the optical density of dust at any particular location within the galaxy, for example in the proximity of a given type Ia supernova. The spatial distribution of dust properties is a topic of active research (for example, refs. ^94,95,96,97), with mounting evidence for its effect on the standardization of type Ia supernovae^98,99. We will address this in a future refinement of the model, in coordination with the dust effects applied to supernova light.

Photometry and detection of hosts

Given the above quantities (and parameters that describe the presence and emission of interstellar gas and an active galactic nucleus, which we randomly sample from the Prospector-β priors as they are not directly relevant to studies of type Ia supernovae), we compute the rest-frame spectral energy distribution—more accurately, the spectral flux—Φ^h(λ_r) using the NN-based SPS emulator Speculator-α¹⁰⁰. The speed-up this brings over traditional (from first principles) SPS is crucial for simulating the large amount of training data we need.

We then apply a redshift to Φ^h (in this study, we assume no peculiar velocities, ${{z}_{{\rm{c}}}}^{h}={z}^{h}={z}^{{s}}$, as the fraction of objects in our simulations and mock data with z_c ≲ 0.02, where the distinction would be important, is negligible) and apply a cosmological (transverse co-moving) distance ${D}^{h}\equiv D({z}_{{\rm{c}}}^{h},{\mathcal{C}})$, where ${\mathcal{C}}$ are the cosmological parameters, to convert it to the observer-frame spectral flux density:

$${F}^{h}({\lambda }_{{\rm{o}}})\equiv \frac{{{\varPhi}}^{h}({\lambda }_{{\rm{o}}}/(1+{z}^{h}))}{{(1+{z}^{h})}^{3}4{{\uppi }}{({D}^{h})}^{2}},$$

(4)

where λ_o is the observer-frame wavelength. In principle, at this stage we need to account for extinction in intergalactic space^101,102 and in the Milky Way. The former is a small effect (~0.015 mag at z_c = 0.6) and is rarely addressed explicitly in supernova cosmology (see, for example, Section 4.4.7 in ref. ²⁹), whereas the latter can be accounted for by using detailed multi-wavelength attenuation maps of the Milky Way^{103,104,105,106}. Hence, in this work, we ignore the two effects and assume that the observations have been appropriately corrected.

Thus, we directly integrate (numerically) F^h within the Rubin ugrizy passbands, convert to AB magnitudes and apply 0.01 mag Gaussian noise (a conservative estimate for the Rubin Observatory’s absolute photometric calibration¹⁰⁷) to arrive at the simulated galaxy data d^h, a vector of length 6. Finally, to simulate detection, that is the indicator variable S^h, we apply a simple magnitude cut by stipulating that all six measured magnitudes must be brighter than the respective 5σ detection thresholds expected for LSST 10-yr co-added imaging¹⁰⁸.

The description above treats galaxies as isotropically emitting point sources, whereas in reality, measuring their brightnesses requires accounting for inclination and de-blending of overlapping objects, which—to first order—would inflate the error and uncertainty beyond the photometric calibration floor we have assumed. To account for higher-order effects (for example, complicated biases due to inclination, blending or selection effects arising from the complex procedure for detecting and selecting objects for inclusion in a galaxy catalogue), one would need to process the raw image data. Owing to the flexibility of neural SBI, this can be achieved through high-fidelity simulations (see, for example, refs. ^{109,110,111,112,113}) to which the same data-reduction procedure has been applied as to the real data.

Occurrence of type Ia supernovae

The first connection between galaxies and type Ia supernovae (or any other transient) is the emergence of the latter from the stellar populations of the former, which we describe through the DTD: the rate (per unit rest-frame time) of occurrence of type Ia supernovae from a given stellar population (per unit stellar mass within it), as a function of the age of the population t_*. We adopt a power-law parameterization:

$${\rm{DTD}}\,({t}_{* })=A\times {({t}_{* }/{\rm{Gyr}})}^{b}\times {{{\rm{M}}}_{\odot }}^{-1}\,{{\rm{yr}}}^{-1},$$

(5)

which is guided by the theory of binary systems that decay through the emission of gravitational waves (for which b = −1), with independent priors on the normalization and slope from observational constraints⁶⁸:

$${\log }_{10}\,A \sim {\mathcal{N}}\left(-12.15,{0.1}^{2}\right),$$

(6)

$$b \sim {\mathcal{N}}\left(-1.34,{0.2}^{2}\right).$$

(7)

To avoid an infinite total rate, we set the DTD to zero for t_* < 0.1 Gyr (that is, for the two youngest stellar populations in Prospector), corresponding to the minimum time for the creation of a carbon-oxygen white dwarf^114,115. This physical prescription resolves the strong a posteriori correlation between the cutoff time and A, which has previously prevented direct inference of the former⁷⁷. In future analyses, we plan to relax this assumption and include a mixture of formation channels (for example, ref. ¹¹⁶), modelling their DTDs from first principles.

Given the DTD, we calculate the number of type Ia supernovae that arise (on expectation) from the stellar population j of galaxy h (recall that we assumed all stars within it have the same age ${t}_{* }^{h,j}$) during a survey of (observer-frame) duration ${\mathcal{T}}$:

$$\left\langle {N}_{\mathrm{SN}}^{h,\,j}\right\rangle =\frac{{\mathcal{T}}}{1+{z}^{h}}\times {\mathrm{SFH}}^{h,\,j}\times \mathrm{DTD}\,({t}_{* }^{h,\,j}).$$

(8)

We then iterate (in parallel on a GPU) over all stellar populations of all galaxies, that is over all h and j, to generate

$${N}_{\mathrm{SN}}^{h,\,j} \sim \mathrm{Poisson}\left(\left\langle {N}_{\mathrm{SN}}^{h,\,j}\right\rangle \right)$$

(9)

supernovae. We sample their ages, ${\{{t}_{* }^{h,\,j,k}\}}_{k=1}^{{N}_{\mathrm{SN}}^{h,\,j}}$, uniformly within the time bin associated with population h and j. To simplify notation, we uniquely label each supernova with $s\in \{1,\ldots ,{\sum }_{h,\,j}{N}_{\mathrm{SN}}^{h,\,j}\}$ and keep track of the host h(s) of each, so that we can associate the relevant intrinsic galaxy properties: ${z}_{{\rm{c}}}^{\,s}\equiv {z}_{{\rm{c}}}^{\,h(s)}$, ${Z}_{* }^{s}\equiv {Z}_{* }^{\,h(s)}$ and ${M}_{* }^{s}\equiv {M}_{* }^{h(s)}$ and host observations d^h(s). We note that this assumes a perfect host association, which is often not the case in real observations, and we ignore the extra information from instances of several supernovae arising in the same host (see, for example, ref. ¹¹⁷); we plan to make improvements in these respects by extending the simulator towards detection and characterization of the transients directly from telescope images.

Finally, it is also possible to keep track—in the simulator—of the stellar population j(s) from which each supernova has arisen and associate the corresponding properties, if these differ from one stellar population to another within the same host (for example, to represent a fully realistic enrichment history through different ${Z}_{* }^{\,h,\,j}$). Similarly, we can simulate the spatial distribution of stellar populations that give rise to metallicity and age gradients (for example, refs. ^118,119) and differences in dust extinction (for example, refs. ^94,95,96,97). We defer an expansion of the simulator within the hosts to future work.

Model for type Ia supernovae

Once we have determined how many type Ia supernovae have occurred during a survey, we proceed to simulate their observables. This follows a similar path, going from the intrinsic properties derived from a population model informed by the host of each object, through extinction, redshift and distance to noisy measurements and detection or sample selection.

Causal host–type Ia supernova connections

The cornerstone of CIGaRS is an explicit parameterized dependence of the intrinsic properties of type Ia supernovae on the characteristics of their host or of their progenitor stellar population: in general, p(λ^s∣g^h(s),j(s), γ), where γ are global parameters. Our choice of which links to include is motivated by observational evidence^{4,6,7,8,9,10,11,12} and takes the form of a host-dependent magnitude offset δM^s, but we could similarly introduce, for example, a parameterized host-dependent distribution of ${x}_{{\rm{int}}}^{s}$ (ref. ²⁶).

As the physical source of δM^s is not yet established, we allow for several possibilities (in isolation or combination)—namely, metallicity ${Z}_{* }^{s}$ and progenitor age ${t}_{* }^{s}$—with correlation coefficients, respectively ${\gamma }_{{Z}_{* }}$ and ${\gamma }_{{t}_{* }}$, whose priors include 0 and wide ranges of positive and negative values. In keeping with current practice and as a ‘catch-all’ parameter, we also allow for an additional (or residual) ‘mass step’ ΔM with a similar wide prior and a free (inferred) stellar-mass location ${M}_{* }^{{\rm{step}}}$. The full intrinsic host connection is, therefore:

$$\delta {M}^{s}=\underbrace{{\gamma}_{{Z}_{\ast}}\times{{\rm{log}}_{10}}{{Z}_{\ast}^{s}}/{{Z}_{\odot}}}_{\rm{metallicity}}+ \underbrace{{\gamma}_{{t}_{\ast}}\times {t}_{\ast}^{s}}_{\rm{age}}+\underbrace{{\Delta} M \times {\mathbb{I}}({M}_{\ast}^{s} {>} {M}_{\ast}^{\rm{step}})}_{\text{ad hoc mass step}},$$

(10)

where 𝕀 returns 1 or 0 if its argument is true or false, respectively, and ${\gamma }_{{Z}_{* }},{\gamma }_{{t}_{* }},\Delta M,{M}_{* }^{\mathrm{step}}\in {\boldsymbol{\gamma }}$ are global parameters. Inference of the presence and strength of the respective effects is performed by examining their posterior(s) or performing a Bayesian model selection, which is most conveniently and rigorously achieved through simulation-based methods³⁷. Moreover, with SBI, all underlying correlations are accounted for and marginalized over when inferring the cosmology, which ensures the robustness of dark energy inference against systematics arising from the host–type Ia supernova dependences.

Intrinsic properties and dust extinction of type Ia supernovae: Simple-BayeSN

Bayesian hierarchical modelling replaces empirical standardization with a priori correlated latent parameters sampled from hierarchical priors^32,120:

$$\text{stretch:}\,\,{x}_{\mathrm{int}}^{s} \sim {\mathcal{N}}\left({\mu }_{x},{\sigma }_{x}^{2}\right),$$

(11)

$$\text{colour:}\,\,{c}_{\mathrm{int}}^{s} \sim {\mathcal{N}}\left({\mu }_{c}+{\alpha }_{c}{x}_{\mathrm{int}}^{s},{\sigma }_{c}^{2}\right),$$

(12)

$${\rm{absolute B}}{\text{-}}{\text{band magnitude}}\!:\,\,{M}_{\mathrm{int}}^{s} \sim {\mathcal{N}}\left({M}_{0}+\alpha {x}_{\mathrm{int}}^{s}+\beta {c}_{\mathrm{int}}^{s}+\delta {M}^{s},{\sigma }_{0}^{2}\right)$$

(13)

where μ_x, σ_x, μ_c, σ_c, M₀, σ₀, α, α_c, β ∈ γ are population parameters assigned fixed hyper-priors, as listed in Extended Data Table 1.

Like starlight, supernovae are affected by the dust surrounding them and along the line of sight, that is, in intergalactic space and in the Milky Way. We will ignore the latter two, assuming that they have been perfectly corrected for, and adopt the Bayesian formulation of host dust extinction from Simple-BayeSN³², which acts on the intrinsic parameters introduced in equations (11) to (13) to obtain their ‘extrinsic’ versions:

$${x}_{{\rm{ext}}}^{s}={x}_{{\rm{int}}}^{s},$$

(14)

$${c}_{\mathrm{ext}}^{\,s}={c}_{\mathrm{int}}^{s}+{E}_{B-V}^{\,s}={c}_{\mathrm{int}}^{\,s}+{A}_{V}^{s}/{R}_{V}^{s},$$

(15)

$${M}_{\mathrm{ext}}^{s}={M}_{\mathrm{int}}^{s}+{A}_{B}^{s}={M}_{\mathrm{int}}^{s}+{A}_{{\rm{V}}}^{s}({R}_{V}^{s}+1)/{R}_{V}^{s},$$

(16)

where E_B−V ≡ A_B − A_V is the selective extinction, with A_B the total extinction in the rest-frame B-band (in which the peak magnitude of type Ia supernovae is typically standardized) and R_V ≡ A_V/E_B−V. Note that the colour–magnitude standardization coefficient β in equation (13) refers only to c_int rather than the dust-affected c_ext.

Ideally, in a unified model, one would coordinate the extinction applied to type Ia supernovae with the dust properties of the host. However, due to the complicated distribution of dust within galaxies (for example, refs. ^94,95,96,97) and the non-trivial attenuation effects of dust, which include not only extinction but also scattering of galactic and supernova light into the line of sight³¹, the simple approach of adopting the host-global ${A}_{V}^{h}$ and R(δ^h) predicts an order of magnitude larger optical depths than empirically observed in supernova data. We will explore the effect of dust localization in a further study, and here we adopt instead the host-independent dust populations from Simple-BayeSN:

$${R}_{V}^{s} \sim {\mathcal{N}}\left({\mu }_{R},{\sigma }_{R}^{2}\right),$$

(17)

$${A}_{V}^{s} \sim \mathrm{Exponential}(1/\tau ),$$

(18)

with μ_R, σ_R, τ ∈ γ global parameters (Extended Data Table 1).

Cosmological distance

The extrinsic absolute (rest-frame B-band) magnitude ${M}_{\mathrm{ext}}^{s}$ is then transformed into an apparent (still rest-frame B-band) magnitude m^s through the usual distance-modulus relation:

$${m}^{s}={M}_{\mathrm{ext}}^{s}+{\mu }^{s}\,\,\,\,\,\mathrm{with}\,\,\,\,\,{\mu }^{s}=\mu ({z}_{{\rm{c}}}^{s},c),$$

(19)

where ${z}_{{\rm{c}}}^{s}$ is the cosmological redshift of the supernova, and ${\mathcal{C}}$ are the cosmological parameters. In this study, we use Λ-cold dark matter, which is described by the present-day dimensionless densities of cold dark matter and dark energy in the form of a cosmological constant: ${\mathcal{C}}\equiv [{\varOmega }_{{\rm{m}}0},{\varOmega }_{\Lambda 0}]$. Just as when modelling galaxies, we disregard peculiar velocities, including those of the supernovae within their hosts, which are negligible at high redshifts and best accounted for at the level of LCs. To account for the motion of nearby objects (z ≲ 0.03), CIGaRS can be seamlessly extended with an appropriate Bayesian hierarchical model (for example, refs. ^121,122,123), which would allow the incorporation of the parameters controlling large-scale structure formation^124,125 within the unified framework.

Observables of type Ia supernovae

Observations of supernovae take the form of multi-band LCs: collections of flux measurements in different filters at irregularly spaced times, which vary from supernova to supernova. However, cosmological analyses are typically performed after the LCs have been summarized (independently of one another) by subtracting the constant contribution of the host and fitting an LC model, which produces parameter estimates $\widehat{x}({\bf{LC}})$, $\widehat{c}({\bf{LC}})$ and $\widehat{m}({\bf{LC}})$ and fit uncertainties ${\widehat{\sigma }}_{x}({\bf{LC}})$, ${\widehat{\sigma }}_{c}({\bf{LC}})$ and ${\widehat{\sigma }}_{m}({\bf{LC}})$. It is then assumed that these summary statistics are related to the extrinsic supernova parameters ${x}_{{\rm{ext}}}^{s},{c}_{{\rm{ext}}}^{s}$ and m^s by Gaussian sampling distributions:

$${\widehat{x}}^{s} \sim {\mathcal{N}}\left({x}_{\mathrm{ext}}^{s},{({\widehat{\sigma }}_{x}^{s})}^{2}\right),$$

(20)

$${\widehat{c}}^{s} \sim {\mathcal{N}}\left({c}_{\mathrm{ext}}^{s},{({\widehat{\sigma }}_{c}^{s})}^{2}\right),$$

(21)

$${\widehat{m}}^{s} \sim {\mathcal{N}}\left({m}^{s},{({\widehat{\sigma }}_{m}^{s})}^{2}\right).$$

(22)

As the uncertainties depend only on instrumental properties, like noise and cadence, their distributions can be robustly determined and treated as fixed simulator inputs. We used the model of Boyd et al.¹²⁶, which is based on the results of current surveys and LSST simulations:

$$\ln{\widehat{\sigma }}_{x}^{s} \sim {\mathcal{N}}\left(-1.5,{0.5}^{2}\right),$$

(23)

$$\ln{\widehat{\sigma }}_{c}^{s} \sim {\mathcal{N}}\left(-3.5,{0.3}^{2}\right),$$

(24)

$$\ln{\widehat{\sigma }}_{m}^{s} \sim {\mathcal{N}}\left(0.1({m}^{s}-56),{0.6}^{2}\right).$$

(25)

We note that these values assume that the LC fits have been performed given a precise redshift measurement from spectroscopy of the supernova or the host. In its absence, for example, when inferring the redshift from the LC itself¹²⁷, the fit uncertainties will be inflated, as both colour and (to a lesser extent) stretch are a posteriori correlated with redshift⁸⁰ (Fig. 6). This would require adjusting the above distributions accordingly.

It is also possible to extend this description to a dense covariance matrix that represents the systematic correlations between supernovae, as demonstrated in SICRET. However, with hierarchical modelling and SBI, these can, instead, be accounted for explicitly in the simulator, thus replacing the likelihood-centric covariance formulation. Issues related to summary statistics can also be circumvented altogether by extending the simulation and analysis to full LCs, as demonstrated in SIDE-real.

Detection and selection of type Ia supernovae

The detection and inclusion of supernovae in modern cosmological analyses is a complex process that considers the raw data quality, the goodness of LC fits and the estimated summaries. Using host observations (for example, to derive redshifts or to study host–type Ia supernova connections) relies on a further procedure that identifies and associates them correctly (for example, refs. ^128,129). A big advantage of our framework is that arbitrary detection, selection and association (and classification) criteria can be straightforwardly integrated into the forward model and accounted for with SBI, as recently shown in STAR NRE.

As the present study does not specifically focus on selection effects, we adopt the same simple supernova detection and selection procedure as in STAR NRE (Section 3.2 in ref. ⁴⁹). Thus, we derive from the expected LSST observing conditions a probability $p({S}^{s}| {\widehat{m}}^{s},{z}^{s})$ that depends on the observed brightness of the supernova and its redshift (due to the different detection limits in the different (observer-frame) bands in which the peak occurs). We treat as ‘detected and selected’ all host–type Ia supernova pairs (‘objects’) where both S^h and S^s are sampled true. We label their count N_sel and treat it as an observable, as in STAR NRE. Finally, the output of the simulator is a set of length-12 vectors that combine the host- and supernova-related observables for each of the detected and selected objects:

$$D\equiv {\left\{\left({{\bf{d}}}^{h(i)},{\widehat{m}}^{s},{({\widehat{\sigma }}_{m}^{i})}^{2},{\widehat{x}}^{s},{({\widehat{\sigma }}_{x}^{i})}^{2},{\widehat{c}}^{s},{({\widehat{\sigma }}_{c}^{i})}^{2}\right)\right\}}_{i=1}^{{N}_{\mathrm{sel}}}.$$

(26)

Implementation details and simulated counts

Here we discuss two technical details of our forward simulator.

First, we note that Prospector relies on the cosmological model to map redshift to age: $T({z}_{{\rm{c}}},{\mathcal{C}})$. In principle, this calculation can be repeated for each sampled cosmology in our simulator; however, we do not expect this to have a noticeable effect on our results, as type Ia supernovae are mainly informative of cosmological distances rather than times; that is, for values of ${\mathcal{C}}$ consistent with a given sample of type Ia supernovae, $T({z}_{{\rm{c}}},{\mathcal{C}})$ does not vary significantly with ${\mathcal{C}}$. Therefore, we treat T(z_c) as fixed to that consistent with the cosmology from the Wilkinson Microwave Anisotropy Probe¹³⁰, as when training and creating Prospector-β.

The second point concerns the number of objects that we simulate, which, in principle, depends on (and, hence, is informative of) the astrophysical and cosmological models (for example, through a parameter S₈), as well as the surveyed sky area (Ω). At present, our high-level supernova-focused simulator does not include these details, as supernova observations do not carry information about this connection, beyond the distribution of their redshifts, which we assume is extracted mainly from the hosts.

Hence, the purely host-related part of CIGaRS has no free global parameters up to the point where the redshifted (observer-frame) spectral flux

$${\varPhi }_{{\rm{o}}}^{h}({\lambda }_{{\rm{o}}})\equiv \frac{{\varPhi }^{h}({\lambda }_{{\rm{o}}}/(1+{z}^{h}))}{{(1+{z}^{{h}})}^{3}}$$

(27)

is converted to the spectral flux density through cosmological distance, as in equation (4). This means that we can generate a fixed ‘bank’ of galaxies and associated observables—noiseless absolute ugrizy magnitudes obtained by integrating equation (27) through the respective band-pass filter—to serve as potential hosts when simulating supernovae. We choose a bank size of 1,000,000 and note that this represents the total population of galaxies (N_gal).

However, as we already noted in Fig. 3, the distribution of transient hosts is significantly skewed towards high-mass galaxies due to the abundance of prospective progenitors within them. As these are a minority in the total population and we use a fixed simulation bank, we, thus, run the risk of repeatedly including the same objects in our training catalogues, which could cause the network to become overfitted. To prevent this, we modified the distribution from which we sampled galaxy properties when generating the simulation bank to more closely represent the final distribution of type Ia supernova hosts. Specifically, we modified the stellar mass function in Prospector-β:

$$p({M}_{* },{z}_{{\rm{c}}})\to \frac{{M}_{* }}{1+{z}_{{\rm{c}}}}p({M}_{* },{z}_{{\rm{c}}}),$$

(28)

motivated by equation (8). We then exactly undid this when calculating the number of expected supernovae within each host by dividing equation (8) by the same factor, which preserves $\left\langle {N}_{{\rm{SN}}}\right\rangle$ as a function of mass and redshift.

Then, when compiling a mock survey, we calculated the apparent brightnesses of the galaxies (given ${\mathcal{C}}$), added noise and evaluated detection as in section ‘Photometry and detection of hosts’. We then seeded only the detected and selected subset with type Ia supernovae as described in section ‘Occurrence of type Ia supernovae’. This is an implicit selection criterion on the supernovae: we only considered supernovae for which we can observe and uniquely identify the host (with certainty); for our (current) method to be applicable, we need to apply the same cut to the real data, but this is usually not a stronger requirement than the cuts typically applied on the supernova data. For the default cosmological model and DTD (Extended Data Table 1), only about 39% of the galaxies are selected, and of these, the type Ia supernova rate is ~2,500 yr⁻¹, of which about 1/3 pass detection and selection (for the default population parameters and our toy model of LSST).

Last, we need to set a survey duration (${\mathcal{T}}$ equation (8)), but this is only well defined in combination with Ω or a physically meaningful N_gal, which we replaced with the fixed size of the galaxy ‘bank’. Therefore, we set an arbitrary scaling in equation (8) so as to achieve (on expectation) a given number of detected and selected objects, for example, 1,600 or 16,000. Importantly, as Ω and ${\mathcal{T}}$ are well known for real surveys, we can easily extend the simulator to properly calculate the galaxy and type Ia supernova counts when analysing real data. In the present set-up, we can apply the same scaling when generating training data as for the test mocks.

Set-based TMNRE for hierarchical models

Bayesian hierarchical models feature a large number of free parameters, which scales with the number of objects observed. Accurate, precise and fast inference from future data is, thus, a considerable computational challenge that likelihood-based methods are ill-suited to address. Moreover, the likelihood requires explicit calculations of intractable probabilities, often leading to ad hoc approximations. SBI addresses all these issues—scalability, realism and rigour—by delivering marginal Bayesian posteriors, given data D₀, for any parameters of interest θ, implicitly integrated over all nuisance parameters (ν) and relevant stochastic processes (for example, selection and other systematic effects):

$$p({\boldsymbol{\theta }}| {\mathbf{D}}_{0})\propto p({\boldsymbol{\theta }})p({\mathbf{D}}_{0}| {\boldsymbol{\theta }})=p({\boldsymbol{\theta }})\int p({\mathbf{D}}_{0}| {\boldsymbol{\nu }},{\boldsymbol{\theta }})\,p({\boldsymbol{\nu }}| {\boldsymbol{\theta }})\,{\rm{d}}{\boldsymbol{\nu }}.$$

(29)

SBI requires only samples from the prior (p(θ)) and the marginal likelihood (p(D∣θ)), which are provided by a stochastic forward simulator that represents the Bayesian joint model p(θ, D). There are different techniques (flavours of neural SBI) for using simulated pairs (θ, D) to train a NN and later perform inference from observed data D₀. We adopt the approach called neural ratio estimation⁷⁴, as it offers the greatest freedom in the choice of the NN architecture and in choosing priors for training and evaluation. It trains a network $\widehat{\mathrm{r}}({\boldsymbol{\theta }},\mathbf{D})$ to approximate the single real number

$$r({\boldsymbol{\theta }},\mathbf{D})\equiv \frac{p({\boldsymbol{\theta }},\mathbf{D})}{p({\boldsymbol{\theta }})p(\mathbf{D})}=\frac{p({\boldsymbol{\theta }}| \mathbf{D})}{p({\boldsymbol{\theta }})}$$

(30)

by minimizing the binary cross-entropy loss commonly used for classification tasks. Once trained, $\widehat{\mathrm{r}}({\boldsymbol{\theta }},{\mathbf{D}}_{0})$ evaluated with the observed data can simply be multiplied by the prior or used to reweight prior samples to represent the target posterior, according to the second equality in equation (30).

In principle, θ can represent any group of parameters that we wish to derive a posterior for. However, scientific interpretations are usually based on one- or two-dimensional marginal distributions, and consequently, we will define the following groups of (global) parameters of interest γ_g:

one-dimensional: ${M}_{0},{\sigma }_{0},\alpha ,\beta ,{\alpha }_{c},{M}_{* }^{{\rm{step}}}$;
two-dimensional: [A, b] for the DTD and [Ω_m0, Ω_Λ0] for cosmology;
two-dimensional: $[\Delta M,{\gamma }_{{Z}_{* }}]$, $[\Delta M,{\gamma }_{{t}_{* }}]$, and $[{\gamma }_{{Z}_{* }},{\gamma }_{{t}_{* }}]$, forming all combinations of the host–type Ia supernova connection parameters: a so-called corner plot.

For each of them, we train—simultaneously—separate ratio estimators ${\widehat{{\rm{r}}}}_{g}$, following shared data preprocessing.

Object-specific parameters

In addition to the global parameters, we will—simultaneously—train ratio estimators for all ${\mathcal{O}}\left({N}_{\mathrm{sel}}\right)$ object-specific (local) parameters in the hierarchical model (here we will use the unified label λⁱ, which represents g^h and λ^s in CIGaRS collectively). Although we can, as before, form local-parameter groups λ_g from parameters for the same object, we will infer each object-specific parameter λ_g ∈ {z_c, M_*, Z_*, δ, τ₂, x_int, c_int} marginally. Although in Fig. 6 we show only a subset of the results that represent scientific interest in the present study, estimating all local parameters during training helps extract informative features and ultimately improves the inference of the global parameters.

In SICRET and SIDE-real, we demonstrated simultaneous marginal inference of all {λⁱ} for models in which the dataset size N_sel was known a priori. That is, it could be fixed after a survey is performed because selection effects were not considered. Here we extend the approach to simulators that produce datasets with various N_sel, which introduces two complications.

The first concerns identifying the objects. Within Bayesian hierarchical modelling, the {λⁱ} are a priori independent and identically distributed, conditional on the set of global parameters γ, and each influences the sampling distribution of only one observed ‘object’; that is we have the general model structure:

$$p(\gamma ,\{{\lambda }^{i}\},\{{{\bf{d}}}^{i}\})=\left[\mathop{\prod }\limits_{i=1}^{{N}_{\mathrm{sel}}}p({{\bf{d}}}^{i}| {\lambda }^{i},{\boldsymbol{\gamma }})p({\lambda }^{i}| {\boldsymbol{\gamma }})\right]p(\gamma ).$$

(31)

Previously, we used a fixed ordered collection of auxiliary variables ${[{{\bf{a}}}^{i}]}_{i=1}^{{N}_{{\rm{sel}}}}$ to disambiguate the assignment of labels i, which effectively individualized the sampling distributions:

$$p({{\bf{d}}}^{i}| {\lambda }^{i},{\boldsymbol{\gamma }})\to p({{\bf{d}}}^{i}| {{\bf{a}}}^{i},{\lambda }^{i},{\boldsymbol{\gamma }})\to {p}_{i}({{\bf{d}}}^{i}| {\lambda }^{i},{\boldsymbol{\gamma }}).$$

(32)

However, the present simulator produces unordered (exchangeable) sets of a priori undetermined sizes, and so the auxiliary variables need to become an output of the model, that is be incorporated into the observable dⁱ. Indeed, CIGaRS explicitly models the host photometry (from which the redshift is ultimately derived) and the observational (fitted) uncertainties, which comprised aⁱ in SICRET and SIDE-real. Thus, we can treat λⁱ and dⁱ as realizations of singular random variables λ and d:

$$\left[\mathop{\prod }\limits_{i}p({\bf{d}}={{\bf{d}}}^{i}| {\boldsymbol{\lambda }}={\lambda }^{i},{\boldsymbol{\gamma }})p({\boldsymbol{\lambda }}={\lambda }^{i}| {\boldsymbol{\gamma }})\right]p({\boldsymbol{\gamma }}),$$

(33)

rather than collections of separate (independent and identically distributed) random variables, which allows us to train a single network to represent all posteriors:

(34)

where we have ignored the dependence on the full {dⁱ}, as it is approximately redundant with the truncation of global parameters, as argued for SICRET.

Second, a technical complication arises as we need training pairs

$${\boldsymbol{\lambda }},{\bf{d}} \sim p({\boldsymbol{\lambda }},{\bf{d}})=\int \,p({\boldsymbol{\lambda }},{\bf{d}}| {\boldsymbol{\gamma }})p({\boldsymbol{\gamma }})\,{\rm{d}}{\boldsymbol{\gamma }},$$

(35)

but the simulator samples in proportion to $\left\langle {N}_{\mathrm{sel}}\right\rangle({\boldsymbol{\gamma }}) \times p({\boldsymbol{\gamma }})$ rather than simply p(γ). This is not a problem when N_sel is a fixed input and can otherwise be rectified by randomly selecting a fixed number N_sub of objects from the simulator output. In fact, N_sub := 1 would be the natural choice in line with the above treatment of {λⁱ} and {dⁱ} as realizations of λ and d, and we chose this when generating a validation set, which also represents the prior p(λ). Still, by setting a larger N_sub, we can cheaply (without extra simulation) enlarge the effective batch size for training local-parameter inference networks, and so we chose N_sub = 100 in this case. Finally, when evaluating the results for the N_sel,0 objects in D₀, we simply skipped the subselection and used the full set (N_sub := N_sel,0).

Network architecture: conditioned deep set++

The NN we use (depicted, together with details of the sizes of its various layers, in Fig. 4) is based on the conditioned deep-set architecture from STAR NRE (following Zaheer et al.⁷⁶) but augmented in depth as in Zhang et al.⁷⁵ due to the sophistication of the simulator and the inference tasks and with the addition of local-parameter estimators as in SICRET and SIDE-real.

Given a dataset D ≡ {dⁱ} with cardinality N_sel, we first preprocess each element dⁱ with a small feed-forward network to (automatically) derive (nonlinear) ‘features’:

$${{\mathtt{d}}}^{i}={\mathtt{DataPre}}({{\bf{d}}}^{i}),$$

(36)

which are generally useful across all inference tasks. These may include galaxy colours, (fiducially) standardized supernova magnitudes, observational uncertainty (from the provided variances) and estimates of object-specific parameters directly usable in the respective downstream ratio estimators. Although they are not forced to be directly ‘meaningful’ to a human scientist, we plan to explore and interpret these features in future work. The {dⁱ} are then used as the input for all ratio estimators.

Following Zaheer et al.’s representation theorem⁷⁶ and the considerations laid out in STAR NRE, the estimator for a group of global parameters θ_g makes use of two feed-forward networks ${\widehat{\phi }}_{{\rm{g}}}$ and ${\widehat{\rho }}_{{\rm{g}}}$ and takes the form:

$$\ln\,{\widehat{\mathrm{r}}}_{{\rm{g}}}({\boldsymbol{\theta}}_{{\rm{g}}},\mathbf{D})\equiv {\widehat{\rho }}_{{\rm{g}}}({\boldsymbol{\theta}}_{{\rm{g}}},{\mathtt{S}}_{{\rm{g}}},{N}_{\mathrm{sel}}),$$

(37)

with

$${\mathtt{S}}_{{\rm{g}}}\equiv \frac{1}{{N}_{\mathrm{sel}}}\mathop{\sum}\limits_{i}{\hat{\phi}}_{{\rm{g}}}\left({\boldsymbol{\theta}}_{{\rm{g}}},\frac{{{\bf{d}}}^{i}-{\mathtt{m}}}{{\mathtt{s}}},{\mathtt{m}},{\mathtt{s}}\right),$$

(38)

where m and s are, respectively, the mean and standard deviation of the {dⁱ}. This implements the SetNorm operation advocated by Zhang et al.⁷⁵ and explicitly preserves the information contained in the set moments m and s. For expressivity (which allows the set aggregation to be a simple averaging operation), ${\widehat{\phi }}_{{\rm{g}}}$ is a deep residual network that implements ‘skip connections’ as proposed by Zhang et al.⁷⁵ after combining its inputs $(\boldsymbol{\theta}_{\rm{g}},\frac{{\bf{d}}^{i}-{\mathtt{m}}}{\mathtt{s}},{\mathtt{m}},{\mathtt{s}})$ into one vector through a single hidden network layer (Fig. 4). Finally, ${\widehat{\rho }}_{{\rm{g}}}$ similarly combines its inputs (θ_g, S_g, N_sel) and feeds them through a few fully connected layers to output the single number $\ln\,{\widehat{{\rm{r}}}}_{{\rm{g}}}(\boldsymbol{\theta }_{{\rm{g}}},\mathbf{D})$.

The ratio estimator for a local or object-specific parameter λ_g is simpler, as it does not need to aggregate information across the set, and so it consists of a single feed-forward network:

$$\ln\,{\widehat{{\rm{r}}}}_{{\rm{g}}}({\lambda }_{{\rm{g}}},{{\bf{d}}}^{i})={\mathtt{LocalNRE}}_{{\rm{g}}}({\lambda }_{{\rm{g}}},{{\bf{d}}}^{i}),$$

(39)

which can be applied in parallel over the N_sub input data at the same time.

Truncation and fine-tuning

Neural SBI is amortized: once the network is trained, results can be quickly derived from numerous simulated (or real, if available) data realizations. While this can be exploited to verify and calibrate inference (see, for example, refs. ^57,58), that would require training over a wide range of data, which can be inefficient if one is interested only in the results for a single dataset D₀.

In such cases, the simulation budget, training time and network capacity can be optimized through various sequential SBI strategies, which modify (either continuously or in a succession of stages) the prior $p({\boldsymbol{\theta }})\to \widetilde{p}({\boldsymbol{\theta }})$ from which parameters are drawn, based on intermediate results during training. A simple yet effective prescription is prior truncation⁷³, in which the shape or form of $\widetilde{p}({\boldsymbol{\theta }})$ is unchanged, but its support is restricted to a region T(D₀) in which the posterior density is significantly different from zero:

$${\boldsymbol{\theta }}\notin T({D}_{0})\,\Rightarrow \,p({\boldsymbol{\theta }}| {D}_{0})\approx 0,$$

(40)

as approximated by a previously trained network evaluated at D₀. Thus, the only modification needed is a trivial renormalization to account for the excluded prior probability mass:

$$p(\boldsymbol{\theta})\to {\widetilde{p}}_{T(\mathbf{D}_{0})}({\boldsymbol{\theta }})=\left\{\begin{array}{l}\frac{p({\boldsymbol{\theta }})}{{\int }_{T(\mathbf{D}_{0})}p({\boldsymbol{\theta }})\,{\rm{d}}{\boldsymbol{\theta }}}\,\,\,\,\,\,\mathrm{if}\,{\boldsymbol{\theta }}\in T(\mathbf{D}_{0}),\\ \begin{array}{cc}0, & \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\mathrm{otherwise},\end{array}\end{array}\right.$$

(41)

which leaves the posterior (given D₀) unchanged while restricting simulated examples to more closely resemble D₀.

We apply truncation separately for each global inference group. In one dimension, we use a simple contiguous interval that contains 99.99% approximate credibility and then sample training examples by analytically modifying the (simple, whether uniform or normal) prior. For two-dimensional groups, we either define an iso-likelihood contour (which again contains 99.99% of approximate posterior mass) and use rejection sampling within it or apply one-dimensional truncation separately for the two parameters (we truncate to an axis-aligned rectangular region) when they are not strongly correlated. The truncation regions are illustrated with grey shading in Fig. 5.

Once new training data are generated, we resume—instead of re-initializing—training, which means that the network is simply fine-tuned to give more accurate results in the ‘zoomed-in’ parameter space, instead of being forced to relearn the analysis from scratch. This is especially relevant for the preprocessor and deep-set featurizers (described below), whose computations should not appreciably depend on the parameter ranges.

Training details

Analysing the mock data with ~1,600 objects required two rounds of truncation (starting from the priors in Extended Data Table 1 and with the network in Fig. 4) for the results to converge (judged by the difference between posteriors from successive rounds). In each round, we generated 64,000 full example datasets (with 6,400 more for validation and to represent the priors when evaluating results) and trained using the Adam optimizer¹³¹ with learning rate 10⁻³ (reduced to 10⁻⁴ when fine-tuning) for at most 200 epochs, although the loss typically plateaus earlier. On two NVIDIA A100 GPUs with a combined mini-batch size of 64, one stage takes ~24 h, with most of the computation taken up by backpropagating through all 11 deep sets (for the separate global-parameter groups).

For the larger (N_sel ≈ 16,000) dataset, we took the final network from above and fine-tuned it on simulations from the same final truncated prior region but with a tenfold increase in output size (on expectation). As the deep-set architecture processes the full dataset at once, training it now required roughly ten times more compute and memory, so we used eight GPUs and a batch size of 32, which again required 24 h per truncation stage (we needed only one in this case).

Finally, for each global-parameter group γ_g, we picked the checkpoint with the lowest validation loss for the specific group, whereas for object-specific parameters λ_g, we picked the best checkpoint overall (lowest loss averaged over all groups). To represent the posteriors, we evaluated $\widehat{\mathrm{r}}({{\boldsymbol{\theta }}}_{{\rm{g}}},\mathbf{D}_{0})$ over the parameter samples in the validation set, which represent p(θ_g), and used the results as importance weights for plotting contours and calculating posterior moments.

Data availability

Scripts with the implementation of CIGaRS, trained networks, mock data and results are available via GitHub at https://github.com/snai-analysis/cigars and are available via Zenodo at https://doi.org/10.5281/zenodo.18705764 (ref. ¹³²).

Code availability

CIGaRS is based on the Clipppy convenience layer for inference and probabilistic programming in Python (https://github.com/kosiokarchev/clipppy). The forward simulator is implemented in Pyro¹³³ and makes use of the cosmographic utilities of phytorch (https://github.com/kosiokarchev/phytorch), the prospector code (https://github.com/bd-j/prospector) and the pretrained Speculator-α emulator (https://github.com/justinalsing/speculator), embedded in SLICsim (https://github.com/kosiokarchev/slicsim). The NNs were defined and trained using PyTorch¹³⁴ and PyTorch Lightning¹³⁵.

References

Sullivan, M. et al. The Hubble diagram of type Ia supernovae as a function of host galaxy morphology. Mon. Not. R. Astron. Soc. 340, 1057–1075 (2003).
Article ADS Google Scholar
Sullivan, M. et al. Rates and properties of type Ia supernovae as a function of mass and star formation in their host galaxies. Astrophys. J. 648, 868–883 (2006).
Article ADS Google Scholar
Aubourg, É. et al. Evidence of short-lived SN Ia progenitors. Astron. Astrophys. 492, 631–636 (2008).
Article ADS Google Scholar
Neill, J. D. et al. The local hosts of type Ia supernovae. Astrophys. J. 707, 1449–1465 (2009).
Article ADS Google Scholar
Brandt, T. D. et al. The ages of type Ia supernova progenitors. Astron. J. 140, 804–816 (2010).
Article ADS Google Scholar
Lampeitl, H. et al. The effect of host galaxies on type Ia supernovae in the SDSS-II supernova survey. Astrophys. J. 722, 566–576 (2010).
Article ADS Google Scholar
Sullivan, M. et al. The dependence of type Ia supernovae luminosities on their host galaxies. Mon. Not. R. Astron. Soc. 406, 782–802 (2010).
ADS Google Scholar
Childress, M. et al. Host galaxy properties and Hubble residuals of type Ia supernovae from the nearby supernova factory. Astrophys. J. 770, 108 (2013).
Article ADS Google Scholar
Johansson, J. et al. SN Ia host galaxy properties from Sloan Digital Sky Survey-II spectroscopy. Mon. Not. R. Astron. Soc. 435, 1680–1700 (2013).
Article ADS Google Scholar
Lee, Y.-W. et al. Evidence for strong progenitor age dependence of type Ia supernova luminosity standardization process. Mon. Not. R. Astron. Soc. 517, 2697–2708 (2022).
Article ADS Google Scholar
Chung, C. et al. On the root cause of the host ‘mass step’ in the Hubble residuals of type Ia supernovae. Astrophys. J. 959, 94 (2023).
Article ADS Google Scholar
Chung, C., Park, S., Son, J., Cho, H. & Lee, Y.-W. Strong progenitor age bias in supernova cosmology. I. Robust and ubiquitous evidence from a larger sample of host galaxies in a broader redshift range. Mon. Not. R. Astron. Soc. 538, 3340–3350 (2025).
Article ADS Google Scholar
Rigault, M. et al. Evidence of environmental dependencies of type Ia supernovae from the nearby supernova factory indicated by local Hα. Astron. Astrophys. 560, A66 (2013).
Article Google Scholar
Rigault, M. et al. Confirmation of a star formation bias in type Ia supernova distances and its effect on the measurement of the Hubble constant. Astrophys. J. 802, 20 (2015).
Article ADS Google Scholar
Rigault, M. et al. Strong dependence of type Ia supernova standardization on the local specific star formation rate. Astron. Astrophys. 644, A176 (2020).
Article Google Scholar
Jones, D. O., Riess, A. G. & Scolnic, D. M. Reconsidering the effects of local star formation on type Ia supernova cosmology. Astrophys. J. 812, 31 (2015).
Article ADS Google Scholar
Jones, D. O. et al. Should type Ia supernova distances be corrected for their local environments? Astrophys. J. 867, 108 (2018).
Article ADS Google Scholar
Moreno-Raya, M. E. et al. Using the local gas-phase oxygen abundances to explore a metallicity dependence in SNe Ia luminosities. Mon. Not. R. Astron. Soc. 462, 1281–1306 (2016).
Article ADS Google Scholar
Moreno-Raya, M. E. et al. On the dependence of type Ia SNe luminosities on the metallicity of their host galaxies. Astrophys. J. 818, L19 (2016).
Article ADS Google Scholar
Kim, Y.-L., Smith, M., Sullivan, M. & Lee, Y.-W. Environmental dependence of type Ia supernova luminosities from a sample without a local-global difference in host star formation. Astrophys. J. 854, 24 (2018).
Article ADS Google Scholar
Kim, Y.-L., Kang, Y. & Lee, Y.-W. Environmental dependence of type Ia supernova luminosities from the YONSEI supernova catalog. J. Korean Astron. Soc. 52, 181–205 (2019).
ADS Google Scholar
Roman, M. et al. Dependence of type Ia supernova luminosities on their local environment. Astron. Astrophys. 615, A68 (2018).
Article Google Scholar
Rose, B. M., Garnavich, P. M. & Berg, M. A. Think global, act local: the influence of environment age and host mass on type Ia supernova light curves. Astrophys. J. 874, 32 (2019).
Article ADS Google Scholar
Kelsey, L. et al. The effect of environment on type Ia supernovae in the Dark Energy Survey three-year cosmological sample. Mon. Not. R. Astron. Soc. 501, 4861–4876 (2021).
Article ADS Google Scholar
Kelsey, L. et al. Concerning colour: the effect of environment on type Ia supernova colour in the Dark Energy Survey. Mon. Not. R. Astron. Soc. 519, 3046–3063 (2023).
Article ADS Google Scholar
Ginolin, M. et al. ZTF SN Ia DR2: colour standardisation of type Ia supernovae and its dependence on the environment. Astron. Astrophys. 694, A4 (2025).
Article Google Scholar
Ginolin, M. et al. ZTF SN Ia DR2: environmental dependencies of stretch and luminosity for a volume-limited sample of 1000 type Ia supernovae. Astron. Astrophys. 695, A140 (2025).
Article Google Scholar
Brout, D. et al. The Pantheon+ analysis: cosmological constraints. Astrophys. J. 938, 110 (2022).
Article ADS Google Scholar
Rubin, D. et al. Union through UNITY: cosmology with 2000 SNe using a unified Bayesian framework. Astrophys. J. 986, 231 (2025).
Article ADS Google Scholar
Vincenzi, M. et al. The Dark Energy Survey supernova program: cosmological analysis and systematic uncertainties. Astrophys. J. 975, 86 (2024).
Article ADS Google Scholar
Duarte, J. et al. Assessing differences between local dust attenuation and point source extinction within the same galactic environments. Astron. Astrophys. 700, A169 (2025).
Article Google Scholar
Mandel, K. S., Scolnic, D. M., Shariff, H., Foley, R. J. & Kirshner, R. P. The type Ia supernova color–magnitude relation and host galaxy dust: a simple hierarchical Bayesian model. Astrophys. J. 842, 93 (2017).
Article ADS Google Scholar
Garn, T. & Best, P. N. Predicting dust extinction from the stellar mass of a galaxy. Mon. Not. R. Astron. Soc. 409, 421–432 (2010).
Article ADS Google Scholar
Zahid, H. J., Yates, R. M., Kewley, L. J. & Kudritzki, R. P. The observed relation between stellar mass, dust extinction, and star formation rate in local galaxies. Astrophys. J. 763, 92 (2013).
Article ADS Google Scholar
Brout, D. & Scolnic, D. It’s dust: solving the mysteries of the intrinsic scatter and host-galaxy dependence of standardized type Ia supernova brightnesses. Astrophys. J. 909, 26 (2021).
Article ADS Google Scholar
Thorp, S. & Mandel, K. S. Constraining the SN Ia host galaxy dust law distribution and mass step: hierarchical BayeSN analysis of optical and near-infrared light curves. Mon. Not. R. Astron. Soc. 517, 2360–2382 (2022).
Article ADS Google Scholar
Karchev, K., Trotta, R. & Weniger, C. SimSIMS: Simulation-based Supernova Ia Model Selection with thousands of latent variables. Preprint at http://arxiv.org/abs/2311.15650 (2023).
Grayling, M. et al. Scalable hierarchical BayeSN inference: investigating dependence of SN Ia host galaxy dust properties on stellar mass and redshift. Mon. Not. R. Astron. Soc. 531, 953–976 (2024).
Article ADS Google Scholar
Grayling, M. & Popovic, B. BayeSN and SALT: a comparison of dust inference across SN Ia light-curve models with DES5YR. Mon. Not. R. Astron. Soc. 542, 2060–2073 (2025).
Article ADS Google Scholar
Wiseman, P. et al. A galaxy-driven model of type Ia supernova luminosity variations. Mon. Not. R. Astron. Soc. 515, 4587–4605 (2022).
Article ADS Google Scholar
Wiseman, P., Sullivan, M., Smith, M. & Popovic, B. Further evidence that galaxy age drives observed type Ia supernova luminosity differences. Mon. Not. R. Astron. Soc. 520, 6214–6222 (2023).
Article ADS Google Scholar
Fontanot, F., De Lucia, G., Monaco, P., Somerville, R. S. & Santini, P. The many manifestations of downsizing: hierarchical galaxy formation models confront observations. Mon. Not. R. Astron. Soc. 397, 1776–1790 (2009).
Article ADS Google Scholar
Rubin, D. et al. UNITY: confronting supernova cosmology’s statistical and systematic uncertainties in a unified Bayesian framework. Astrophys. J. 813, 137 (2015).
Article ADS Google Scholar
Hinton, S. R. et al. Steve: a hierarchical Bayesian model for supernova cosmology. Astrophys. J. 876, 15 (2019).
Article ADS Google Scholar
Thorp, S., Mandel, K. S., Jones, D. O., Ward, S. M. & Narayan, G. Testing the consistency of dust laws in SN Ia host galaxies: a BayeSN examination of foundation DR1. Mon. Not. R. Astron. Soc. 508, 4310–4331 (2021).
Article ADS Google Scholar
Kessler, R. & Scolnic, D. Correcting type Ia supernova distances for selection biases and contamination in photometrically identified samples. Astrophys. J. 836, 56 (2017).
Article ADS Google Scholar
Shariff, H., Jiao, X., Trotta, R. & van Dyk, D. A. BAHAMAS: new analysis of type Ia supernovae reveals inconsistencies with standard cosmology. Astrophys. J. 827, 1 (2016).
Article ADS Google Scholar
Eddington, A. S. On a formula for correcting statistics for the effects of a known error of observation. Mon. Not. R. Astron. Soc. 73, 359–360 (1913).
Article ADS Google Scholar
Karchev, K. & Trotta, R. STAR NRE: solving supernova selection effects with set-based truncated auto-regressive neural ratio estimation. J. Cosmol. Astropart. Phys. 2025, 031 (2025).
Article Google Scholar
Lee, Y.-W., Chung, C., Kang, Y. & Jee, M. J. Further evidence for significant luminosity evolution in supernova cosmology. Astrophys. J. 903, 22 (2020).
Article ADS Google Scholar
Nicolas, N. et al. Redshift evolution of the underlying type Ia supernova stretch distribution. Astron. Astrophys. 649, A74 (2021).
Article Google Scholar
Thorp, S. et al. pop-cosmos: scaleable inference of galaxy properties and redshifts with a data-driven population model. Astrophys. J. 975, 145 (2024).
Article ADS Google Scholar
Popovic, B. et al. ZTF SN Ia DR2: evidence of changing dust distribution with redshift using type Ia supernovae. Astron. Astrophys. 694, A5 (2025).
Article Google Scholar
Childress, M. J., Wolf, C. & Zahid, H. J. Ages of type Ia supernovae over cosmic time. Mon. Not. R. Astron. Soc. 445, 1898–1911 (2014).
Article ADS Google Scholar
Cranmer, K., Brehmer, J. & Louppe, G. The frontier of simulation-based inference. Proc. Natl Acad. Sci. USA 117, 30055–30062 (2020).
Article ADS Google Scholar
Lueckmann, J.-M., Boelts, J., Greenberg, D., Goncalves, P. & Macke, J. Benchmarking simulation-based inference. In Proc. 24th International Conference on Artificial Intelligence and Statistics, Vol. 130 (eds Banerjee, A. & Fukumizu, K.) 343–351 (PMLR, 2021).
Karchev, K. et al. SIDE-real: supernova Ia dust extinction with truncated marginal neural ratio estimation applied to real data. Mon. Not. R. Astron. Soc. 530, 3881–3896 (2024).
Article ADS Google Scholar
Karchev, K., Trotta, R. & Weniger, C. SICRET: supernova Ia cosmology with truncated marginal neural ratio estimation. Mon. Not. R. Astron. Soc. 520, 1056–1072 (2023).
Article ADS Google Scholar
Leja, J., Johnson, B. D., Conroy, C., van Dokkum, P. G. & Byler, N. Deriving physical properties from broadband photometry with Prospector: description of the model and a demonstration of its accuracy using 129 galaxies in the local Universe. Astrophys. J. 837, 170 (2017).
Article ADS Google Scholar
Wang, B. et al. Inferring more from less: Prospector as a photometric redshift engine in the era of JWST. Astrophys. J. 944, L58 (2023).
Article ADS Google Scholar
Greggio, L. The rates of type Ia supernovae. I. Analytical formulations. Astron. Astrophys. 441, 1055–1078 (2005).
Article ADS Google Scholar
Palicio, P. A., Matteucci, F., Della Valle, M. & Spitoni, E. Cosmic type Ia supernova rate and constraints on supernova Ia progenitors. Astron. Astrophys. 689, A203 (2024).
Article Google Scholar
Ruiter, A. J. & Seitenzahl, I. R. Type Ia supernova progenitors: a contemporary view of a long-standing puzzle. Astron. Astrophys. Rev. 33, 1 (2025).
Article ADS Google Scholar
Frohmaier, C. et al. TiDES: The 4MOST Time Domain Extragalactic Survey. Astrophys. J. 992, 158 (2025).
Article ADS Google Scholar
Sánchez, B. O. et al. The Dark Energy Survey supernova program: light curves and 5 yr data release. Astrophys. J. 975, 5 (2024).
Article ADS Google Scholar
Behroozi, P., Wechsler, R. H., Hearin, A. P. & Conroy, C. UNIVERSEMACHINE: the correlation between galaxy growth and dark matter halo assembly from z = 0–10. Mon. Not. R. Astron. Soc. 488, 3143–3194 (2019).
Article ADS Google Scholar
Gallazzi, A., Charlot, S., Brinchmann, J., White, S. D. M. & Tremonti, C. A. The ages and metallicities of galaxies in the local Universe. Mon. Not. R. Astron. Soc. 362, 41–58 (2005).
Article ADS Google Scholar
Heringer, E., Pritchet, C. & van Kerkwijk, M. H. The delay times of type Ia supernova. Astrophys. J. 882, 52 (2019).
Article ADS Google Scholar
Dilday, B. et al. A measurement of the rate of type Ia supernovae at redshift z ≈ 0.1 from the first season of the SDSS-II supernova survey. Astrophys. J. 682, 262–282 (2008).
Article ADS Google Scholar
Kessler, R. et al. Models and simulations for the Photometric LSST Astronomical Time Series Classification Challenge (PLAsTiCC). Publ. Astron. Soc. Pac. 131, 094501 (2019).
Article ADS Google Scholar
Malmquist, K. G. On some relations in stellar statistics. Meddelanden fran Lunds Astronomiska Observatorium Series I 100, 1–52 (1922).
ADS Google Scholar
Malmquist, K. G. A contribution to the problem of determining the distribution in space of the stars. Meddelanden fran Lunds Astronomiska Observatorium Series I 106, 1–12 (1925).
ADS Google Scholar
Miller, B. K., Cole, A., Forré, P., Louppe, G. & Weniger, C. Truncated marginal neural ratio estimation. In Proc. Advances in Neural Information Processing Systems, Vol. 34 (eds Ranzato, M. et al.) 129–143 (Curran Associates, 2021).
Hermans, J., Begy, V. & Louppe, G. Likelihood-Free MCMC with amortized approximate ratio estimators. In Proc. 37th International Conference on Machine Learning, Vol. 119 (eds Daumé, H. & Singh, A.) 4239–4248 (PMLR, 2020).
Zhang, L., Tozzo, V., Higgins, J. & Ranganath, R. Set norm and equivariant skip connections: putting the deep in deep sets. In Proc. 39th International Conference on Machine Learning, Vol. 162 (eds Chaudhuri, K. et al.) 26559–26574 (PMLR, 2022).
Zaheer, M. et al. Deep sets. In Proc. Advances in Neural Information Processing Systems, Vol. 30 (eds Guyon, I. et al.) (Curran Associates, 2017).
Wiseman, P. et al. Rates and delay times of Type Ia supernovae in the Dark Energy Survey. Mon. Not. R. Astron. Soc. 506, 3330–3348 (2021).
ADS Google Scholar
Leistedt, B., Alsing, J., Peiris, H., Mortlock, D. & Leja, J. Hierarchical Bayesian inference of photometric redshifts with stellar population synthesis models. Astrophys. J. Suppl. Ser. 264, 23 (2023).
Article ADS Google Scholar
Merz, G. et al. DeepDISC-photoz: deep learning-based photometric redshift estimation for Rubin LSST. Open J. Astrophys. 8, 40 (2025).
Article ADS Google Scholar
Chen, R. C. et al. Evaluating cosmological biases using photometric redshifts for type Ia supernova cosmology with the Dark Energy Survey supernova program. Mon. Not. R. Astron. Soc. 536, 1948–1966 (2025).
Article ADS Google Scholar
Kriek, M. & Conroy, C. The dust attenuation law in distant galaxies: evidence for variation with spectral type. Astrophys. J. 775, L16 (2013).
Article ADS Google Scholar
Fitzpatrick, E. L. Correcting for the effects of interstellar extinction. Publ. Astron. Soc. Pac. 111, 63–75 (1999).
Article ADS Google Scholar
Panter, B., Heavens, A. F. & Jimenez, R. Star formation and metallicity history of the SDSS galaxy survey: unlocking the fossil record. Mon. Not. R. Astron. Soc. 343, 1145–1154 (2003).
Article ADS Google Scholar
Heavens, A., Panter, B., Jimenez, R. & Dunlop, J. The star-formation history of the Universe from the stellar populations of nearby galaxies. Nature 428, 625–627 (2004).
Article ADS Google Scholar
Panter, B., Jimenez, R., Heavens, A. F. & Charlot, S. The star formation histories of galaxies in the Sloan Digital Sky Survey. Mon. Not. R. Astron. Soc. 378, 1550–1564 (2007).
Article ADS Google Scholar
Mo, H., van den Bosch, F. & White, S. Galaxy Formation and Evolution (Cambridge Univ. Press, 2010).
Johnson, B. D., Leja, J., Conroy, C. & Speagle, J. S. Stellar population inference with Prospector. Astrophys. J. Suppl. Ser. 254, 22 (2021).
Article ADS Google Scholar
Tojeiro, R., Heavens, A. F., Jimenez, R. & Panter, B. Recovering galaxy star formation and metallicity histories from spectra using VESPA. Mon. Not. R. Astron. Soc. 381, 1252–1266 (2007).
Article ADS Google Scholar
Leja, J., Carnall, A. C., Johnson, B. D., Conroy, C. & Speagle, J. S. How to measure galaxy star formation histories. II. Nonparametric models. Astrophys. J. 876, 3 (2019).
Article ADS Google Scholar
Bravo, E. & Badenes, C. Is the metallicity of their host galaxies a good measure of the metallicity of type Ia supernovae? Mon. Not. R. Astron. Soc. 414, 1592–1606 (2011).
Article ADS Google Scholar
Calzetti, D. et al. The dust content and opacity of actively star-forming galaxies. Astrophys. J. 533, 682–695 (2000).
Article ADS Google Scholar
Noll, S. et al. Analysis of galaxy spectral energy distributions from far-UV to far-IR with CIGALE: studying a SINGS test sample. Astron. Astrophys. 507, 1793–1813 (2009).
Article ADS Google Scholar
Alsing, J., Peiris, H., Mortlock, D., Leja, J. & Leistedt, B. Forward modeling of galaxy populations for cosmological redshift distribution inference. Astrophys. J. Suppl. Ser. 264, 29 (2023).
Article ADS Google Scholar
Mosenkov, A. V. et al. Dust emission profiles of DustPedia galaxies. Astron. Astrophys. 622, A132 (2019).
Article Google Scholar
Hahn, C. et al. IQ Collaboratory. III. The empirical dust attenuation framework-taking hydrodynamical simulations with a grain of dust. Astrophys. J. 926, 122 (2022).
Article ADS Google Scholar
Zhang, J. et al. Dust attenuation, dust content, and geometry of star-forming galaxies. Mon. Not. R. Astron. Soc. 524, 4128–4147 (2023).
Article ADS Google Scholar
Baes, M. et al. The TNG50-SKIRT Atlas: post-processing methodology and first data release. Astron. Astrophys. 683, A181 (2024).
Article Google Scholar
Hill, R. et al. Projected distances to host galaxy reduce SNIa dispersion. Mon. Not. R. Astron. Soc. 481, 2766–2777 (2018).
Article ADS Google Scholar
Toy, M. et al. Reduction of the type Ia supernova host galaxy step in the outer regions of galaxies. Mon. Not. R. Astron. Soc. 538, 181–197 (2025).
Article ADS Google Scholar
Alsing, J. et al. SPECULATOR: emulating stellar population synthesis for fast and accurate galaxy spectra and photometry. Astrophys. J. Suppl. Ser. 249, 5 (2020).
Article ADS Google Scholar
Ménard, B., Scranton, R., Fukugita, M. & Richards, G. Measuring the galaxy-mass and galaxy-dust correlations through magnification and reddening. Mon. Not. R. Astron. Soc. 405, 1025–1039 (2010).
ADS Google Scholar
Johansson, J. & Mörtsell, E. Combined constraints on intergalactic dust from quasar colours and the soft X-ray background. Mon. Not. R. Astron. Soc. 426, 3360–3368 (2012).
Article ADS Google Scholar
Schlafly, E. F. & Finkbeiner, D. P. Measuring reddening with Sloan Digital Sky Survey stellar spectra and recalibrating SFD. Astrophys. J. 737, 103 (2011).
Article ADS Google Scholar
Green, G. M., Schlafly, E., Zucker, C., Speagle, J. S. & Finkbeiner, D. A 3D dust map based on Gaia, Pan-STARRS 1, and 2MASS. Astrophys. J. 887, 93 (2019).
Article ADS Google Scholar
Zhang, R., Yuan, H. & Chen, B. An R_V map of the Milky Way revealed by LAMOST. Astrophys. J. Suppl. Ser. 269, 6 (2023).
Article ADS Google Scholar
Dharmawardena, T. E. et al. All-sky three-dimensional dust density and extinction maps of the Milky Way out to 2.8 kpc. Mon. Not. R. Astron. Soc. 532, 3480–3498 (2024).
Article ADS Google Scholar
Burke, D. L. et al. Forward global photometric calibration of the Dark Energy Survey. Astron. J. 155, 41 (2018).
Article ADS Google Scholar
Bianco, F. B., Jones, L., Ivezić, Ž. & Ritz, S. Updated Estimates of the Rubin System Throughput and Expected LSST Image Depth Technical Report No. PSTN-054 (Vera C. Rubin Observatory, 2022).
Korytov, D. et al. CosmoDC2: a synthetic sky catalog for dark energy science with LSST. Astrophys. J. Suppl. Ser. 245, 26 (2019).
Article ADS Google Scholar
LSST Dark Energy Science Collaboration (LSST DESC) The LSST DESC DC2 Simulated Sky Survey. Astrophys. J. Suppl. Ser. 253, 31 (2021).
Article ADS Google Scholar
Sánchez, B. O. et al. SNIa cosmology analysis results from simulated LSST images: from difference imaging to constraints on dark energy. Astrophys. J. 934, 96 (2022).
Article ADS Google Scholar
Petrecca, V. et al. Recovered supernova Ia rate from simulated LSST images. Astron. Astrophys. 686, A11 (2024).
Article Google Scholar
Duan, Y., Li, X., Avestruz, C., Regier, J. & LSST Dark Energy Science Collaboration Neural posterior estimation for cataloging astronomical images from the Legacy Survey of Space and Time. Astron. J. 171, 112 (2026).
Article Google Scholar
Chen, M. C., Herwig, F., Denissenkov, P. A. & Paxton, B. The dependence of the evolution of type Ia SN progenitors on the C-burning rate uncertainty and parameters of convective boundary mixing. Mon. Not. R. Astron. Soc. 440, 1274–1280 (2014).
Article ADS Google Scholar
Heringer, E. et al. Type Ia supernovae: colors, rates, and progenitors. Astrophys. J. 834, 15 (2017).
Article ADS Google Scholar
Marquardt, K. S. et al. Type Ia supernovae from exploding oxygen-neon white dwarfs. Astron. Astrophys. 580, A118 (2015).
Article Google Scholar
Dhawan, S. et al. ZTF SN Ia DR2: cosmology-independent constraints on type Ia supernova standardisation from supernova siblings. Astron. Astrophys. 702, A190 (2025).
Article Google Scholar
Poetrodjojo, H. et al. The SAMI Galaxy Survey: reconciling strong emission line metallicity diagnostics using metallicity gradients. Mon. Not. R. Astron. Soc. 502, 3357–3373 (2021).
Article ADS Google Scholar
Mosleh, M., Riahi-Zamin, M. & Tacchella, S. Reconstructing star formation histories of high-redshift galaxies: a comparison of resolved parametric and nonparametric models. Astrophys. J. 983, 181 (2025).
Article ADS Google Scholar
March, M. C., Trotta, R., Berkes, P., Starkman, G. D. & Vaudrevange, P. M. Improved constraints on cosmological parameters from yype Ia supernova data. Mon. Not. R. Astron. Soc. 418, 2308–2329 (2011).
Article ADS Google Scholar
Rahman, W., Trotta, R., Boruah, S. S., Hudson, M. J. & van Dyk, D. A. New constraints on anisotropic expansion from supernovae type Ia. Mon. Not. R. Astron. Soc. 514, 139–163 (2022).
Article ADS Google Scholar
Piras, D., Sorrenti, F., Durrer, R. & Kunz, M. Anchors no more: using peculiar velocities to constrain H₀ and the primordial Universe without calibrators. J. Cosmol. Astropart. Phys. 2025, 005 (2025).
Article Google Scholar
Tsaprazi, E. & Heavens, A. F. Field-level inference of H₀ from simulated type Ia supernovae in a local Universe analogue. Mon. Not. R. Astron. Soc. 539, 1448–1457 (2025).
Article ADS Google Scholar
Carreres, B. et al. Type Ia supernova growth-rate measurement with LSST simulations: intrinsic scatter systematics. Astrophys. J. 994, 178 (2025).
Article ADS Google Scholar
Rosselli, D. et al. Forecast for a growth-rate measurement using peculiar velocities from LSST supernovae. Astron. Astrophys. 701, A119 (2025).
Article Google Scholar
Boyd, B. M., Grayling, M., Thorp, S. & Mandel, K. S. Accounting for selection effects in supernova cosmology with simulation-based inference and hierarchical Bayesian modelling. Preprint at http://arxiv.org/abs/2407.15923 (2024).
Kessler, R. et al. Photometric estimates of redshifts and distance moduli for type Ia supernovae. Astrophys. J. 717, 40–57 (2010).
Article ADS Google Scholar
Gupta, R. R. et al. Host galaxy identification for supernova surveys. Astron. J. 152, 154 (2016).
Article ADS Google Scholar
Gagliano, A., Narayan, G., Engel, A., Carrasco Kind, M. & LSST Dark Energy Science Collaboration GHOST: using only host galaxy information to accurately associate and distinguish supernovae. Astrophys. J. 908, 170 (2021).
Article ADS Google Scholar
Hinshaw, G. et al. Nine-year Wilkinson Microwave Anisotropy Probe (WMAP) observations: cosmological parameter results. Astrophys. J. Suppl. Ser. 208, 19 (2013).
Article ADS Google Scholar
Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. Preprint at http://arxiv.org/abs/1412.6980 (2017).
Karchev, K. snai-analysis/cigars: initial release. Zenodo https://doi.org/10.5281/zenodo.18705764 (2026).
Bingham, E. et al. Pyro: deep universal probabilistic programming. J. Mach. Learn. Res. 20, 973–978 (2019).
Google Scholar
Paszke, A. et al. PyTorch: an imperative style, high-performance deep learning library. In Proc. Advances in Neural Information Processing Systems, Vol. 32 (eds Wallach, H. et al.) 8024–8035 (Curran Associates, 2019).
Falcon, W. & The PyTorch Lightning Team PyTorch Lightning. Zenodo https://doi.org/10.5281/zenodo.18432694 (2023).

Download references

Acknowledgements

R.T. acknowledges funding from Next Generation EU through the National Recovery and Resilience Plan, Investment PE1 under Project FAIR (Future Artificial Intelligence Research), which is co-financed by the Next Generation EU (Grant No. DM 1555/11.10.22). R.T. is partially supported by the Fondazione ICSC, Spoke 3 ‘Astrophysics and Cosmos Observations’ through the National Recovery and Resilience Plan under Project ID CN00000013, ‘Italian Research Center on High-Performance Computing, Big Data and Quantum Computing’, which is funded by MUR Missione 4 Componente 2 Investimento 1.4: Potenziamento strutture di ricerca e creazione di ‘campioni nazionali di R&S (M4C2-19)’ through Next Generation EU. Funding for the work of R.J. was partially provided by Project PID2022-141125NB-I00 through Grant No. CEX2024-001451-M, which is funded by MICIU/AEI/10.13039/501100011033. This work was supported by a grant from the Simons Foundation (Grant Award ID BD-Targeted-00017375, UB).

Funding

Open access funding provided by Scuola Internazionale Superiore di Studi Avanzati - SISSA within the CRUI-CARE Agreement.

Author information

Authors and Affiliations

Theoretical and Scientific Data Science, International School for Advanced Studies (SISSA), Trieste, Italy
Konstantin Karchev & Roberto Trotta
Institute of Cosmos Sciences (ICC), University of Barcelona, Barcelona, Spain
Konstantin Karchev & Raúl Jiménez
Department of Physics, Imperial College London, London, UK
Roberto Trotta
Institute for Fundamental Physics of the Universe (IFPU), Trieste, Italy
Roberto Trotta
Italian Research Center on High-Performance Computing, Big Data and Quantum Computing (ICSC), Casalecchio di Reno, Italy
Roberto Trotta
Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain
Raúl Jiménez

Authors

Konstantin Karchev
View author publications
Search author on:PubMed Google Scholar
Roberto Trotta
View author publications
Search author on:PubMed Google Scholar
Raúl Jiménez
View author publications
Search author on:PubMed Google Scholar

Contributions

K.K. led the research and its implementation with guidance from R.T. and R.J. All authors contributed to producing the paper.

Corresponding author

Correspondence to Konstantin Karchev.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Astronomy thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Inference results for (true/intrinsic) stretch and colour of the 1578 SNe Ia in our mock data.

Top: Marginal posterior means and standard deviations vs. the true values in the simulation, as in Fig. 6. Bottom: the posterior standard deviations (sizes of the error bars in the top plots) vs. the observational uncertainties (for example from a light-curve fit), which we treat as part of the analysed data. The colour scheme corresponds to the true parameter values (the abscissa in the top plots). Note the effect of shrinkage for objects with large observational uncertainty: the posteriors are more strongly constrained (smaller st. dev.) than the scatter in the data ($\widehat{\sigma }$) due to the known population distribution. Note further that inference of SN colour is complicated by the presence of dust, which manifests as an increase in posterior uncertainty, and that this effect is larger for intrinsically redder objects, for which the colour can be attributed to dust reddening.

Extended Data Fig. 2 Inference with a mis-specified model.

Marginal posterior densities and regions with 39 % and 86 % marginal posterior credibilities, respectively, for one and two-dimensional global-parameter groups (as in Fig. 5), derived with the same networks used in the main text but from mock observations generated with the F99⁸² dust law instead of KC13⁸¹, which was used for training. The resulting bias is most pronounced for parameters affecting the Hubble diagram (M₀, σ₀, and cosmology) and increases when the data set is larger and more constraining. Note that we have not retrained the model using constrained priors appropriate for the new data, which results in some posteriors (M₀, σ₀, and cosmology) appearing at the edges of the respective (truncated) prior regions: one of the signs that the model may be mis-specified. With SBI, one can include the systematic uncertainty from modelling choices through model averaging during training, that is by simulating with a diverse set of dust laws, etc.

Extended Data Fig. 3 Change in the results for object-specific parameters.

Shift of the marginal posterior means plotted against the true parameter values and coloured by true dust-law parameter δ when mock data is generated with the F99⁸² dust law but analysed with that of KC13⁸¹ (as in Extended Data Fig. 2). Changes are only mildly significant at the individual-object level: compare with the scatter from the true values, which we show as 1-, 2-, and 3-σ shaded areas as in the middle panels of fig. 6 (compare also with the uncertainties in the bottom panels there), but being correlated, they can dramatically affect the inference of global parameters.

Extended Data Table 1 Parameters in the model, their (hierarchical) priors, and the values used to simulate the analysed mock data

Full size table

Supplementary information

Peer Review File (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Karchev, K., Trotta, R. & Jiménez, R. CIGaRS I: combined simulation-based inference from type Ia supernovae and host photometry. Nat Astron (2026). https://doi.org/10.1038/s41550-026-02842-5

Download citation

Received: 21 August 2025
Accepted: 13 March 2026
Published: 06 May 2026
Version of record: 06 May 2026
DOI: https://doi.org/10.1038/s41550-026-02842-5

Subjects

Abstract

Similar content being viewed by others

A gravitationally lensed supernova with an observable two-decade time delay

Binary progenitor systems for Type Ic supernovae

A spectroscopic quadruple as a possible progenitor of sub-Chandrasekhar type Ia supernovae

Main

Results

Predictions from the unified forward model

Physical host–type Ia supernova correlations and induced stellar-mass dependence

Redshift distribution and Hubble diagram

Simultaneous SBI

Global and population parameters

Object-specific parameters

Conclusion and outlook

Methods

Unified forward modelling of galaxies and type Ia supernovae

Host model

SFH and chemical enrichment: Prospector-β

Host dust extinction

Photometry and detection of hosts

Occurrence of type Ia supernovae

Model for type Ia supernovae

Causal host–type Ia supernova connections

Intrinsic properties and dust extinction of type Ia supernovae: Simple-BayeSN

Cosmological distance

Observables of type Ia supernovae

Detection and selection of type Ia supernovae

Implementation details and simulated counts

Set-based TMNRE for hierarchical models

Object-specific parameters

Network architecture: conditioned deep set++

Truncation and fine-tuning

Training details

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Extended Data Fig. 1 Inference results for (true/intrinsic) stretch and colour of the 1578 SNe Ia in our mock data.

Extended Data Fig. 2 Inference with a mis-specified model.

Extended Data Fig. 3 Change in the results for object-specific parameters.

Supplementary information

Peer Review File (download PDF )

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links