Abstract
Materials databases built from calculations based on density functional approximations play an important role in the discovery of materials with improved properties. Most databases thus constructed rely on the generalized gradient approximation (GGA) for electron exchange and correlation. This limits the reliability of these databases, as well as that of the artificial intelligence (AI) models trained on them, in particular for materials and properties which are not accurately described by GGA. Here, we describe a database of 7,024 inorganic materials presenting diverse structures and compositions. Crucially, the database was generated using hybrid functional calculations,efficiently implemented in the all-electron code FHI-aims. The database is used to evaluate the thermodynamic and electrochemical stability of oxides relevant to catalysis and energy related applications. We illustrate how the database can be used to train AI models for material properties using the sure-independence screening and sparsifying operator (SISSO) approach.
Similar content being viewed by others
Background & Summary
Materials serve as the cornerstone of critical economic sectors, including transportation, health, information technology and energy. Hence, the discovery of materials with improved properties is crucial. Materials databases (e.g., Materials Project1, OQMD2, AFLOW3, JARVIS4, Materials Cloud5) constructed with high-throughput calculations based on density functional approximations (DFAs) play a key role in accelerating materials discovery. For instance, these databases enable the construction of artificial intelligence (AI) models for material properties6,7. However, the usefulness of such databases, and consequently the applicability of AI models trained on them, are constrained by the accuracy of the underlying theoretical assumptions used to generate the data. As a result, the AI models might fail when deployed to explore materials and properties that are not well described by the chosen DFA. For example, electron exchange correlation (XC) functionals under the generalized gradient approximation (GGA) such as the Perdew-Burke-Ernzerhof (PBE)8, face accuracy limitations for estimating the electronic properties, especially for systems with localized electronic states such as transition-metal oxides9,10,11. Recently, high-throughput studies using the meta-GGA functional SCAN12 or its regularized variants13 and non-local, range-separated hybrid functional HSE0614,15 have been used for constructing materials databases which, to some extent, address the accuracy limitations of GGA. However, many of these studies have focused on specific material properties (e.g., band gaps) or system types (e.g., binary solids). Additionally, most of them employed plane-wave basis sets with pseudopotential approximation for core electrons, which may face accuracy and transferability challenges across materials with diverse crystal structures16,17. All-electron calculations using beyond-GGA DFAs can provide more reliable data, and databases constructed from such calculations could play a crucial role in developing AI models with enhanced generalizability.
Here, we present a database of 7,024 materials constructed from all-electron DFT calculations with hybrid functional for XC with a focus on oxides relevant for catalysis and energy related applications. The construction of this database has been enabled by the scalability-improved implementation in FHI-aims which allows to perform hybrid functional calculations at feasible costs for a large number of materials18. We also illustrate how the database facilitates the development of AI models for material properties using the symbolic-regression based Sure-Independence Screening and Sparsifying Operator (SISSO)19 approach, which identifies the key parameters correlated with material properties, making the model interpretable.
Methods
The database consists of 7,024 materials. We considered M-O-H (M-N-O-H) as the reference chemical system for selecting binary (ternary) materials as we are interested in evaluating their thermodynamic and electrochemical stability. This involves oxides (M-O/M-N-O), intermetallics (M-N), hydrides (M-H/N-H/M-N-H), hydroxides (M-O-H/N-O-H/M-N-O-H) and elemental solids (M/N). The initial crystal structures for these materials are queried from the Inorganic Crystral Structure Database (ICSD, v2020). Since ICSD consists of a large number of materials with the same composition (duplicate entries or polymorphs), filtering of entries for which the ICSD-id is associated with at least one Materials Project ID (MP-id, v2023.11.1) is done based on the lowest energy/atom criteria according to MP (GGA/GGA+U) data. In case there is no MP-id found for a formula, the ICSD entry with the lowest number of atoms in the unit cell is considered. Notably, we imposed no restrictions on unit cell sizes or crystal prototypes, resulting in structures with up to 616 atoms per unit cell. A schematic representation of the computational workflow used for constructing the database is provided in Fig. 1.
For the selected structures, geometry optimizations are performed with the PBEsol functional20 as it provides an accurate estimation of lattice constants of solids21. Using the structures optimized with PBEsol, we performed HSE06 energy evaluation and electronic structure calculations, as HSE06 gives more accurate electronic properties22,23. Our previous studies confirmed that HSE06 provides only slight improvements in lattice constants with respect to GGA functionals9,10. For both PBEsol and HSE06, properties such as the electronic band structure, density of states, Hirshfeld charges are also computed. All the calculations are performed using the all-electron code FHI-aims using numerically atom-centered orbtial (NAO) basis sets24. The “light” settings are chosen for NAO basis sets, as our previous study10 has demonstrated that they offer a reasonable trade-off between accuracy and computational efficiency for formation energies compared to “tight” or “really tight” settings. A computational workflow based on the Taskblaster framework25 is used to automate the multiple tasks involved in curating the data. A convergence criterion of 10−3 eV/Å is considered for forces. For all the potentially magnetic structures (ie., either labelled as magnetic in MP or containing elements such as Fe, Ni, Co, etc.), spin-polarized calculations are performed. Our DFT-HSE06 calculations failed to converge for 198 materials, with the majority (167) containing 3d- or 4f-elements. This is not unexpected, as HSE06 electronic-structure calculations are known to exhibit challenging convergence behaviour due to their higher sensitivity to localized states, often requiring denser k-point sampling or more sophisticated computational settings. In spin-polarized calculations, HSE06 may also favor different spin configurations than GGA, leading to adjustments in lattice geometry for different magnetic orders. Furthermore, HSE06 calculations can encounter multiple energy minima with comparable energies, necessitating case-specific parameter tuning. Implementing such careful, system-dependent adjustments is non-trivial within the high-throughput framework of this study, and we therefore excluded these cases.
Data Records
The electronic structure data from FHI-aims calculations can be accessed at the NOMAD archive26. The SQLite3 ASE databases and computed properties in tabular format can be downloaded from the figshare repository27. In Table 1, we provide a description of the properties computed using PBEsol and HSE06.
Figure 2a reflects the chemical diversity of the materials in the database. The materials of the database present elements covering the majority of the periodic table except actinides, noble gases and elements with the atomic number ≥ 80. Figure 2b shows the distribution of space groups (determined using spglib28 with tolerance of 10−5 Å) across the database. The most frequent spacegroups observed are P21/c, Pnma, Fm-3m, C2/m, and P63mc. As shown in Fig. 2c, we observe that a significant proportion of materials contain at least one transition metal. Figure 2d illustrates the distribution of the number of atoms per unit cell (limited to 200 for ease of visualization). While the majority of materials have 25 or fewer atoms in their unit cell, 14% of materials have unit cells with more than 50 atoms.
(a) Periodic table style heatmap showing the frequency of elements constituting the materials present in the database, distribution of (b) space group indexes, (c) material types further classified by containing (red) and lacking (blue) transition metals, (d) number of atoms in the unit cell across the materials in the database.
Technical Validation
We first compared the formation energies and band gaps computed using PBEsol and HSE06. The formation energies are calculated by considering bulk phases for the elements except oxygen for which gaseous O2 is considered as the reference phase. In general, HSE06 provides lower formation energies compared to PBEsol as shown in Fig. 3a. A mean absolute deviation (MAD) of 0.15 eV/atom is found between the formation energies calculated by the two methods. Compared to formation energies, a higher disparity (MAD = 0.77 eV) is observed in the band gaps estimated by PBEsol and HSE06, as evident from Fig. 3b with HSE06 showing a shift in band gap values toward higher ranges. This is expected owing to the well-known underestimation of band gaps by GGA methods which is partially corrected by HSE06. For 342 materials, PBEsol estimates a metallic character whereas HSE06 provides a band gap value >= 0.5 eV. We also performed a benchmarking against experimental band gap data for binary systems curated by Borlido et al.29. For the 121 materials common to both our dataset and theirs (determined by MP-id), PBEsol yields a mean absolute error (MAE) of 1.35 eV, which improves by over 50% with HSE06 (0.62 eV). For a few materials, PBEsol predicts a larger band gap, while HSE06 unexpectedly yields a metallic behavior with zero band gap. This discrepancy arises because the calculations are spin-unconstrained, allowing PBEsol and HSE06 to converge to different spin configurations (e.g., MgV2O5, Rb2V3O8). Note that the limited availability and significant uncertainties of existing experimental data preclude a comprehensive benchmarking of formation energies and band gaps against experimental measurements. This limitation is further compounded by the challenges high-throughput simulations face in accurately capturing magnetic ordering and defect-related effects under experimental conditions30,31.
To understand whether the accuracy differences in formation energies translate to thermodynamic stability, we constructed the convex hull phase diagrams (CPDs) of the materials in the database. We choose Li-Al and Co-Pt-O as representative chemical systems for binary and ternary compositions and their CPDs are shown in Fig. 4. Distinct CPDs are obtained by PBEsol and HSE06 for these systems. For example, Li2Al is stable with PBEsol but slightly unstable by 4 meV with HSE06 and similarly, Co(PtO3)2 is unstable with PBEsol by 11 meV but stable with HSE06. The critical decomposition reactions (decomposition in the CPD with the highest positive reaction energy) are determined for all the phases present in the CPDs along with the associated decomposition energy (ΔHd) as a quantitative metric of phase stability. For Li-Al, all the decomposition reactions are identical for both PBEsol and HSE06. However, different critical decomposition reactions are identified for Pt3O4, Co3O4 and CoPtO2 from the Co-Pt-O CPD. Some phases such as PtO6 are found to have the same critical decomposition reaction but with significantly differing ΔHd values for PBEsol (0.17 eV/atom) and HSE06 (0.5 eV/atom).
To analyze the phase stability prediction trends between PBEsol and HSE06, we plot the distribution of ΔHd for all the materials in the database, obtained from both methods in Fig. 5a. 78% and 75% of the considered materials are found having ΔHd ≤ 50 meV/atom for PBEsol and HSE06, respectively. The higher ΔHd values for the remaining materials despite them being synthesized (ie., associated with an ICSD-id) could be attributed to synthetic conditions which are not accounted for by the zero-Kelvin DFT stability prediction. The general tendency of HSE06 estimating lower formation energies does not lead to more materials being stable in the corresponding CPD, as many of these materials decompose into binary or ternary compounds. It is important to note that this analysis is limited by the absence of experimental data for certain phases and also by the exclusion of polymorphs, both of which are crucial for constructing more accurate CPDs.
In addition to phase stability, we have also determined the electrochemical stability of the materials as this is relevant for their potential applicability to areas such as electrocatalysis, corrosion resistance, batteries etc. This is carried out by constructing Pourbaix diagrams32,33 which provide information regarding the material’s stability across different redox conditions within a potential-pH coordinate space, electrochemical decomposition pathways, passivation and corrosion regions. Representative PBEsol and HSE06 calculated Pourbaix diagrams for Ru and Ag-Rh chemical systems by considering RuO2 and AgRhO2 as the target materials of interest are given in Fig. 6. These oxides have been identified to be stable as part of our previous study employing the DFT-HSE06 method10. For both systems, we observe qualitative differences in the Pourbaix diagrams obtained across the methods. In the case of Ru, the stability domain of both RuO2 and \({\text{Ru}}_{3}^{+}\) are enlarged in the HSE06 Pourbaix diagram compared to PBEsol. An additional phase \({\text{RuO}}_{4}^{2-}\) is also present in the HSE06 diagram. In this study, we limit our focus to analyzing the acid-stability (hence used equivalent to electrochemical stability from hereafter) of material under conditions (pH=0 and applied potential U=1.23 V) relevant to oxygen evolution reaction (OER), an important reaction involved in electrocatalytic water splitting. As a quantitative metric of a material’s acid-stability, we computed the Pourbaix decomposition free energy (\(\Delta {G}_{pbx}^{\text{OER}}\)) at these conditions. RuO2 has a \(\Delta {G}_{pbx}^{\text{OER}}\) of -0.06 eV/atom and -0.26 eV/atom with PBEsol and HSE06, respectively, indicating both the methods predict RuO2 as acid-stable though to different extents. The Ag-Rh Pourbaix diagrams generated using PBEsol and HSE06 exhibit differences in both the stable phases present and the pH-U stability range of the common phases between the two methods. A notable difference is that PBEsol predicts AgRhO2 to be acid-unstable with \(\Delta {G}_{pbx}^{\text{OER}}\) = 0.21 eV/atom, whereas HSE06 predicts it to be stable with \(\Delta {G}_{pbx}^{\text{OER}}\) = -0.01 eV/atom. A comparison of \(\Delta {G}_{pbx}^{\text{OER}}\) calculated using PBEsol and HSE06 is provided in Fig. 5b for all the materials in the database for which Pourbaix diagrams are constructed. A MAD of 0.27 eV/atom is observed between the two methods. This indicates that the disparity in predicting the electrochemical stability is much higher compared to the phase stability which is expected due to the increased number of redox reactions under aqueous conditions. Under a stability criterion of \(\Delta {G}_{pbx}^{\text{OER}}\) <= 0.1 eV/atom, 222 materials are acid-stable with PBEsol which is 255 for HSE06.
Pourbaix diagrams of Ru (a,b) and Ag-Rh (c,d) chemical systems calculated using PBEsol (left panel) and HSE06 (right panel). An aqueous ion concentration of 10−6 M and temperature of 298.15 K are considered for the construction of Pourbaix diagrams. The chemical potentials of ions are retrieved from ASE database. The green and red regions indicate stable and unstable domains of the pH-U coordinate space with respect to the reference solid. The stability domain of H2O is shown with white dashed lines.
We also developed AI models based on the database. Specifically, we focused on predicting the HSE06 band gaps of materials using SISSO, an interpretable AI method that identifies descriptors correlated with material properties. For this, a set of primary features, ie., basic material parameters which could be potentially correlated with the target property of interest are collected. These include elemental properties (composition-weighted), as well as structural and electronic properties determined using PBEsol. Since a strong focus on the database is on energy related applications, only materials with non-zero HSE06 band gaps were considered, as these are more suitable for applications such as photovoltaics. A dataset of 3091 materials is considered for the model training. For accessing the model performance, we performed a nested cross validation (NCV) where the model validation and selection are done in the inner loop and model performance estimation is done on the outer loop. The SISSO model thus obtained is:
where Egap,PBEsol is PBEsol calculated band gap. NVAL is the number of valence orbitals, Rs is the s-orbital radii, RVAL is the valence orbital radii, EA is the electron affinity, EN is the electronegativity, AN is the atomic number, all computed as composition based weight averaged (indicated by ⟨⟩) properties. The coefficients of the SISSO model are c0 = 2.45 eV, a0 = −0.07ÅeV, a1 = −0.08 Å eV2 and a2 = −1.05. Figure 7 illustrates the distribution of test errors from NCV scheme, showing that the model achieves reasonable accuracy, with 90% of errors falling within the 7.5% of the ratio between mean test errors and the standard deviation of the band gaps in the dataset. This indicates that AI models trained on this database can achieve reliable HSE06 band gap predictions.
Distribution of nested cross-validation test errors (primary y-axis) of the SISSO model for the Egap,HSE06 prediction. In the secondary y-axis, the ratio of mean absolute error and the standard deviation of Egap,HSE06 is shown. The area between the upper and lower edges of the black box represents the interquartile range of errors, and the whiskers from bottom to top correspond to 10 percentile, median and 90 percentile errors. The mean and median of absolute errors are indicated by the cross mark and horizontal line inside the interquartile range, respectively.
The database presented in this work includes properties related to stability and electronic structure, evaluated at the HSE06 level of theory while assuming that PBEsol provides reasonable geometries for the material classes considered. However, for studying certain properties such as defects or achieving more accurate band edge alignments, consideration of HSE06-relaxed geometries may be necessary. Additionally, considerations of van der Waals interactions using advanced methods such as the nonlocal many-body dispersion34 and/or using HSE06 for relaxation9 could provide improved geometries. Also, our formation energy calculations do not account for temperature and pressure effects, which could be important for a more realistic phase diagrams under experimental conditions. Similarly, for more accurate band gap estimations, advanced hybrid functionals such as the dielectric-dependent hybrid functionals could be employed35,36. The database is expected to serve as a reference for further investigations, including these refinements. Future improvements in the quality of materials data could also be enabled by employing higher-accuracy electronic structure methods beyond hybrid functionals, such as GW approaches.
Data availability
The row electronic structure data is available at the NOMAD archive26, where users can browse and download both input and output files from the FHI-aims calculations. Alternatively, ASE-compatible databases are provided on figshare repository27, which can be queried for a given material. These databases also include materials for which the calculation output files were not successfully processed by the NOMAD parsers. The processed data of material properties is also shared in CSV and JSON formats in figshare.
Code availability
The scripts and Jupyter notebooks used for database construction, analysis, plotting, and AI model training are available via GitLab: https://gitlab.com/akhilsnair/mat_database.
References
Jain, A. et al. Commentary: The Materials Project: A materials genome approach to accelerating materials innovation. APL mater. 1 (2013).
Kirklin, S. et al. The open quantum materials database (OQMD): assessing the accuracy of DFT formation energies. Npj Comput. Mater. 1, 1–15 (2015).
Curtarolo, S. et al. AFLOW: An automatic framework for high-throughput materials discovery. Comput. Mater. Sci. 58, 218–226 (2012).
Choudhary, K. et al. The joint automated repository for various integrated simulations (JARVIS) for data-driven materials design. Npj comput. mater. 6, 173 (2020).
Talirz, L. et al. Materials Cloud, a platform for open computational science. Sci. Data 7, 299 (2020).
Batatia, I. et al. A foundation model for atomistic materials chemistry. arXiv preprint arXiv:2401.00096 (2023).
Merchant, A. et al. Scaling deep learning for materials discovery. Nature 624, 80–85 (2023).
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865 (1996).
Zhang, G.-X., Reilly, A. M., Tkatchenko, A. & Scheffler, M. Performance of various density-functional approximations for cohesive properties of 64 bulk solids. New J. Phys. 20, 063020 (2018).
Nair, A. S., Foppa, L. & Scheffler, M. Materials-discovery workflow guided by symbolic regression for identifying acid-stable oxides for electrocatalysis. Npj Comput. Mater. 11, 150 (2025).
Wang, L., Maxisch, T. & Ceder, G. Oxidation energies of transition metal oxides within the GGA+ U framework. Phys. Rev. B 73, 195107 (2006).
Schmidt, J., Wang, H.-C., Cerqueira, T. F., Botti, S. & Marques, M. A. A dataset of 175k stable and metastable materials calculated with the PBEsol and SCAN functionals. Sci. Data 9, 64 (2022).
Kingsbury, R. et al. Performance comparison of r2scan and SCAN MetaGGA density functionals for solid materials via an automated, high-throughput computational workflow. Phys. Rev. M. 6, 013801 (2022).
Kim, S. et al. A band-gap database for semiconducting inorganic materials calculated with hybrid functional. Sci. Data 7, 387 (2020).
Liu, M., Gopakumar, A., Hegde, V. I., He, J. & Wolverton, C. High-throughput hybrid-functional DFT calculations of bandgaps and formation energies and multifidelity learning with uncertainty quantification. Phys. Rev. M. 8, 043803 (2024).
Lejaeghere, K. et al. Reproducibility in density functional theory calculations of solids. Science 351, aad3000 (2016).
Mazdziarz, M. Uncertainty of DFT calculated mechanical and structural properties of solids due to incompatibility of pseudopotentials and exchange–correlation functionals. J. Chem. Theory Comput. 20, 9734–9740 (2024).
Kokott, S. et al. Efficient all-electron hybrid density functionals for atomistic simulations beyond 10,000 atoms. J. Chem. Phys. 161, 024112 (2024).
Ouyang, R., Curtarolo, S., Ahmetcik, E., Scheffler, M. & Ghiringhelli, L. M. SISSO: A compressed-sensing method for identifying the best low-dimensional descriptor in an immensity of offered candidates. Phys. Rev. M 2, 083802 (2018).
Perdew, J. P. et al. Restoring the density-gradient expansion for exchange in solids and surfaces. Phys. Rev. L 100, 136406 (2008).
Csonka, G. I. et al. Assessing the performance of recent density functionals for bulk solids. Phys. Rev. B 79, 155107 (2009).
Chevrier, V. L., Ong, S. P., Armiento, R., Chan, M. K. & Ceder, G. Hybrid density functional calculations of redox potentials and formation energies of transition metal compounds. Phys. Rev. B 82, 075122 (2010).
Garza, A. J. & Scuseria, G. E. Predicting band gaps with hybrid density functionals. J. Phys. Chem. Lett. 7, 4165–4170 (2016).
Blum, V., Gehrke, R., Hanke, F., Havu, P., Havu, V., Ren, X., Reuter, K. & Scheffler, M. Ab initio molecular simulations with numeric atom-centered orbitals. Comput. Phys. Commun. 180, 2175–2196 (2009).
Larsen, A. H., Kuisma, M. J., Boland, T. M., Nilsson, F. A. & Thygesen, K. S. Taskblaster: A generic framework for automated computational workflows. Digit. Discov. (2025).
Nair, A. S., Foppa, L., Scheffler, M. Materials database from all-electron hybrid functional DFT calculations. NOMAD Dataset. https://doi.org/10.17172/NOMAD/2025.05.01-1 (2025).
Nair, A. S., Foppa, L., Scheffler, M. Materials database from all-electron hybrid functional DFT calculations. figshare. Dataset. https://doi.org/10.6084/m9.figshare.28375538.v1 (2025).
Togo, A., Shinohara, K. & Tanaka, I. Spglib: a software library for crystal symmetry search. STAM Methods 4, 2384822 (2024).
Borlido, P. et al. Large-scale benchmark of exchange–correlation functionals for the determination of electronic band gaps of solids. J. Chem. Theory Comput. 15, 5069–5079 (2019).
Horton, M. K., Montoya, J. H., Liu, M. & Persson, K. A. High-throughput prediction of the ground-state collinear magnetic order of inorganic materials using density functional theory. Npj Comput. Mater. 5, 64 (2019).
Nieminen, R. M. Issues in first-principles calculations for defects in semiconductors and oxides. Model. Simul. Mater. 17, 084001 (2009).
Pourbaix, M. Atlas of electrochemical equilibria in aqueous solutions. NACE (1966).
Persson, K. A., Waldwick, B., Lazic, P. & Ceder, G. Prediction of solid-aqueous equilibria: Scheme to combine first-principles calculations of solids with experimental aqueous states. Phys. Rev. B 85, 235438 (2012).
Hermann, J. & Tkatchenko, A. Density functional model for van der Waals interactions: Unifying many-body atomic approaches with nonlocal functionals. Phys. Rev. Lett. 124, 146401 (2020).
Ghosh, A. et al. Accurate and efficient prediction of the band gaps and optical spectra of chalcopyrite semiconductors from a nonempirical range-separated dielectric-dependent hybrid: Comparison with many-body perturbation theory. Phys. Rev. B 109, 045133 (2024).
Chen, W., Miceli, G., Rignanese, G.-M. & Pasquarello, A. Nonempirical dielectric-dependent hybrid functional with range separation for semiconductors and insulators. Phys. Rev. M 2, 073803 (2018).
Acknowledgements
This work was funded by the ERC Advanced Grant TEC1p (European Research Council, Grant Agreement No. 740233). A.S.N would like to thank Beate Paulus and the German Research Foundation (DFG) for their support through the Walter-Benjamin Fellowship Program (project No. 540316537). This work used computing resources from the North German Supercomputing Alliance (Project ID 20814).
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
A.S.N. developed the workflow, constructed the database and analyzed the data. L.F and M.S contributed to the discussion and revision.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Nair, A.S., Foppa, L. & Scheffler, M. Materials Database from All-electron Hybrid Functional DFT Calculations. Sci Data 12, 1518 (2025). https://doi.org/10.1038/s41597-025-05867-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-025-05867-z