Introduction

Advances in the manufacturing of architected materials with intricate inner structures have ushered in a new era of engineering design and analysis1,2,3,4. Architected materials, also known as metamaterials, derive their effective properties primarily through their internal structure rather than their chemical composition5,6,7. For a given base alloy, the effective metamaterial performance can be substantially tuned through the appropriate selection of the underlying topology8,9. This enables the design of lightweight high-specific-strength materials with extraordinary functionalities10, finding applications in the mechanical11, aerospace12, automotive, and blast protection sectors, to name a few.

Until now a wide range of unit-cell metamaterial architectures have been developed7,13, several of them inspired by nature14,15,16. Diverse strut-based architectures have been developed17,18, following hierarchical and biomimetic patterns19,20. Optimizing their pattern21,22 has allowed for ultra-lightweight design spaces to be probed, concurrently controlling the effective mechanics22,23,24,25. In addition to strut-based designs, sheet-based TPMS architectures have received substantial attention because of their superior specific strength18,26,27,28,29. Their zero mean curvature architecture supports the minimization of stress concentrations, rendering them particularly favourable for structural and heat transfer applications30,31,32,33. Since Schoen’s pioneering study of the gyroid (G) lattice in the early 1970s34, more than hundred TPMS architectures have been identified26. The most widely used include Primitive, Gyroid, I-graph-Wrapped Package (IWP), and Diamond topologies. Several patterns are inspired by natural structures, such as the exoskeletons of insects (butterfly, beetle)35. Each TPMS configuration has unique stiffness, strength, and energy absorption characteristics, covering a broad spectrum of effective mechanical properties36,37,38.

Given the vast design space of metamaterial architectures, computational methods have become essential for evaluating and optimizing their performance. Several strategies have been proposed to reduce the associated computational cost, including reduced database schemes39, surrogate modeling, task parallelization40, and spectral decomposition methods41. Recent advances in neural networks and deep learning have introduced new opportunities in material science and engineering, particularly for accelerating the design and property prediction of architected materials42,43. Convolutional neural networks (CNNs) have been widely adopted for predicting the mechanical responses of composites and architected structures44. CNN-based surrogate models have demonstrated strong potential for topology optimization in TPMS architectures, such as Schoen’s Gyroid34. Furthermore, 2D CNNs have been developed for predicting the effective thermal conductivity directly from microstructural cross-sections, offering practical advantages over 3D reconstructions45. Beyond purely data-driven formulations, physics-informed neural networks and deep operator learning frameworks have simplified the modeling of multi-phase architected materials and complex mechanical responses43,46.

In systems where dendrogram-type relations among input parameters can be identified, graph-based models have been favored. Graph Neural Network (GNN) and Graph Convolutional Network (GCN) architectures exhibit exceptional modeling capabilities for a wide range of physical and chemical processes47. Variational GCNs scale linearly with the number of graph edges, learning features through connectivity matrices. Based on GCNs, autoencoders that encapsulate the mechanical properties of lattice structures that associate structural properties with lattice architectures have been developed. Modified crystal graph CNN models have accurately described the structure-property relations for shell-lattices, leveraging graph representations to enable the design and discovery of new materials with exceptional mechanical and thermal properties47. Through finite element simulations on more than 40,000 microstructural configurations, comprehensive stiffness and heat conductivity tensors have been established, demonstrating the scalability of graph-based approaches. Conditional generative adversarial networks (CGAN) have demonstrated accurate and robust predictions of fundamental mechanical properties of Voronoi-type metamaterials, including the effective Young’s modulus and Poisson’s ratio48.

The data representation strategy directly affects the performance and accuracy limits of a given modeling architecture. Different representations have been used depending on the application of interest. In modeling molecular structures, text strings49 and fingerprint vectors50 have been widely used, encoding lattice architectures into compact descriptors. Furthermore, 8-bit geometric descriptor modeling approaches51 have been proposed to create lattice patterns, often combined with long-short memory architectures. Among the diverse data representation approaches, pseudo-randomized strut geometries have been encoded as binary data (0,1) that distinguish void from solid regions and train CNN models to predict the complete stress-strain response42. Moreover, 3D, nodal-scale representation of the topological space has been shown effective in the compact storage of structural design information, which can be effectively processed through transpose CNN operations for the concurrent prediction of a wide range of effective properties43. Mixed approaches combining string-type and image-based information have also been presented. Three canonical categories have emerged for intelligent metamaterials applications: wave-based neural networks, mathematical operations, and logic operations. These categories directly execute computation, detection, and inference tasks in physical space52.

Despite the substantial progress in data-driven analysis and design of architected materials, generalizable machine-learning formulations that operate directly at the unit-cell geometry level remain scarce. Most existing approaches rely on deep-learning architectures-typically CNNs or graph-based networks-that are either topology-specific, require high-dimensional voxel or image representations, or act as black boxes with limited interpretability. As a result, they offer little insight into why particular TPMS architectures exhibit superior stiffness, shear resistance, yield strength, or thermal conductivity. Consequently, a physics- and data-based understanding of which geometric attributes fundamentally govern distinct mechanical and thermal responses has yet to be established, and the minimal set of descriptors required to capture these relations remains unknown. Moreover, predictive frameworks capable of simultaneously addressing elastic, plastic limits, and thermal properties using only CAD-level information, without reliance on topology-specific training or expensive preprocessing, are currently lacking.

The present work addresses these limitations by introducing a feature-based, interpretable machine-learning framework that links TPMS unit-cell CAD geometry to effective elastoplastic and thermal performance through a compact set of physically meaningful descriptors. Leveraging fundamental geometric features-quantifying compactness, surface area, inertia, and spatial heterogeneity-the proposed formulation enables accurate prediction of Young’s modulus, Shear modulus, Yield strength, and Thermal conductivity across a diverse set of sheet-based TPMS architectures. Crucially, the framework supports systematic explainability analysis, revealing how different geometric characteristics selectively control uniaxial, shear, plastic, and conductive responses, and establishing clear feature-importance hierarchies. Beyond predictive accuracy, the study identifies a minimal, topology-agnostic descriptor set sufficient to capture the full performance space within a narrow error margin, thereby advancing physics-motivated understanding, reducing model complexity, and providing a scalable pathway for rational design and discovery of architected materials directly from CAD data.

Results

Feature engineering of architected sheet-based TPMS materials

The unit-cell topology of architected materials is defined by a set of geometric features that uniquely characterize its architecture. For the construction of the relevant database, diverse TPMS topologies are considered. In particular, nine TPMS architectures, including Gyroid (GY), I-graph-wrapped Package (IWP), Schwartz Primitive (SCP), Fischer-Koch (FKS), Schwartz Diamond (SCD), Lidinoid (LD), Split-P (SLP), I2-Y (I2Y) and Schoen’s F-RD (FRD) are analyzed, their level set description summarized in Table 3 (Section 4.1). The different unit-cell configurations are illustrated in Fig. 1a.

Fig. 1: Overview of TPMS unit-cell topologies and geometric feature extraction.
Fig. 1: Overview of TPMS unit-cell topologies and geometric feature extraction.The alternative text for this image may have been generated using AI.
Full size image

a Unit cells of TPMS sheet-based architected materials, b Volume fraction Vf, compactness C and inner normalized surface area An. The compactness is the ratio of the material phase area to the area of a sphere B(ρ*) with equal volume, i.e \(| \Omega | =| B({\rho }^{* })| =\frac{4}{3}\pi {({\rho }^{* })}^{3}\), c Description of the algorithm for the shape irregularity metrics: The blue points belong to the spherical surface, whereas the purple ones represent their maps to the corresponding TPMS, and, d the algorithm for the pixel-based calculation of inertia features.

For each TPMS topology, a set of features is computed, including intuitive design features, such as: the unit-cell volume fraction Vf (Eq. (5)), that is the material content within a given unit cell volume, the effective normalized surface area An (Eq. (6)), defined as the total area of the inner topological pathways, normalized by the area of the unit-cell cube; and the compactness C (Eq. (7)), which measures the surface area of the material phase relative to that of a solid sphere of equal volume.

Additional features include the directional effective moments of inertia I11, I22, I12, characterizing the average resistance of the architected material to curvature-inducing loads (Eq. (8)). For their computation, a voxel representation of the unit-cell topology is elaborated, averaging the moments of inertia computed along several cross-sections of the unit-cell, as schematically depicted in Fig. 1d. The moments are normalized with respect to the reference moment of inertia value of the solid unit-cell cube for dimensional analysis consistency purposes.

Furthermore, four additional geometric features derived from point-wise comparisons of the unit-cell architecture with the surface of a sphere having the same volume fraction with the one considered are elaborated (see Section 4.1). In particular, a cloud of NS points \(G\circ {S}^{2}=\left\{{{\boldsymbol{\xi }}}^{i}\in {S}^{2}(\rho ),i=1,2,\ldots ,{N}_{S}\right\}\) distributed on a spherical surface S2(ρ) is mapped to the TPMS surface M, forming the following collection of point pairs: \(f(G\circ {S}^{2},G\circ M)=\left\{({{\boldsymbol{\xi }}},{{\bf{x}}})| {{\boldsymbol{\xi }}}\in G\circ {S}^{2}(\rho ),{{\bf{x}}}\in G\circ M,{\bf{R}}({\bf{x}};{\boldsymbol{\xi }})={\bf{0}}\right\}\). The set of point pairs allows for the computation of an average distance \(\bar{D}\), a variance Var(D), as well as a minimum min(D) and maximum distance max(D), among the unit-cell topology and the points of the sphere (Eq. (16)).

A detailed overview of the geometric features is provided in Fig. 1b–d. In particular, Fig. 1b presents the evolution of Vf for an IWP unit cell, along with examples of different compactness values relative to the limiting value C = 1 of a sphere B(ρ*). Moreover, a graphical example of the effective surface area An is provided, defining the associated surface boundaries. Figure 1c presents the shape-distance features: a Primitive minimal surface is shown together with the point clouds G(S2(ρ)) (blue) and G(M), where the distances d(xi; ξi) are defined, as well as the description of the associated metrics \(\bar{D}\), Var(D), \({D}_{\min }\), \({D}_{\max }\). Figure 1d illustrates the pixel-based calculation procedure of the inertia features I11, I22, I12, according to53. For a given TPMS unit cell, the inertia values are obtained averaging with respect to n slices along the x3 direction, where, for each slice k, the corresponding inertia integrals are approximated using the corresponding set of pixels (Eq. (9)).

A detailed statistical assessment of the selected geometric features is presented in Fig. 2a, c-e, for the considered topology set (Fig. 1a). For the analysis, the prescribed volume fraction Vf values vary in the interval [0.2, 0.5] (Fig. 1b). The Pearson correlation map in Fig. 2a indicates weak correlations among most features, with the exception of those inherently correlated, attesting to the accuracy of the underlying computations. In particular, a unity colleration among the moments of inertia I11 and I22, along the x1 and x2 axis is revealed, reflecting the centrosymmetry of most unit-cell designs. Moreover, the compactness C and effective area An metrics yield high correlation values, both of them being area-related metrics. Furthermore, the unit-cell moments of inertia are strongly correlated with the compactness C and effective area An metrics, a dependence that can be appreciated by the Steiner’s parallel axis theorem, which includes area attributes from its very definition (Eq. (8)). All remaining features exhibit comparatively lower correlations, indicating a significant contribution to the overall geometric information available for the hypermodel creation.

Fig. 2: Geometric feature analysis of TPMS sheet networks.
Fig. 2: Geometric feature analysis of TPMS sheet networks.The alternative text for this image may have been generated using AI.
Full size image

a Pearson correlation matrix between geometric descriptors with hierarchical clustering dendrogram. b Compactness C comparison across topologies for 26% and 42% Vf. c Probability density distributions of distance-related metrics on a logarithmic scale. d Statistical distributions of distance metrics (boxplots shown on log scale). e Average normalized inner surface area An across topologies. f Evolution of the moment of inertia components I11, I22, and I12 with volume fraction for each topology.

The evolution of compactness C for each topology and for two representative volume fractions, Vf = 0.26 and Vf = 0.42 is provided in polar diagrams in Fig. 2a. Higher compactness values are recorded for the low-volume case (Vf = 0.26) consistent with the fact that materials with smaller Vf exhibit relatively larger internal surface areas within the unit cell. Significant compactness variations as a function of topology are recorded, with the volume fraction to modify the values, but not the overall distribution among the different unit-cell designs. For both volume fractions, I2Y, Fischer-Koch, and Lidinoid topologies yield the highest C values.

Probability density distributions of the distance metrics (\(\bar{D}\), Var(D)) for all unit-cell designs are provided in Fig. 2c, summarized in the form of box-plots in Fig. 2d. The widest spread is observed for \({D}_{\min }\) (red-colored filled distribution and boxplot in Fig. 2c) with a mean value in the order of 0.003 and a considerable variance. In contrast, the maximum distance \({D}_{\max }\) (salmon color in Fig. 2c) presents the lowest variance variance. The mean point-wise distance \(\bar{D}\) has a value of 0.153, while the variance Var(D) a value of 0.018, being nearly an order of magnitude lower. It should be noted that the distance metrics are insensitive to the volume fraction Vf, their value controlled by the topological architecture. Indicatively, Gyroid unit-cell designs have a mean distance \(\bar{D}\) of 0.09, while IWP architectures have almost twice the value, namely 0.172 (Fig. 2c)).

The inner surface area An distribution (Fig. 2e) overall follows the pattern recorded for the compactness C metric (Fig. 2b), with I2Y, Fischer-Koch, and Gyroid, SPC architectures to yield the highest (An = 3.92, An = 3.71) and lowest (An = 2.28, An = 1.76) values, respectively. However, while the topological classification closely matches that recorded in the compactness C metric, the relative scaling among the topologies well differs. Interestingly, unit-cell architectures with low surface area An metrics are those with the highest inertia \({\widehat{I}}_{11}\) values, with SCP, Gyroid and IWP architectures to populate the upper part of Fig. 2f. Moreover, the relative density Vf dependence of \({\widehat{I}}_{11}\) is non-linear, with certain architectures to achieve high values already at moderate densities, with the actual value to be highly topology dependent.

Mechanical and thermal performance space

Comprehensive insights into the mechanical and thermal conductivity response space are provided in Fig. 3. The effective Young’s modulus \(\widehat{E}\), thermal conductivity \(\widehat{\kappa }\), shear modulus \(\widehat{G}\), and uniaxial loading yield stress \({\widehat{\sigma }}_{y}\) are evaluated for the different sheet-based TPMS architectures of Fig. 1, for relative density values as low as 20% and up to 50% through voxel-based finite element analysis (Section 4.4). Dedicated voxelization convergence analysis and validation comparisons with existing studies are provided in the Supplementary Material (S1) for completeness purposes. In the sequel, the effective performance metrics are normalized by the base material properties of solid aluminum: the Young’s and shear moduli are normalized by Eref = 69 GPa, the effective thermal conductivity by κref = 237 W/mK, while the yield stress by the yield limit of aluminum (σy = 250 MPa).

Fig. 3: Effective mechanical properties of TPMS sheet topologies.
Fig. 3: Effective mechanical properties of TPMS sheet topologies.The alternative text for this image may have been generated using AI.
Full size image

ad Density relationships between normalized mechanical and thermal properties and the geometric descriptors compactness and normalized inner surface area An. eh Violin distributions of the effective properties across topologies, aggregating all investigated volume fractions, with representative von Mises stress contour plots under (e) uniaxial elastic, (f) thermal, (g) shear, and (h) uniaxial elastoplastic loading.

The kernel density estimation (KDE) plots in Fig. 3a illustrate the variation of the average value of each effective property with respect to the Compactness and Inner Surface area metrics (An, C), across the entire geometric features space (gray background). A broad Young’s modulus distribution is recorded, with the highest average value observed for the Schoen’s F-RD topology (FRD, \(\bar{\widehat{E}}=0.21\)), and the lowest for Primitive SPC architectures, for which nearly half average values are computed (\(\bar{\widehat{E}}=0.11\)). Notably, high mean Young’s moduli are recorded for IWP (\(\bar{\widehat{E}}=0.18\)), FKS (\(\bar{\widehat{E}}=0.17\)), and SLP (\(\bar{\widehat{E}}=0.17\)) TPMS architectures, classifying them among the stiffest -on average- unit-cell patterns. The corresponding mean effective thermal conductivity distributions demonstrate a narrower spread (Fig. 3b). The lowest thermal conductivities are recorded for Lidinoid architectures (\(\bar{\widehat{\kappa }}=0.21\)), with Primitive SPC designs to furnish the highest, on-average, mean value (\(\bar{\widehat{\kappa }}=0.25\), SPC) for the lowest combination of An, C.

The shear modulus distribution well differs from that recorded for the Young’s modulus and thermal conductivity values (Fig. 3c). Primitive SPC designs appear to be the shear stiffest patterns (\(\bar{\widehat{G}}=0.06\)), despite their low Young’s modulus. Interestingly, SCD unit-cell architectures record the lowest mean shear modulus values, despite their moderate Young’s modulus and thermal conductivity (Fig. 3a, b). The distribution of the average uniaxial yield stresses (Fig. 3d) better compare with that of the Young’s modulus. However, their mean values indicate smoother topological dependencies, with the normalized bounds ranging between \({\bar{\sigma }}_{y}=0.11\) and \({\bar{\sigma }}_{y}=0.15\) for the Lidinoid and Schoen’s F-RD unit-cell architectures.

Further statistical information is provided in the violin plots of Fig. 3e–h. The plots provide insights not only into the comparative performance for each unit-cell design, but also into the property distribution of a given architecture, upon varying volume fractions Vf (vertical axis). In particular, the width of each violin distribution represents the probability density of observing a particular property value.

The Young’s modulus distributions of Fig. 3e indicate a wide range of values, as low as \(\widehat{E}=4\) and up to \(\widehat{E}=35\) MPa, with the Schoen’s F-RD architecture populating the highest moduli space and Primitive SPC the lowest. Across all unit-cell designs, the lower part of the corresponding violins are wider, indicating that the probability density distributions with lower \(\widehat{E}\) values are more frequent for the same Vf space. The representative uniaxial loading von Mises (\({\sigma }_{VM}\)) stress distributions (Fig. 3e, Section 2.2) highlight the strong differences in the internally highly stressed regions for IWP, SPC, and SCD unit-cell designs with a common Vf value of 0.30 under the same magnitude of macroscopic stress.

The distribution of effective thermal conductivities \(\widehat{\kappa }\) is smoother, with comparable violin column heights across most unit-cell designs, as well as relatively uniform probability density distributions denoted by the low variation of the violin widths (Fig. 3f). Primitive SPC and FRD furnish the highest average \(\widehat{\kappa }\) values, their distribution ranging between \(\widehat{\kappa }=0.14-0.36\) and \(\widehat{\kappa }=0.15-0.38\) W/(m K), respectively. Contour plots of the temperature gradient magnitude θ illustrate distinct thermal flux pathways among the IWP, Primitive SPC, and SCD unit-cell designs, with the overall distributions exhibiting similarities with the von Mises stress distributions of Fig. 3e.

Contrary to the rather smooth distribution of thermal conductivities, shear moduli reveal significant unit-cell dependencies (Fig. 3g). Primitive SPC and Schoen’s F-RD yield shear moduli values that range between \(\widehat{G}=0.03\) to 0.10 and \(\widehat{G}=0.025\) to 0.09, respectively, with Primitive architectures exhibiting a more uniform density distribution across the violin height. Lidinoid unit cells are comparatively shear shofter, their upper modulus range restraining below 6.939043 MPa. The associated shear von Mises \({\sigma }_{VM}\) contour plots highlight the utterly different inner unit-cell material regions primarily loaded compared to the tensile or thermal loading cases (Figs. 3g vs e, f).

The violin plots of the effective yield stresses (Fig. 3h) reveal overall comparable topology dependencies with those recorded for the Young’s modulus (Fig. 3e). FRD TPMS architectures define the upper performance bounds, achieving the highest mean value of \({\widehat{\sigma }}_{y}=0.153\). In contrast, Lidinoid unit-cell architectures exhibit the lowest mean yield stress, with \({\widehat{\sigma }}_{y}\approx 0.11\) across the investigated volume-fraction range. The accompanying yield stress contour plots reveal spatial patterns that are analogous to the von Mises stress distributions observed under uniaxial elastic loading (Fig. 3e), highlighting the strong correlation between stiffness and yield-dominated response.

Explainability analysis

Predictive modeling of the effective mechanical and thermal properties of TPMS-based architected materials was carried out using Random Forest (RF) and Extreme Gradient Boosting (XGBoost) regressors. Deep Learning (DL) modeling has been as well evaluated, relevant insights provided in the Supplementary Material (S5) for completeness purposes. The input feature vector X consists of the geometric descriptors defined in Section 4.2, excluding I22, as it is fully correlated with I11(Fig. 2):

$${\bf{X}}=({V}_{f},\,C,\,{A}_{n},\,{I}_{11},\,{I}_{12},\,\overline{D},\,{\rm{Var}}(D),\,{D}_{\max },\,{D}_{\min })\in {({{\mathbb{R}}}^{+})}^{9}$$
(1)

The regression targets are the effective Young’s modulus \(\widehat{E}\), thermal conductivity \(\widehat{\kappa }\), shear modulus \(\widehat{G}\), and yield stress \({\widehat{\sigma }}_{y}\):

$${\bf{Y}}=(\widehat{E},\,\widehat{\kappa },\,\widehat{G},\,{\widehat{\sigma }}_{y})\in {({{\mathbb{R}}}^{+})}^{4}$$
(2)

The models were trained employing stratified train-test splitting, elaborated in Section 4.2, while 5-fold cross validation was as-well performed. A schematic workflow of the machine learning training process is provided in Fig. 6. For the analyzed performance space, consisting of a total of 144 samples, RF and XGBoost models exhibited an overall higher robustness in the modeling of the effective performance compared to DL models (see Supplementary Material S5), consistently achieving high coefficients of determination (R2 > 94%) across all predicted properties. In the sequel, RF and XGBoost models are employed as the basis for feature importance and explainability analysis tasks. In particular, model interpretability is assessed using built-in feature importance metrics and SHapley Additive exPlanations (SHAP)54, which quantify the contribution of each feature to individual predictions. More information on SHAP analysis and feature interactions is provided in Section 4.3.

Entire feature set X

In the SHAP feature importance rankings of Fig. 4(a–d) developed based on the complete set of input features X (Eq. (1)), Vf dominates the prediction of all performance metrics, including the Young’s modulus \(\widehat{E}\), thermal conductivity \(\widehat{\kappa }\), and yield stress \({\widehat{\sigma }}_{y}\), except for the shear modulus \(\widehat{G}\). For \(\widehat{G}\), the moment of inertia I11 is comparable in significance to the volume fraction. For the Young’s modulus \(\widehat{E}\) and the yield stress \({\widehat{\sigma }}_{y}\), the mean absolute Vf SHAP values are more than three and six times higher than the average surface-to-sphere distance \(\bar{D}\), which ranks second. Interestingly, \(\bar{D}\) is comparatively more important for the Young’s modulus \(\widehat{E}\), than for \({\widehat{\sigma }}_{y}\). The moment of inertia I11 and compactness C populate the third and fourth positions of importance for \(\widehat{E}\), which are otherwise occupied by the maximum distance Dmax and coupling moment of inertia I12 for the yield stress metric \({\widehat{\sigma }}_{y}\).

Fig. 4: SHAP analysis of XGBoost feature importance with progressive feature exclusion.
Fig. 4: SHAP analysis of XGBoost feature importance with progressive feature exclusion.The alternative text for this image may have been generated using AI.
Full size image

ad SHAP feature importance with full feature set including volume fraction Vf. eh SHAP feature importance excluding volume fraction Vf. il SHAP feature importance excluding both Vf and principal moment of inertia I11 to reveal intrinsic contributions of remaining geometric descriptors.

The contributions of I11, Dmax and \(\bar{D}\) to the effective thermal conductivity performance \(\widehat{\kappa }\) are highlighted in the mean SHAP values of Fig. 4b, which appear as the next most important features after Vf in the corresponding importance metrics. However, for the shear modulus \(\widehat{G}\), the effective moment of inertia I11 is approximately 60% as important as the volume fraction. It is the only response metric in which the dominance of Vf is not as prominent. In the set of the most influential features for shear \(\widehat{G}\) performance, the maximum distance \({D}_{\max }\) and unit-cell compactness C occupy the third and fourth most important positions, respectively.

Higher volume fraction values monotonically increase all effective metrics, as schematically illustrated in the violin density diagrams of Fig. 5. The same monotonic dependence is recorded for the mean distance \(\bar{D}\), primarily affecting the Young’s modulus \(\widehat{E}\) and yield stress \({\widehat{\sigma }}_{y}\) (Fig. 5a, d). An analogous nonlinear increasing property dependence is observed for I11, as of its high correlation with Vf.

Fig. 5: SHAP violin plots for mechanical and thermal properties.
Fig. 5: SHAP violin plots for mechanical and thermal properties.The alternative text for this image may have been generated using AI.
Full size image

ad Top box: Young’s modulus \(\widehat{E}\), thermal conductivity \(\widehat{\kappa }\), shear modulus \(\widehat{G}\), and yield stress \({\widehat{\sigma }}_{y}\) using the full feature set. eh Bottom box: Corresponding SHAP distributions after exclusion of volume fraction Vf and principal moment of inertia I11.

Feature set X-{V f}

Excluding the volume fraction Vf from the list of features and retraining the ML model reveals a dominance for the moment of inertia I11, which ranks first for all four performance metrics (Figs. 4(e–h)). For the effective Young’s modulus \(\widehat{E}\) and yield stress \({\widehat{\sigma }}_{y}\), surface area and distance metrics appear on the 2nk to 4rth ranking positions, with An, \(\bar{D}\) and Var(D), \(\bar{D}\) to populate the second and third place, respectively. For \(\widehat{\kappa }\), the effective surface area An and distance variance Var(D) arise as the 2nd and 3rd most important features, while for the shear modulus \(\widehat{G}\), their order is inverted, with the distance variance Var(D) being more important than the effective surface area An. It should be highlighted that the moment of inertia I11 is highly correlated with the volume fraction Vf, the corresponding Pearson’s correlation metric exceeding 90% in Fig. 2.

Feature set X-{V f, I 11}

Upon the omission of both Vf and I11 from the set of characteristics (Fig. 4(i–l)), the strong correlation of compactness C with I11 (85%, Fig. 2a), and the implicit calibration of compactness C to the volume fraction (see Eq. (7)) drives its importance to the leading position. The Young’s modulus \(\widehat{E}\), and yield stress \({\widehat{\sigma }}_{y}\) continue to place the surface area An and Var(D) as the two most important features after C. Accordingly, the surface area An and distance variance Var(D) continue being the most important metrics for the effective thermal conductivity \(\widehat{\kappa }\) and shear modulus \(\widehat{G}\) (Fig. 4k), consistent with the observations made for the X-{Vf} feature set case (Fig. 4g). It should be underlined that both surface-related (An) and spatial heterogeneity (Var(D)) descriptors are strongly correlated with the mean distance \(\bar{D}\) metric (Fig. 2a), an observation justifying the comparatively lower importance placement of \(\bar{D}\) in the corresponding ranking.

The violin density diagrams of Fig. 5(e–h) verify the above-elaborated feature importance metrics in the form of distribution plots. It is noteworthy that compactness C follows a highly comparable distribution for all effective performance attributes, the distribution differences concentrated in the second and third most important feature, namely the surface area An and distance variance Var(D). While surface area is visibly more informative than distance variance Var(D) for the Young’s modulus \(\widehat{E}\) and thermal conductivity \(\widehat{\kappa }\), this is not case for the shear response (\(\widehat{G}\)) and yield limit \({\widehat{\sigma }}_{y}\) (Fig. 5g, h). Increased distance variances Var(D) lead to higher \(\widehat{G}\) and \({\widehat{\sigma }}_{y}\) values, their positive SHAP distribution density and range being directly comparable or higher of that recorded for the surface area An.

The remaining features (I12, \(\bar{D}\), \({D}_{\max }\), \({D}_{\min }\)) exhibit substantially lower contributions and act as tertiary descriptors of fine-scale geometric variation. Overall, the three-stage SHAP analysis establishes a clear hierarchy of geometric influence: (i) Vf controls global scaling of all properties, (ii) I11 captures dominant inner design structural effects, and (iii) compactness C, surface-area An and topology heterogeneity Var(D) descriptors encode the intrinsic morphological contribution to mechanical and thermal performance.

Beyond individual feature importance, interaction analysis reveals synergistic effects among geometric descriptors (Table 1). When volume fraction and I11 are excluded, compactness C becomes the dominant feature, showing significant interactions with I12 across multiple properties (correlations 0.45–0.83). This coupling is most pronounced for the shear modulus (0.83), suggesting that shape irregularity and biaxial moment distribution critically contribute to the shear response in the absence of volumetric information. For the yield stress, C demonstrates moderate interactions with I12 (0.52) and mean distance \(\bar{D}\) (0.36), while the distance variance Var(D) shows strong coupling with \(\bar{D}\) (0.46), indicating that void spacing distribution patterns govern plastic deformation onset. The shift from Vf-I11 dominated interactions to C-I12 coupling upon feature removal highlights the hierarchical nature of geometric influence on mechanical response.

Table 1 SHAP feature interactions excluding the volume fraction Vf and principal moment of inertia I11 metrics

Minimum Feature Set X min-{V f, I 11,…,D min}

A progressive feature elimination process was followed to reduce the set of input features while preserving those informative enough for predicting the effective mechanical and thermal properties. The process began by removing the metrics contributing least to the SHAP importance rankings (Figs. 4(i–l) and 5(e–h)). Removing the maximum and minimum unit-cell distances (Dmax, Dmin) and retraining the XGBoost model with the remaining five descriptors results in comparable predictive performance, with all coefficients of determination R2 exceeding 0.96 across the target properties. Further excluding the mean distance \(\bar{D}\) and the coupling moment of inertia I12 still preserves high predictive fidelity, with R2 remaining above 0.95 for all targets. Ultimately, only three features-compactness C, normalized surface area An, and distance variance Var(D)-are sufficient to capture the full performance space, with all R2 values above 0.95 (Fig. 7d in Section 4.3). Any additional reduction in the feature set leads to substantial drops in R2, confirming that this minimal set contains the essential topological information for accurate property prediction. As further evidenced by the 5-fold cross-validation results in Table 2, the minimal feature set provides higher average R2 values than the two intermediate feature-reduced configurations, highlighting its core predictive efficiency.

Table 2 Five-fold cross-validation performance (R2, mean ± standard deviation) for Random Forest (RF) and XGBoost (XGB) models across all feature configurations

Minimum feature set estimates on scarce and extrapolated data

The performance of the minimum feature set Xmin = {C, An, Var(D)} is subsequently evaluated on extreme modeling cases, namely on scarce data (a) and extrapolated data (b). In the scarce dataset case, a particularly critical case for machine learning modeling55,56, the 144 initial samples are reduced to 117, a reduction of ~ 19%, randomly removing three (3) relative density datapoints per topology. Interestingly, XGBoost modeling achieves R2 values above 94% for all performance metrics except for the yield stress (\({R}_{{\sigma }_{y}}^{2}= \sim 92 \%\)), the relevant results summarized in the Supplementary Material (S3).

In the extrapolation testing case (b), a leave-one-topology-out (b-i) and a substantial out-of-volume-fraction-bounds (b-ii) scenario were tested. In the first case (b-i), the Lidinoid (LD) topology has been completely removed from the training set and has been exclusively used for testing, training the model on the remaining TPMS topologies. As such, a completely blind test has been performed, practices at the limits or beyond the performance bounds of supervised learning methods57,58. Under this leave-one-topology-out testing setting, the model achieved an R2 of 0.82 and 0.84 for the Young’s modulus and yield stress, respectively, retaining R2 scores above 94% for the effective thermal conductivity and shear modulus. In the second out-of-volume-fraction-bounds case, the model was trained in the 20–50% volume fraction range and tested in two separate ranges: in the \(\left[0.17-0.18-0.19,0.51-0.52-0.53\right]\) for (b-ii-a), and in \(\left[0.10-0.20\right]\) for (b-ii-b). In (b-ii-a), R2 scores above 95% were obtained for all elastic properties, with the yield stress accuracy reduced to ~ 88%, while in the (b-ii-b) case, R2 elastic property scores ranged in-betwen 85 and 95%, with the yield stress accuracy reducing to 73% (see Supplementary Material S2, S4). Considering the fact that the model is evaluated in blind spaces, not favorable to supervised learning, the results highlight the strong predictive performance of the identified minimal feature set Xmin for sheet-based TPMS topologies.

Discussion

The results provide evidence that the effective mechanics of TPMS metamaterials are driven by a finite set of fundamental topological features26, with substantially low dimensionality, well below the representation spaces arising from image-based convolutional feature extraction processes. Interestingly, a set of three fundamental features that encompasses compactness, inner surface area, and distance variance metrics is sufficient to accurately predict the elastoplastic and conductive performance of sheet-based TPMS architectures with remarkable accuracy, indicating that the dominant structure–property relations can be captured upon minimal, yet high-density, information.

This significantly reduces the computational cost of prediction and yields interpretable descriptors that directly link internal geometric characteristics to macroscopic response. From a design perspective, the identification of a fundamental set of CAD-accessible features enables rapid screening and inverse exploration of architected materials without the need for the development of dedicated, case-study-specific, finite element models. The resulting machine learning models can be directly integrated into design workflows for multi-objective optimization of mechanical and thermal performance, while remaining interpretable and computationally efficient.

Nevertheless, despite the significant number of TPMS unit-cell architectures and volume fractions considered for the development of the feature-property database, all designs fall within the class of sheet-based structures. Extending the framework to additional topological classes-such as strut-based, hybrid, graded, and stochastic architectures-would enable a level of generalization that remains largely infeasible for existing machine learning formulations, thereby supporting substantially broader applicability. Within this expanded design space, however, the incorporation of additional feature descriptors-such as connectivity-related metrics and material gradient information, among others-may be necessary to adequately capture the underlying structural complexity. Such an objective would require the development of sufficiently large and representative databases, overcoming fundamental bottlenecks inherent to machine learning modeling55. In turn, enhanced and extended models could integrate additional effective performance descriptors, including effective buckling limits59, and dynamic transmissibility attributes, which have the potential to decisively expand the model’s predictive deployment across a wide range of engineering applications.

Overall, feature-based, interpretable machine learning models provide a robust and data-efficient pathway for establishing inner structure–functional property relations of architected materials. By bridging CAD-level geometry with macroscopic performance through a compact set of physically meaningful descriptors, the elaborated framework lays the foundation for rational, explainable and scalable design of multifunctional architected materials.

Methods

Triply periodic minimal surface-based metamaterials

The present study focuses on architected materials with topology that is based on Triply Periodic Minimal Surfaces (TPMS)29,60. TPMS represent a class of surfaces with zero mean curvature, i.e:

$$\begin{array}{rcl}{M}_{TPMS} & = & \left\{{\bf{x}}=({x}_{1},{x}_{2},{x}_{3})\in {{\mathbb{R}}}^{3}| \varphi ({\bf{x}})=t,H({\bf{x}})=0\right\}\\ H({\bf{x}}) & = & \frac{1}{2}\nabla \cdot \left(\frac{\nabla \varphi ({\bf{x}})}{| \nabla \varphi ({\bf{x}})| }\right)\end{array}$$
(3)

Consider the unit cell \({\Omega }_{0}={[0,L]}^{3}\in {{\mathbb{R}}}^{3}\), with L = 1, forming a metamaterial structure V by repetitions in every direction of the space. Let Ω be the part of the unit cell Ω0, that corresponds to the metamaterial with topology that can be characterized as a solid or sheet based network by the following definition:

$$\,{\rm{Sheet}}\; {\rm{network}}-{\rm{based}}\; {\rm{unit}}\; {\rm{cell}}\; :\,\Omega =\{{\bf{x}}\in {\Omega }_{0}| -t\le \varphi ({\bf{x}})\le t\}$$
(4)

Geometric features of 3D shapes

An important task for the design of a metamaterial with desired effective performance is to define accurate geometric features.

Volume Fraction: The Volume fraction Vf is defined as the fraction of the material part of Ω with respect to the volume of the unit cell, i.e.:

$${V}_{f}=\frac{| \Omega | }{| {\Omega }_{0}| }$$
(5)

where Ω = ∫ΩdΩ.

Inner Surface Area: This feature concerns the total normalized area of the inner-architecture that is enclosed in the unit cell Ω0 and is defined as:

$${A}_{n}=\frac{| \partial \Omega | }{| \partial {\Omega }_{0}| }$$
(6)

with ∂Ω0 = 6L2 denoting the total area of the unit cell.

Compactness: Compactness is metric that indicates how much is the area of surface compared to the surface of a sphere with equal volumes. Therefore, the definition of Compactness C is given by the following equation:

$$C=\frac{| \partial \Omega | }{| \partial B({\rho }^{* })| }$$
(7)

which an adapted form of the definitions provided in61. In Eq. (7), B(ρ*) is a solid sphere with radius ρ* such that \(| B({\rho }^{* })| =| \Omega | =\frac{4}{3}\pi {\left({\rho }^{* }\right)}^{3}\).

Moments of Inertia: If the unit cell Ω0 is divided into n x1x2 plane sections along the x3 axis, the moments of inertia with respect to x1 and x2 plane directions are obtained by the following averages:

$$\begin{array}{rcl}{I}_{11} = \frac{1}{n}\mathop{\sum }\limits_{k=1}^{n}{\widehat{I}}_{11}({x}_{3}^{k}),&{I}_{22}=\frac{1}{n}\mathop{\sum }\limits_{k=1}^{n}{\widehat{I}}_{22}({x}_{3}^{k}),&{I}_{12}=\frac{1}{n}\mathop{\sum }\limits_{k=1}^{n}{\widehat{I}}_{12}({x}_{3}^{k})\\ {\widehat{I}}_{11}({x}_{3}^{k}) = {\int }_{{A}_{k}}{x}_{2}^{2}dA,&{\widehat{I}}_{22}({x}_{3}^{k})={\int }_{{A}_{k}}{x}_{1}^{2}dA,&{\widehat{I}}_{12}({x}_{3}^{k})={\int }_{{A}_{k}}{x}_{1}{x}_{2}dA\end{array}$$
(8)

Following the methodology proposed in53, the unit cell is approximated using a voxelized mesh (Fig. 1d). Thus, the integrals of Eq. (8) are approximated as the following sums:

$${\widehat{I}}_{11}({x}_{3}^{k})\cong \mathop{\sum }\limits_{i=1}^{n({A}_{k})}{({x}_{2}^{i})}^{2}{a}_{i},{\widehat{I}}_{22}({x}_{3}^{k})\cong \mathop{\sum }\limits_{i=1}^{n({A}_{k})}{({x}_{1}^{i})}^{2}{a}_{i},{\widehat{I}}_{12}({x}_{3}^{k})\cong \mathop{\sum }\limits_{i=1}^{n({A}_{k})}{x}_{1}^{i}{x}_{2}^{i}{a}_{i}$$
(9)

where n(Ak) is the number of the voxel faces that are aligned with the Ak cross-section and ai denotes the voxel face area.

Radial Distance Metrics: Consider a surface \(M\subset {{\mathbb{R}}}^{3}\) implicitly defined as the level set:

$$M=\{{\bf{x}}\in {{\mathbb{R}}}^{3}| F({\bf{x}})=c\}$$
(10)

where \(F:{{\mathbb{R}}}^{3}\to {\mathbb{R}}\) is a smooth level set function. In addition, introduce an auxiliary spherical surface \({S}^{2}(\rho )=\{{\boldsymbol{\xi }}\in {{\mathbb{R}}}^{3}| \parallel {\boldsymbol{\xi }}\parallel =\rho \}\) where ξ = (ξ1, ξ2, ξ3) and ρ > 0. Therefore, the position of ξ can be described using the spherical coordinates:

$$\widehat{\rho }({\boldsymbol{\xi }})=\sqrt{{\xi }_{1}^{2}+{\xi }_{2}^{2}+{\xi }_{3}^{2}}\,\,\,\,\widehat{\theta }({\boldsymbol{\xi }})={\cos }^{-1}\frac{{\xi }_{3}}{\sqrt{{\xi }_{1}^{2}+{\xi }_{2}^{2}+{\xi }_{3}^{2}}}\,\,\,\,\,\widehat{\varphi }({\boldsymbol{\xi }})={\cos }^{-1}\frac{{\xi }_{1}}{\sqrt{{\xi }_{1}^{2}+{\xi }_{2}^{2}}}$$
(11)

For a given point ξ S2(ρ), we seek a point x M lying on the ray emanating from the origin in the direction of ξ. This is achieved by solving the nonlinear system:

$$\begin{array}{rcl}{R}_{1}({\bf{x}};{\boldsymbol{\xi }}) & = & F({\bf{x}})-c=0\\ {R}_{2}({\bf{x}};{\boldsymbol{\xi }}) & = & \cos \widehat{\varphi }({\boldsymbol{\xi }})-\cos \widehat{\varphi }({\bf{x}})={c}_{1}({\boldsymbol{\xi }})\sqrt{{x}_{1}^{2}+{x}_{2}^{2}}-{x}_{1}=0\\ {R}_{3}({\bf{x}};{\boldsymbol{\xi }}) & = & \cos \widehat{\theta }({\boldsymbol{\xi }})-\cos \widehat{\theta }({\bf{x}})={c}_{2}({\boldsymbol{\xi }})\sqrt{{x}_{1}^{2}+{x}_{2}^{2}+{x}_{3}^{2}}-{x}_{3}=0\end{array}$$
(12)

where \({c}_{1}({\boldsymbol{\xi }})=\cos \widehat{\varphi }({\boldsymbol{\xi }})=\frac{{\xi }_{1}}{\sqrt{{\xi }_{1}^{2}+{\xi }_{2}^{2}}}\) and \({c}_{2}({\boldsymbol{\xi }})=\cos \widehat{\theta }({\boldsymbol{\xi }})=\frac{{\xi }_{3}}{\sqrt{{\xi }_{1}^{2}+{\xi }_{2}^{2}+{\xi }_{3}^{2}}}\). For convenience, (12) is written compactly in a vector format as:

$${\bf{R}}({\bf{x}};{\boldsymbol{\xi }})={\bf{0}}$$
(13)

In the present study, we restrict attention to the positive octant ξ1, ξ2, ξ3 > 0, which ensures that the angular coordinates in (11) are well defined and avoids ambiguities arising from symmetry.

For a given ξ, the projection f: ξ x, is obtained by applying a Newton-Raphson iteration:

$$\begin{array}{rcl}\frac{\partial }{\partial {\bf{x}}} {\bf{R}}({{\bf{x}}}^{k-1};{\boldsymbol{\xi }}) \cdot \Delta {\bf{x}} & = & \left.-{\bf{R}}({{\bf{x}}}^{k-1};{\boldsymbol{\xi }})\right)\\ {{\bf{x}}}^{k} & = & {{\bf{x}}}^{k-1}+\Delta {\bf{x}}\end{array}$$
(14)

starting from an initial guess x0.

Due to the multiply connected nature of triply periodic minimal surfaces, not all rays admit a well-posed intersection with M. Therefore, only rays for which the Newton procedure converges to a unique intersection point within the prescribed unit cell are retained. Rays leading to no intersection or multiple intersections are discarded. The resulting set is referred to as the set of admissible rays.

Let GS2(ρ) = {ξi S2, i = 1, 2, …, NS} denote a discrete sampling of the spherical surface. Solving Eq. (13) for each admissible ξi yields a collection of pairs:

$$f(G\circ {S}^{2},G\circ M)\,\left.=\left\{({{\boldsymbol{\xi }}},{{\bf{x}}})\right|{{\boldsymbol{\xi }}}\in G\circ {S}^{2}(\rho ),{{\bf{x}}}\in G\circ M,{\bf{R}}({\bf{x}};{\boldsymbol{\xi }})={\bf{0}}\right\}$$
(15)

With the aid of the collected pairs (15), the following metrics are defined:

$$\begin{array}{rcl}\bar{D} & = & \frac{1}{{N}_{S}}\mathop{\sum }\limits_{i=1}^{{N}_{S}}d({{\boldsymbol{\xi }}}^{i},{{\bf{x}}}^{i})\\ \,\mathrm{Var}\,(D) & = & \frac{1}{{N}_{S}}\mathop{\sum }\limits_{i=1}^{{N}_{S}}{\left(d({{\boldsymbol{\xi }}}^{i},{{\bf{x}}}^{i})-\widehat{d}\right)}^{2}\\ {D}_{\max } & = & \,\max \,\left(\left\{d({{\boldsymbol{\xi }}}^{i},{{\bf{x}}}^{i})\right\}\right)\\ {D}_{\min } & = & \,\min \,\left(\left\{d({{\boldsymbol{\xi }}}^{i},{{\bf{x}}}^{i})\right\}\right)\end{array}$$
(16)

where d(ξi, xi) = xiξi2 denotes the Euclidean distance.

In the present work, the surface M corresponds to a triply periodic minimal surface MTPMS defined over a single unit cell of characteristic length L. The radius of the auxiliary spherical surface is chosen as ρ = 0.5L which ensures that ray origins lie within the unit cell and avoids intersections with periodic replicas of the surface. The discrete sampling GS2(ρ) is obtained by a distribution of NS = 5810 Lebedev quadrature points62.

Machine learning modeling and explainability analysis

Building on the feature definitions and predictive framework introduced in Section 2.3, this section summarizes the training, evaluation, and interpretability pipeline for the Random Forest and Extreme Gradient Boosting models. Both ensemble methods are well suited for nonlinear regression on tabular geometric descriptors and provide strong generalization performance63,64. The models are trained using the geometric input vector X defined in Eq. (1) to predict the effective properties Y defined in Eq. (2). An overview of the Machine Learning Modeling Framework is depicted in Fig. 6.

Fig. 6: Overview of the Machine Learning Modeling Framework.
Fig. 6: Overview of the Machine Learning Modeling Framework.The alternative text for this image may have been generated using AI.
Full size image

Schematic of the predictive pipeline. Input Data consisting of geometric features and performance targets undergoes Train/Test split. The data is processed through feature scaling before being used to train the models. The framework evaluates predictions using the coefficient of determination and includes an iterative loop for 5-fold cross-validation to ensure model robustness.

Random Forest (RF, scikit-learn v1.6.1) and XGBoost regressors (XGB, xgboost v2.1.4) were employed, both implemented in Python 3.9.18. Random Forest predictions are obtained as the ensemble average of ntrees = 100 decision trees, using the default hyperparameter setting of the associated library.

$${\hat{{\bf{Y}}}}_{{\rm{RF}}}={{\mathcal{F}}}_{{\rm{RF}}}({\bf{X}})=\frac{1}{{n}_{{\rm{trees}}}}\mathop{\sum }\limits_{j=1}^{{n}_{{\rm{trees}}}}{{\mathcal{F}}}_{j}({\bf{X}})$$
(17)

where \({{\mathcal{F}}}_{j}:{\bf{X}}\,\mapsto\, \hat{{\bf{Y}}}\) denotes the jth decision tree. Given the data \({\left\{({{\bf{X}}}_{i},{{\bf{Y}}}_{i})\right\}}_{i=1}^{N}\), the learning procedure aims seeks the parameters λ*, such that:

$$\begin{array}{rcl}{{\boldsymbol{\lambda }}}^{* } & = & \arg \mathop{\min }\limits_{{\boldsymbol{\lambda }}}{\mathcal{L}}({\boldsymbol{\lambda }})\\ {\mathcal{L}}({\boldsymbol{\lambda }}) & = & \frac{1}{N}\mathop{\sum }\limits_{i=1}^{N}{({{\mathcal{F}}}_{\mathrm{RF}}({{\bf{X}}}_{i})-{\hat{{\bf{Y}}}}_{i})}^{2}\end{array}$$
(18)

XGBoost, in contrast, is a boosting-based ensemble strategy where the prediction of the regressor: \({\mathcal{F}}:{\bf{X}}\,\mapsto\, \hat{{\bf{Y}}}\) is updated iteratively based on a Newton-Raphson procedure:

$$\begin{array}{rcl}{{\mathcal{F}}}_{m}({{\bf{X}}}_{i}) & = & {{\mathcal{F}}}_{m-1}({{\bf{X}}}_{i})-{\mathcal{H}}{({{\bf{X}}}_{i})}^{-1}{\mathcal{G}}({{\bf{X}}}_{i})\\ {\mathcal{G}}({{\bf{X}}}_{i}) & = & \frac{\partial {\mathcal{L}}}{\partial {{\mathcal{F}}}_{m-1}({{\bf{X}}}_{i})}=\frac{2}{N}({{\mathcal{F}}}_{{\rm{RF}}}({{\bf{X}}}_{i})-{\hat{{\bf{Y}}}}_{i})\\ {\mathcal{H}}({{\bf{X}}}_{i}) & = & \frac{{\partial }^{2}{\mathcal{L}}}{\partial {{\mathcal{F}}}_{m-1}({{\bf{X}}}_{j})\partial {{\mathcal{F}}}_{m-1}({{\bf{X}}}_{i})}=\frac{2}{N}{\delta }_{ji}\end{array}$$
(19)

where, at each boosting iteration m, the update term \(h({{\bf{X}}}_{i})={\mathcal{H}}{({{\bf{X}}}_{i})}^{-1}{\mathcal{G}}({{\bf{X}}}_{i})\) is fitted to the residual \({{\mathcal{F}}}_{{\rm{RF}}}({{\bf{X}}}_{i})-{\hat{{\bf{Y}}}}_{i}\).

All input features were standardized using z-score normalization, where the scaling parameters (mean and standard deviation) were computed on the training subset only and subsequently applied to the test data:

$${{\bf{X}}}_{i}^{({\rm{scaled}})}=\frac{{{\bf{X}}}_{i}-{\mu }_{x}}{{\sigma }_{x}}$$
(20)

The trained machine-learning model, held-out test set, fitted feature standardization object and a standalone evaluation script are publicly available in a Zenodo repository for reproducibility (https://doi.org/10.5281/zenodo.18362271) 65. Model performance was evaluated on a total of 144 samples, using a stratified 90%/10% train–test split, where stratification was performed with respect to the TPMS topology label, ensuring that each fold contains approximately equal proportions of samples from each TPMS topology. The computing time required for the features estimation and the merely elastic part of the dataset creation (\(\widehat{E}\), \(\widehat{G}\), \(\widehat{\kappa }\)) on an AMD Ryzen 9 7950X 16-core processor equipped with a 32 GB RAM was approximately 7.5 hours, corresponding to ~ 188 s per sample. However, for the numerical evaluation of the yield stress limit using the nonlinear elastoplastic FEM analysis, substantially higher computing times were required. In particular, for the yield stress \({\widehat{\sigma }}_{y}\) estimations (Section 4.4), approximately 45 minutes (2700 sec) were needed per sample, corresponding to a total of 108 hours for the full set of 144 samples. This leads to an average total time of 2888 s per sample, highlighting the significant computational resources required for the dataset creation. To assess the robustness of the learned structure–property relationships and to identify a reduced yet informative descriptor set, a progressive feature elimination strategy was employed. Four training configurations were considered: (i) the full feature set; (ii) exclusion of the volume fraction Vf; (iii) exclusion of both Vf and the principal moment of inertia I11; and (iv) a minimal feature set comprising only C, An, and Var(D), identified through this elimination process.

Predictive accuracy was quantified using the coefficient of determination,

$${R}^{2}=1-\frac{{\sum }_{i=1}^{N}{({{\bf{Y}}}_{i}-{\hat{{\bf{Y}}}}_{i})}^{2}}{{\sum }_{i=1}^{N}{({{\bf{Y}}}_{i}-\bar{Y})}^{2}}$$
(21)

Both RF and XGBoost achieve high predictive fidelity across all four target properties and across all training configurations (Fig. 7). Using the full feature set, all models exceed R2 = 0.95, with values reaching up to R2 = 0.99 for thermal conductivity. Importantly, predictive performance remains consistently high under successive feature removal, remaining above R2 > 0.92 even when both Vf and I11 are excluded. This confirms that the remaining geometric descriptors retain strong predictive content independent of volumetric and inertial information. Notably, the minimal descriptor set also preserves very high predictive accuracy, with R2 values exceeding 0.96 across all target properties.

Fig. 7: Parity plots comparing Random Forest (RF) and XGBoost (XGB) predictions.
Fig. 7: Parity plots comparing Random Forest (RF) and XGBoost (XGB) predictions.The alternative text for this image may have been generated using AI.
Full size image

a Full feature set. b Excluding volume fraction Vf. c Excluding both Vf and I11. d Minimum feature set, only using C, An, and Var(D). Results are shown for the 10% hold-out test set.

In addition to the train–test evaluation, model robustness was further assessed using 5-fold cross-validation for all feature configurations. This provides a more statistically reliable estimate of model performance by averaging results across multiple folds and reducing the influence of any single data split. Cross-validation performance is reported in terms of the mean and standard deviation of the coefficient of determination across folds (Table 2). Consistent with the hold-out results, both RF and XGBoost exhibit strong and stable predictive performance across all four target properties.

Model interpretability was assessed using both built-in feature importance metrics and SHapley Additive exPlanations (SHAP)54 computed on the 10% hold-out test set. For a given prediction, the SHAP decomposition is expressed as

$$\hat{Y}={\phi }_{0}+\mathop{\sum }\limits_{i=1}^{M}{\phi }_{i}$$
(22)

where ϕi denotes the marginal contribution of feature i. The resulting feature rankings and hierarchical geometric interpretation are presented in Section 2.3.

Beyond first-order effects, second-order feature interactions were quantified by computing pairwise correlations between SHAP values of all descriptors. For each target property, the five most influential features were identified based on mean absolute SHAP magnitude. Interaction strength was then evaluated using the absolute Pearson correlation between their SHAP values and those of the remaining features. For each dominant descriptor, the three strongest interaction partners were retained, forming structured interaction maps for all three training configurations. This interaction analysis complements the feature-level hierarchy reported in Section 2.3 and enables systematic assessment of coupled geometric effects.

Data generation through voxel finite element method

Assuming infinitesimal strains and a homogeneous, isotropic material in the domain Ω, and neglecting both distributed elastic and thermal body loads, the steady-state displacement and temperature fields u: x (u1(x), u2(x), u3(x)) and θ: x θ(x) satisfy the following system of partial differential equations:

$$\begin{array}{l}\nabla \cdot \tilde{{\boldsymbol{\sigma }}}({\bf{u}})={\bf{0}},\nabla \cdot {\bf{q}}(\theta )=0\\ \tilde{{\boldsymbol{\sigma }}}({\bf{u}})=f(\tilde{{\boldsymbol{\epsilon }}}({\bf{u}})),\tilde{{\boldsymbol{\epsilon }}}({\bf{u}})=\frac{1}{2}\left(\nabla {\bf{u}}+{\bf{u}}\nabla \right),{\bf{q}}(\theta )=-\kappa \nabla \theta \end{array},{\bf{x}}\in {\Omega }_{0}$$
(23)

where \(\tilde{{\boldsymbol{\sigma }}}({\bf{u}})={\tilde{\sigma }}_{ij}({\bf{u}}){\hat{{\bf{x}}}}_{i}\otimes {\hat{{\bf{x}}}}_{j}\), \(\tilde{{\boldsymbol{\epsilon }}}({\bf{u}})={\tilde{e}}_{ij}({\bf{u}}){\hat{{\bf{x}}}}_{i}\otimes {\hat{{\bf{x}}}}_{j}\) and \({\bf{q}}(\theta )={q}_{i}(\theta ){\hat{{\bf{x}}}}_{i}\) denote the stress and strain second-rank tensors and heat flux vector, respectively. The mechanical and thermal boundary value problems are complemented employing uniform displacement and temperature periodic boundary conditions25,66 on the boundary ΓD. For the linear elastic case, the mechanical constitutive law of Eq. (23), that relates the stress with the strain tensor, reduces to Hooke’s law: \(\widetilde{{\boldsymbol{\sigma }}}({\bf{u}})=\lambda (\nabla \cdot {\bf{u}})\widetilde{{\bf{I}}}+\mu (\nabla {\bf{u}}+{\bf{u}}\nabla )\), where the Lame parameters λ = λ(E, ν) and μ = μ(E, ν) are functions of the phase material Young’s modulus E and Poisson’s ratio ν 25.

Considering the functional spaces \({V}_{w}=\left\{{\bf{w}}\in {[{H}^{1}({\Omega }_{0})]}^{3}| {\bf{w}}={\bf{0}}\,{\rm{on}}\,{\Gamma }_{D}\right\}\) and Vυ = {υ H10)υ = 0 on ΓD}, the following weak forms are obtained by multiplying the first-row equations of Eq. (23) with w Vw and υ Vυ, respectively, and integrating on Ω067:

$$\begin{array}{l}\,{\mathrm{Find}}\,{\bf{u}}\in {V}_{u}\,{\mathrm{such}}\; {\rm{that}}\; :\,{\int }_{\partial {\Omega }_{0}}{\bf{w}}\cdot \left[{\bf{n}}\cdot \widetilde{{\boldsymbol{\sigma }}}({\bf{u}})\right]d\Gamma -{\int }_{{\Omega }_{0}}\nabla {\bf{w}}:\widetilde{{\boldsymbol{\sigma }}}({\bf{u}})d\Omega =0,\forall {\bf{w}}\in {V}_{w}\\ \,{\mathrm{Find}}\,\theta \in {V}_{\theta }\,{\mathrm{such}}\; {\rm{that}}\; :\,{{\int }_{\partial {\Omega }_{0}}\upsilon \left[\kappa {\bf{n}}\cdot \nabla \theta \right]}d\Gamma -{{\int }_{{\Omega }_{0}}\kappa \nabla \upsilon \cdot \nabla \theta} d\Omega =0,\forall \upsilon \in {V}_{\upsilon }\end{array}$$
(24)

where \({V}_{u}=\left\{{\bf{u}}\in {[{H}^{1}({\Omega }_{0})]}^{3}| {\bf{u}}={{\bf{u}}}_{0}\,{\rm{on}}\,{\Gamma }_{D}\right\}\) and \({V}_{\theta }=\left\{\theta \in {H}^{1}({\Omega }_{0})| \theta ={\theta }_{0}\,{\rm{on}}\,{\Gamma }_{D}\right\}\).

The unit cell Ω0 is discretized into a set of \({N}_{{\Omega }_{0}}\) 8-noded brick elements K (voxels), defined as \({\mathcal{T}}({\Omega }_{0})={\bigcup }_{j=1}^{{N}_{{\Omega }_{0}}}{K}_{j}\). The set of voxels belonging to the material phase is defined as \({\mathcal{T}}(\Omega )=\left\{K\in {\mathcal{T}}({\Omega }_{0})| K\cap \Omega ={\rm{\varnothing }}\right\}\). Assuming linear elasticity, the weak forms of Eq. (24) lead to finite element formulations, therefore, the displacement u and temperature values θ are obtained on the nodes of the voxel mesh \({\mathcal{T}}({\Omega }_{0})\). Commercial Ansys APDL and voxel-finite element analysis comparisons are provided in Figure S1 of the Supplementary Material, along with detailed convergence analysis results, and extended literature comparisons on the effective performance. The obtained nodal values, are employed to calculate the effective material properties based to the following relations25,66:

$$\widehat{E}=\frac{{\left.\left\langle {\sigma }_{11}\right\rangle \right|}_{{\Omega }_{0}}}{{\epsilon }_{0}}\,\,\,\,\widehat{G}=\frac{{\left.\left\langle {\sigma }_{12}\right\rangle \right|}_{{\Omega }_{0}}}{{\gamma }_{0}}\,\,\,\,\,\,\,\widehat{\kappa }=\frac{{\left.\left\langle {q}_{1}\right\rangle \right|}_{{\Omega }_{0}}}{{q}_{0}}$$
(25)

where \({\left.\left\langle {\sigma }_{11}\right\rangle \right|}_{{\Omega }_{0}}\), \({\left.\left\langle {\sigma }_{12}\right\rangle \right|}_{{\Omega }_{0}}\), \({\left.\left\langle {q}_{1}\right\rangle \right|}_{{\Omega }_{0}}\) denote the average components of the stress tensor and heat flux vector, with \({\left.\left\langle f\right\rangle \right|}_{{\Omega }_{0}}=\frac{1}{| {\Omega }_{0}| }{\int }_{{\Omega }_{0}}fd\Omega\) representing the average of a function f. It should be noted that a value of 27000 for the \({N}_{{\Omega }_{0}}\) appears to be sufficient for convergent results on the above performance metrics, as summarized in the Supplementary Material (S1).

The effective elastoplastic response of the unit cell Ω0 is modeled, based on the J2 plasticity with isotropic hardening67. For a bilinear material law, the plastified stress tensor \(\widetilde{{\boldsymbol{\sigma }}}\) should satisfy the yield surface, described by the following level set equation:

$$f(\widetilde{{\boldsymbol{\sigma }}},\kappa )=\parallel \widetilde{{\boldsymbol{\sigma }}}{\parallel }_{VM}-({\sigma }_{y,0}+H{\mathop{\epsilon }\limits^{.}}_{p})=0$$
(26)

where \(\parallel \widetilde{{\boldsymbol{\sigma }}}{\parallel }_{VM}=\sqrt{\frac{3}{2}\widetilde{{\bf{s}}}:\widetilde{{\bf{s}}}}\) denotes the von Mises equivalent stress, with \(\widetilde{{\bf{s}}}\) being the deviator of \(\widetilde{{\boldsymbol{\sigma }}}\), while σy,0, H and \({\mathop{\epsilon }\limits^{^\circ }}_{p}\) are the base material yield limit, the hardening rate and the accumulated plastic strain, respectively. Based on the yield criterion, the material plastification is characterized by the following binary indicator:

$$P({\bf{x}})=\left\{\begin{array}{l}1,\,f(\widetilde{{\boldsymbol{\sigma }}}({\bf{x}}),\kappa ({\bf{x}}))=0\\ 0,\,f(\widetilde{{\boldsymbol{\sigma }}}({\bf{x}}),\kappa ({\bf{x}})) < 0\end{array}\right.$$
(27)

which distinguishes plastified and elastic regions within the unit cell. This definition leads to the set of plastified voxels:

$${\mathcal{P}}({\Omega }_{0})=\left\{K\in {\mathcal{T}}({\Omega }_{0})| P({\hat{{\bf{x}}}}_{K})=1\right\}$$
(28)

where \({\hat{{\bf{x}}}}_{K}\) is the centroid of the voxel K. In the sequel, the plastification volume fraction Vp is defined as:

$${V}_{p}=\frac{1}{| {\Omega }_{0}| }{\int }_{{\Omega }_{0}}P({\bf{x}})d\Omega \cong \frac{1}{| {\Omega }_{0}| }\mathop{\sum }\limits_{K\in {\mathcal{P}}({\Omega }_{0})}| K|$$
(29)

Therefore, the effective yield stress is defined as the average von-Mises stress in order to achieve Vp = 0.1%.

$${\widehat{\sigma }}_{y}=\frac{{\left.\left\langle \parallel \widetilde{{\boldsymbol{\sigma }}}{\parallel }_{VM}\right\rangle \right|}_{{\Omega }_{0}}}{{\sigma }_{y,0}}$$
(30)

The threshold value of Vp = 0.1% is chosen sufficiently small to characterize the initiation of plastic deformation. In the present study, the base material is considered to be aluminium with an initial yield limit σy,0 = 250MPa and an assumed hardening rate H = 0.1 × E = 6.9 GPa.

It should be noted that the use of Voxel-FEM methods for the mechanical analysis of such a large number of TPMS structures is to a great extent dictated by the limitations of current CAD TPMS modeling and mechanical analysis programs. These are not directly linked, so that the post-processing steps required for the translation of a mere CAD TPMS model, typically provided in an .stl file, thus surface type formatting to a solid FEM model that can be properly meshed and analyzed in standard Finite element analysis software packages are particularly labor, and computational resources intensive. In particular, the intermediate surface to volume conversion steps are not automated, requiring substantial computational resources and excessive working times for the processing of hundreds of models. These intermediate conversion steps are decisively simplified in the Voxel-FEM formulation, as the CAD geometry is directly converted to a solid voxel mesh through the use of the implicit field φ function of Table 3 for each topology, which is used as a domain identification equation, separating solid voxels from the surrounding empty material space. As such, parametric, sufficient sample dataset creation tasks are particularly facilitated.

Table 3 Level-set equations of selected TPMS and related surfaces