Multi-modal conditional diffusion model using signed distance functions for metal-organic frameworks generation

Park, Junkil; Lee, Youhan; Kim, Jihan

doi:10.1038/s41467-024-55390-9

Download PDF

Article
Open access
Published: 02 January 2025

Multi-modal conditional diffusion model using signed distance functions for metal-organic frameworks generation

Nature Communications volume 16, Article number: 34 (2025) Cite this article

10k Accesses
6 Citations
13 Altmetric
Metrics details

Subjects

Abstract

The design of porous materials with user-desired properties has been a great interest for the last few decades. However, the flexibility of target properties has been highly limited, and targeting multiple properties of diverse modalities simultaneously has been scarcely explored. Furthermore, although deep generative models have opened a new paradigm in materials generation, their incorporation into porous materials such as metal-organic frameworks (MOFs) has not been satisfactory due to their structural complexity. In this work, we introduce MOFFUSION, a latent diffusion model that addresses the aforementioned challenges. Signed distance functions (SDFs) are employed for the input representation of MOFs, marking their first usage in representing porous materials for generative models. Using the suitability of SDFs in describing complicated pore structures, MOFFUSION exhibits exceptional generation performance, and demonstrates its versatile capability of conditional generation with handling diverse modalities of data, including numeric, categorical, text data, and their combinations.

A multi-modal pre-training transformer for universal transfer learning in metal–organic frameworks

Article 13 March 2023

Connecting metal-organic framework synthesis to applications using multimodal machine learning

Article Open access 01 July 2025

Chemically routed interpore molecular diffusion in metal-organic framework thin films

Article Open access 18 April 2023

Introduction

Renowned for their high porosity and structural diversity, metal-organic frameworks (MOFs) have become a promising class of materials for a range of applications, including gas storage^1,2,3, catalysis^4,5,6, and drug delivery^7,8. A variety of MOFs with unique structures and novel functionalities have been investigated and synthesized over the past decades, pushing boundaries of MOFs across various industries and research areas^9,10. The most distinctive characteristic and greatest advantage of MOFs lie in their modular nature. The diverse combinations of metal nodes, organic linkers and their geometric topologies make MOFs highly tunable, offering numerous experimentally reported structures^11,12 and virtually unlimited chemical search space^13,14.

Given the large number of candidate structures, researchers have made considerable efforts to search for the top-performing MOFs for specific applications. The most commonly attempted method is high-throughput screening (HTS)^15,16,17, which explores and compares every structure in a database. However, HTS shows limitations with the increased size of the database, and as such, optimization methods such as genetic algorithms^18,19,20 and Bayesian optimizations^21,22,23 have been employed for more efficient exploration of the MOF search space. Recently, generative models have been developed to design materials by learning the intrinsic distribution of the training dataset, and then generating new materials from this space. Various generative models, including variational autoencoders (VAEs)^24,25 and generative adversarial networks (GANs)^26,27, are actively being studied for diverse materials generation tasks.

Although such methods have been used to design MOFs with user-desired properties, the range of the target properties is highly restricted in terms of flexibility. Most previous works have focused primarily on numeric target characteristics, such as pore diameter (e.g., 15 Å) or adsorption capacity (e.g., 40 g/L). However, non-numeric data, such as pore morphology^28,29 and thermal/chemical stability^30,31 are also of great interest, especially for experimentalists, and these properties have not been widely incorporated in this context. Additionally, handling text-based data facilitates users to communicate with the machine learning models, and thereby significantly lowers the barrier to utilizing such technology^32,33,34. Nevertheless, the conditioning on diverse modalities of data for deep generative models targeted at porous materials generation has not been explored yet.

To remedy this issue, diffusion models offer an innovative approach, as they have gained numerous attention for their versatile conditioning capabilities^{34,35,36,37,38}. These models have been actively applied in the field of materials generation with drug-like molecules^39,40, proteins^41,42 and small crystals⁴³ being main targets. When it comes to generating MOFs using diffusion models, Park et al.⁴⁴ focused on the generation of MOF linkers rather than entire structures, while Fu et al.⁴⁵ reduced the structural complexity by applying coarse-grained representation. Nonetheless, the structural diversity and validity of the generated samples remain lacking, and the consideration of the target property for inverse design has been limited to only numeric data (e.g., CO₂ capacity). This highlights the necessity for a data representation that can better capture the intrinsic characteristics of porous materials, along with a novel architecture that enables flexible conditioning.

In this work, we propose MOFFUSION, a multi-modal conditional diffusion model that generates MOFs with careful attention to the three-dimensional (3D) shape of the pore structures (Fig. 1). A database composed of 247,742 hypothetical MOF structures was used for the training, which were generated using PORMAKE¹³ (a MOF construction software) and further geometry optimized using LAMMPS software⁴⁶. We applied a signed distance function (SDF)^37,47,48 for the input representation, marking the first application of 3D modeling techniques for the generation of MOFs. The suitability of SDF to describe MOFs and to capture essential features of porous materials was verified through the high structural validity of the generated samples.

Within MOFFUSION, latent diffusion model architecture and attention algorithms were employed to address the challenges of the limited flexibility of conditioning targets. The ability of our model to conditionally generate MOFs based on diverse modalities of data, including numeric data, categorical data, text, and even their combinations, was validated. As a representative numeric data, MOFFUSION was conditioned on hydrogen working capacity and its capability to generate structures with the desired hydrogen capacity was validated through Grand Canonical Monte Carlo (GCMC) simulations. MOFFUSION also demonstrated outstanding performance in handling categorical data, achieving high accuracy in generating MOFs with targeted geometric topologies. Furthermore, its capability to condition with text data was validated by generating MOFs with desired characteristics based on input prompts. Lastly, MOFFUSION’s ability to account for multiple target properties during conditional generation was confirmed, and its inherent capability of generating MOFs with user-tailored pore shapes was further demonstrated.

Results

SDF representations for periodic crystals

Modeling 3D shapes is an active research area, particularly in the fields of animation and the creation of 3D assets in virtual environments⁴⁹. Of the various data types used to represent 3D geometries, SDFs offer an implicit encoding of object surfaces by measuring the signed distances between 3D coordinate queries to their closest surface (Fig. 1a). SDFs have proven to be a versatile representation (e.g., controllable resolution, and small memory requirements), with a particular strength in accurately modeling the shape of the surfaces.

The pore structure is an essence of porous materials, differentiating them from the other substances. The morphology of the pore structure plays a critical role in determining the overall characteristics of the material. In the case of MOFs, the importance of the pore structure on their chemical properties have been widely observed across various applications, including gas storage⁵⁰, separation^51,52, and even catalysis⁵³. Therefore, utilizing information about the continuous shape of MOF surfaces is crucial in understanding their intrinsic nature and thereby their microscopic/macroscopic characteristics. However, most data representations of MOFs and other porous materials for machine learning (e.g., graph representation⁵⁴) have predominantly focused on the discrete location of atoms or their connectivity. In this context, considering the information on their 3D surfaces could bring a new paradigm in processing porous materials with machine learning models. Therefore, SDFs, known for its capability of delicately describing surfaces, can serve as an appropriate input representation for generating porous materials.

In MOFFUSION, we employed SDFs to describe porous materials for the first time in their generation. SDFs have typically been applied to finite objects, such as 3D assets, and their implementation in repeating systems with periodic boundary conditions (PBCs) has been limited especially in the field of machine learning. Therefore, we used our own methodology to obtain SDFs from the crystal system with considering PBCs, as described in the Supplementary Note #1-1. The procedure for the preparation of periodic SDFs includes conducting the sampling of SDFs in the supercell of the structure, and then taking the minimum value among the replicated SDF sampling points. From the crystal structure and the periodic SDF of MOF-5 shown in Supplementary Fig. 1, it can be verified that the periodic SDF well describes the periodicity of the crystalline structure by capturing the connectivity across the cell boundaries. Notably, the resolution of SDFs is tunable, and we used a resolution of 32³ (i.e., sampling a given unit cell by dividing each axis into 32) to balance representation accuracy with memory constraints. We qualitatively validated that a resolution of 32³ was sufficient for accurately representing the intrinsic pore structures of various representative MOFs (Supplementary Fig. 2). It is worth noting that during the generation process using PORMAKE, a widely used tool for the generation of hypothetical MOFs, we limited the maximum cell length of the generated structures to 60 Å. Therefore, the length of one grid lattice is no longer than 2 Å, ensuring the most chemical bonds are allocated to more than one grid. However, a finer grid will lead to more detailed pore morphology, which will enhance the overall performance of the workflow.

MOFFUSION architecture

The overall scheme of the MOFFUSION architecture is illustrated in Fig. 1c. MOFFUSION basically follows the architecture of a latent diffusion model and is mainly composed of three parts: a vector quantized-variational autoencoder (VQ-VAE), a diffusion model, and a MOF Constructor. Detailed explanation about the diffusion model part will be covered in the Methods section, and the other two parts will be discussed more extensively in this section and in the Supplementary Note #2.

A VQ-VAE, known for its high training stability and model robustness⁵⁵, is employed within MOFFUSION for the data compression and restoration (Fig. 2a). Notably, within the convolutional layers of the VQ-VAE architecture, periodic padding (or circular padding) was used to account for the periodicity of MOFs during the processing. The embedding space of the trained VQ-VAE into 2D space is depicted in Fig. 2b, using t-distributed stochastic neighbor embedding (t-SNE)⁵⁶. The SDFs of MOFs with the top 10 most frequently occurring topologies in the database were assessed, showing that the SDFs of MOFs with the same topologies are clustered and distinguishable from others. This supports that the VQ-VAE was successfully trained, effectively capturing geometric features of SDFs during mapping them into the latent space. Further analysis of the trained VQ-VAE can be found in the Supplementary Note #2-1.

**Fig. 2: VQ-VAE for data compression and restoration.**

While new SDFs are being generated through the diffusion model, the actual demand is for MOF structures rather than SDFs themselves. As a result, MOFFUSION features a unique architecture that includes an additional neural network, named MOF Constructor, which maps generated SDFs to MOF structures. MOF Constructor is a ResNet⁵⁷-based classification model that returns the corresponding topology and building blocks of the given input SDF. MOF Constructor exhibited high prediction accuracy, exceeding 80 % for every building blocks (topology, metal node, organic node, and edge) in the top-5 predictions (Supplementary Fig. 6c). The information on the predicted building components can then be used to generate actual MOF structures using PORMAKE software. Detailed model architecture and in-depth investigation on MOF Constructor can be found in the Supplementary Note #2-3.

Generation performance comparison with precedent models

The generation performance of MOFFUSION was assessed and compared with several precedent models for MOF generation. SMVAE²⁴ is a multi-component VAE architecture that used template-based representation for describing MOFs. MOFDiff⁴⁵ is a coarse-grained diffusion model which represented the building blocks of MOFs using a coarse-grained manner, further applying assembly algorithm as a post-processing step. GHP-MOFassemble⁴⁴ is another diffusion model approach for the generation of MOFs, but it was not considered for comparison as it focused on generating organic linkers, rather than the whole structure, and inserted them into pcu topology with fixed metal nodes.

We built a dataset of 247,742 hypothetical MOFs with diverse topologies and building blocks to train MOFFUSION. Our database comprised of 605 different topologies, 432 metal node, 51 organic nodes, and 220 organic edges, spanning a wide diversity of MOF building components, especially in geometric topologies and types of metal nodes (Table 1). Unlike Perov-5⁵⁸ for perovskites, or Carbon-24⁵⁹ for carbon materials, there exist no publicly shared dataset for MOF generation tasks. All precedent research used their own datasets for training and it is unfeasible to re-train every model with a shared dataset due to the differences in their detailed methodologies. This presents limitations in thoroughly comparing the performance between different models. However, a comparison was conducted as fairly as possible, by focusing solely on the output structures obtained from 10,000 generation attempts for each trained model. The structural validity of the generated MOFs and property statistics for various chemical properties were evaluated, which are widely used metric for the comparison of generative models in materials generation⁴³ (Table 1). The structure was deemed valid if it could be generated using PORMAKE (which accounts for building block compatibility and steric hindrance), and if it successfully underwent geometry optimization without failure. Additionally, property statistics entail an earth mover’s distance (EMD) of chemical properties between the training set and the generated data, assessing the capability of the model in generating realistic samples. Detailed explanations on the evaluation metrics can be found in the Supplementary Note #3-3.

Table. 1 Comparison of dataset diversity and generation performance of MOFFUSION with precedent models

Full size table

MOFFUSION achieved a structure validity of 81.7%, which is higher than other baseline models, as shown in Table 1. The high validity of MOFFUSION stems from its novel architecture and the benefits of SDFs, which provide a deep understanding of the geometric nature of MOFs. The uniqueness and novelty of structures are additional metrics often considered for evaluating generative models. Uniqueness indicates how non-redundant the generated structures are, while novelty measures the percentage of generated structures not included in the training data. For the structures generated using MOFFUSION, 79.9 % were found to be valid and unique, while 58.4% were valid, unique, and novel. This validates MOFFUSION’s capability to generate new structures unseen during the training. However, uniqueness and novelty were not compared with other models, as their respective papers did not report these specific values, except for MOFDiff who reported that its proportion of valid, novel, and unique structures was 30.0%.

MOFFUSION also significantly outperforms baseline models in the property statistics of all three properties (PLD, surface area, and density). The properties distributions of the structures generated through MOFFUSION were almost identical to those comprising the training dataset, as illustrated in Supplementary Fig. 8. This demonstrates MOFFUSION’s ability to effectively learn the probability distribution of the training data, confirming the outstanding similarity between the generated structures and those in the training set in terms of their chemical behavior.

In addition, the interatomic distances of the generated structures were compared with the experimental structures comprising the Open Chemistry Database (OChemDB⁶⁰). The metal-oxygen bond distances in the generated structures from MOFFUSION showed similar distributions to those of the structures from OChemDB, which further validates the reliability of the generated structure and the entire generation workflow (Supplementary Fig. 9). Furthermore, an ablation study on the SDFs was performed to assess the significance of representing structures as SDFs, as shown in the Supplementary Note #3-4. The comparison with a diffusion model that directly predicts the building block entities highlights that representing structures using SDFs played a significant role in the overall performance of MOFFUSION.

Conditional generation on numeric data

As mentioned earlier, the primary challenge posed in this work was the restricted flexibility in the target properties of porous materials generation, a limitation commonly found in previous inverse design methodologies. While other generative models targeted for MOFs used separate prediction models for target property prediction in their latent space, MOFFUSION employs a classifier-free guidance approach for conditional generation. By directly integrating conditioning into the generative model, classifier-free guidance offers several advantages, including simplicity, flexibility and efficiency in training⁶¹. The conditional generation capability of MOFFUSION was tested on diverse data modalities, and the detailed conditioning scheme of each modality can be checked in Supplementary Fig. 11.

First, conditioning on numeric data was evaluated. Among various numeric target properties, gas adsorption capacity was targeted as a representative example. Gas adsorption is of significant interest for MOFs due to their large pore volume and surface area compared to other porous materials. We specifically focused on hydrogen gas, a promising future energy source^62,63. The conditional model was trained for hydrogen working capacity (WC), determined by a pressure swing between 5 bar and 100 bar at 77 K. Structures were generated with target WC values of 5, 15, 25, and 35 g/L for evaluation, with a total of 1000 generation attempts for each target. The generated structures exhibited distinctive distributions of WC, with peaks located near the respective target values (Fig. 3). Additionally, adsorption isotherms of representative samples were assessed through GCMC simulations as a sanity check, confirming that structures generated with larger target values exhibited larger WC, and vice versa. Details on hydrogen adsorption simulations can be found in the Supplementary Note #5-2.

**Fig. 3: Conditional generation targeted on hydrogen working capacity.**

A comparison of MOFFUSION with the genetic algorithm would be valuable, as the latter has been investigated for a similar objective (i.e., the generation of MOFs with desired hydrogen capacity). The genetic algorithm and MOFFUSION share similarities, as both utilize a fixed set of building block for the generation of MOFs. Although the training of MOFFUSION may take considerable time (e.g., several days), in the inference phase, the user can generate a large number of structures (e.g., 10,000) in a few minutes. In contrast, the genetic algorithm necessarily requires numerous cycles for the property of the generated MOFs to converge, which took roughly one week in the workflow from Park et al.¹⁸ Furthermore, the genetic algorithm requires entirely new cycles for each different target value (e.g., 5, 15, 25, and 35 g/L), whereas MOFFUSION does not require re-training, making MOFFUSION a more efficient option.

In addition to hydrogen WC, we also evaluated our model on conditioning with pore diameter. We targeted the largest cavity diameter (LCD), which is straightforward, yet one of the most important parameters of MOFs. Structures were generated with 4 different target LCD values of 5, 15, 25, and 35 Å, and the generated structures showed clearly distinctive distributions according to their target values (Supplementary Fig. 12). From these experiments, we validated MOFFUSION’s capability for conditional generation with numeric properties. Notably, we tested both on a simple geometric parameter (LCD) and a relatively complicated and computationally expensive parameter (hydrogen WC), confirming that a wide range of properties can be targeted according to the user’s interest.

Conditional generation on categorical data

Some properties of materials are categorical, which means they can be classified into several labels (or classes) according to certain criteria. A notable example is the magnetism of materials, where materials can be classified as ferromagnetic, paramagnetic and diamagnetic⁶⁴. For porous materials, properties such as pore dimensionality, thermal/chemical stability and synthesizability are of great interest to both experimentalists and computational scientists^65,66. However, these categorical properties are rarely investigated with the purpose of inverse design.

As a case study for the conditional generation of categorical data, MOF topology was selected as the conditioning target. Among the 605 topologies comprising the dataset, the ability of MOFFUSION to generate samples with a certain desired topology was evaluated. As illustrated in Fig. 4a, the latent diffusion model successfully generated SDFs that accompanied the geometric features of the desired topology. The output MOF structures obtained through MOF Constructor exhibited the target topology, achieving an accuracy of 96.0 % in an experiment conducted on 10 randomly selected target topologies with 1000 generation trials each. In a subsequent test focused on metal nodes, MOFFUSION demonstrated its capability to selectively generate structures with the desired metal node (Supplementary Fig. 13). Through these tests, in addition to numeric data, the ability of MOFFUSION to perform conditional generation for categorical data was verified, and its potential extension to other categorical properties is promising.

**Fig. 4: Conditional generation with various data modalities.**

Conditional generation on text data

Natural language processing (NLP) is a field of artificial intelligence that focuses on the interaction between humans and computers using natural language⁶⁷. NLP is a rapidly evolving field with applications across various domains, such as healthcare⁶⁸ and education⁶⁹. Understanding human language is also crucial for generative models, as it can significantly lower the barrier for users to access these models. Especially, it empowers nonspecialists without domain knowledge to express desired features through their natural language, thus significantly reducing the difficulty of the entire generation procedure⁷⁰. It is our opinion that conditional generation based on natural language written as text is one of the directions that generative models will ultimately pursue.

We examined MOFFUSION for the conditional generation based on text data, as one of the first attempts at text-based conditional generation of porous materials. We prepared training data in the form of text containing the information about the void fraction and topology of MOFs (Supplementary Fig. 14a). We utilized SciBERT⁷¹, a pre-trained language model for scientific text, to process the text with the expectation that the pre-trained scientific literature would help it understands scientific terms such as ‘void fraction’. As illustrated in Supplementary Fig. 11, we added an additional projection layer to handle the text embedding obtained from SciBERT, tailoring it more specifically to this task. The information processed through the projection layer was integrated into the denoising process of MOFFUSION using cross-attention algorithms⁷².

Figure 4b illustrates the conditional generation of MOFFUSION trained with text data. With the use of input prompts, MOFFUSION successfully generated structures with the desired void fraction and topology, respectively. Notably, text-based topology conditioning achieved a high accuracy of 88.6% in experiments with random 10 topologies, a value comparable to those obtained by directly utilizing categorical data. Further details on text conditioning, as well as the description on the SciBERT tokenizer can be found in the Supplementary Note #4-4.

The capability of MOFFUSION for text conditioning in MOF generation was verified in this section. Although we used simplified text data with a fixed format for training, one future direction is to use more flexible text formats. The incorporation of flexible training text data would allow users to employ unrestricted and more colloquial input prompts, thereby facilitating smoother communication. Additionally, the utilization of large language models (LLMs) presents an attractive approach for both generating and processing text data^73,74. In practice, several previous works have reported the integration of LLMs into materials science, aiming to achieve user-friendly interfaces and applications^75,76,77,78. Otherwise, in contrast to our use of the frozen SciBERT model, fine-tuning the language model specifically for the materials domain could also be a promising method for improving the overall performance and enabling more versatile text conditioning.

Multi-target conditioning

Up to this point, MOFFUSION’s capability to handle various data modalities has been demonstrated. While analysis has been focused on a single target property thus far, this section evaluates the conditional generation of MOFFUSION targeting multiple properties simultaneously. The initial focus was on two numeric properties: void fraction and surface area. The conditioning mechanism is the same as in the case of conditioning on a single numeric property (i.e., concatenating a numeric tensor, as depicted in Supplementary Fig. 11), but the number of properties (therefore the number of tensors to concatenate) has increased to two. Conditional generation was tested on two different sets of target values, and the generated structures exhibited distributions of the target properties clustered around their respective target value set (Fig. 5a). This validates that MOFFUSION is capable of handling both void fraction and surface area at the same time during conditional generation.

Subsequently, multi-conditioning targeted on two properties with different modalities was further examined. The type of metal node and LCD were targeted, representing a combination of categorical data and numeric data. Notably, this kind of combination can be particularly interesting for the experimentalists. There could be situations where experimentalists need to fabricate structures exhibiting specific characteristics using particular (or restricted) building components. For instance, one might need a structure composed of Cu paddle wheels (e.g., due to their abundance or its chemical nature such as open metal sites) with a pore size of 15 Å (e.g., for separation purposes). However, it may not be straightforward for experimentalists to determine which topology and organic linker would be appropriate for this task. In such cases, conditioning both on building blocks and target properties could provide valuable guidance.

Similar to the case of considering each single target, pore diameter (a numeric data) was conditioned by concatenating a numeric tensor, while metal node (a categorical data) was processed with an encoder before its incorporation into the diffusion model (Supplementary Fig. 11). Figure 5b illustrates the generation results for two target nodes (N206, N599) and three target pore diameters (5, 15, and 25 Å). MOFFUSION generated structures with desired nodes with an accuracy of 39.1 %, and generated structures exhibited significantly distinctive distributions of LCD with peaks located at the desired target values. This highlights the successful conditioning of MOFFUSION on multi-targets with different modalities, and verifies its ability to handle multiple target data during the generation task.

User-tailored pore generation

In the field of generative models, preserving a designated portion of the original data with modifying or regenerating the rest is often a key focus. In image processing, this process is known as inpainting⁷⁹, and in 3D objects generation, it is referred to as shape-completion⁸⁰. By inspired by such studies, we devised an intrinsic feature of MOFFUSION of tailoring the pore structures of the structures being generated. Similarly with designating the region (or portion) that want to be preserved during inpainting or shape-completion, by designating the region within the unit cell where pore to be located, users can finely tune the pore morphology of the structures being generated. We named this process as ‘pore crafting’.

For pore crafting, we employed the blended diffusion methodology proposed by Avrahami et al.⁸¹. In their work, to generate a seamless result where the masked region complies with the original image, progressively generated noisy images were kept blended with the corresponding noisy version of the original image of the masked region. We followed similar procedures, where placing pores in a desired region is analogous to blending a given image with a blank sketchbook. During the diffusion process, a region designated to become a pore is kept blended with a latent representation of an empty SDF, resulting in an output that features an empty void in the desired region.

Figure 5c provides qualitative analysis of MOFFUSION’s pore crating capabilities. For the convenience of users, the desired pore region can be specified using the range of the fractional coordinates of each axis. For example, to obtain structures with an empty pocket located at the center, the region of x: [0.25, 0.75], y: [0.25, 0.75], z: [0.25, 0.75] is designated as a desirable pore region. It can be visually confirmed that structures with pockets located in the centers were successfully generated as desired. For the case of a 1D pore channel, a query of x: [0.25, 0.75], y: [0, 1], z: [0.25, 0.75] was provided, resulting in the generation of structures with a channel penetrating the y-axis.

While the generation of structures with desired chemical properties is of great interest, finely tailoring the pore morphology of porous materials is one of the ultimate goals in materials science, which holds immense potentials. For example, through the fine tailoring of pore structures, porous materials with optimal pore morphologies can be designed for much efficient gas storage or separation, and the catalytic activity of the material can be delicately controlled by adjusting the exposure of active sites. Additionally, more precise delivery and release of the drugs could be feasible through target drug-specific pore engineering. MOFFUSION handles this task for the first time, and we hope that this work serves as a cornerstone for achieving comprehensive materials generation with user-tailored pore morphologies.

Discussion

In this work, we present MOFFUSION, a multi-modal conditional diffusion model for the flexible conditioning of metal-organic frameworks generation. To accurately capture the pore morphologies of MOFs, MOFFUSION uses SDFs as an input representation of MOFs, marking the first attempt to utilize SDFs for generative models in the context of porous materials. MOFFUSION’s generation performance significantly outperformed existing methods, demonstrating the novelty of its architecture and the effectiveness of SDFs in describing porous materials. Additionally, MOFFUSION’s capability to handle various data modalities, including numeric, categorical, text, and their combinations is validated. Lastly, we showcase its ability to generate structures with user-tailored pore structures, which holds unlimited potential in various applications.

One observation made throughout the workflow is that, as with other generative models, MOFFUSION performs well with interpolation problems but faces relatively more difficulties with extrapolation. In other words, when there are many data points near a target property value, the model performs well in conditionally generating materials with the desired property, resulting in a sharp distribution in the target property of the generated samples. However, if there are not enough data points near the target value, conditional generation still works but tends to produce a relatively broader distribution. Specifically, the distribution often shifts toward regions where data points are more abundant, and this further emphasizes the necessity of preparing a well-distributed training dataset.

One plausible way to further enhance the versatility of MOFFUSION is by incorporating additional channels (in addition to SDFs) that focus more on chemical features. As SDFs mainly focus on the physical shape of the pore structures, there could exist some limitations in delicately capturing the chemical behavior at the atom-scale. In particular, as shown in Supplementary Fig. 6d of the Supplementary Information, the main reason for the reduction in the accuracy of MOF Constructor was a confusion between similar-looking functional groups. Therefore, to improve MOFFUSION’s performance in understanding the deeper chemical nature, an additional channel that effectively captures the intrinsic chemistry of MOFs could be helpful. An energy grid⁸², which calculates the interaction energy between probe gas and the given structure at each grid point, is a simple and attractive candidate for this purpose. Furthermore, explicit consideration of translational and/or rotational invariance could be an option worth considering for enhancing the overall performance of the architecture.

Accounting for the synthesizability of structures is also an interesting future direction. A primary challenge in materials design using generative models is the gap between hypothetical structures and those exist in the real world. Guiding the generation process toward the structures likely to be synthesized is considered a practical solution, but assessing the synthesizability of structures in a virtual space is a complex problem. Several metrics have been proposed to estimate the synthetic likelihood of hypothetical MOFs, often based on energy criteria, although these methods are not perfect⁶⁵. However, we hope to emphasize again that, once a new method for predicting synthesizability is developed, it can be easily incorporated into MOFFUSION’s conditional generation process. Whether the new method predicts the synthesizability as a numeric value (e.g., based on the relative energies of structures), or classifies it (e.g., highly likely, likely, less likely, or not likely), MOFFUSION’s flexible conditioning can accommodate any approach and enable the generation of structures with a high synthetic likelihood. We anticipate the MOFFUSION will find broader application in various meaningful areas or serve as a foundation for developing more versatile and practical methodologies.

Methods

Details on diffusion process

Diffusion model is a branch of probabilistic models that learns a data distribution by denoising a Gaussian variable. It involves two key processes: (1) the forward process (or noising process) and (2) the reverse process (or denoising process), which operate in opposite directions. Within the forward process, which is denoted as $q\left({X}_{0:T}\right)$, a data sample X is gradually converted into pure Gaussian noise. This transformation is in the form of a Markov chain, which can be expressed as follow:

$$q\left({X}_{0:T}\right)=q\left({X}_{0}\right){\prod}_{t=1}^{T}q\left({X}_{t}|{X}_{t-1}\right),$$

(1)

$$q\left({X}_{t}|{X}_{t-1}\right)={{{\mathcal{N}}}}\left({X}_{t};\sqrt{1-{\beta }_{t}}{X}_{t-1},{\beta }_{t}I\right),$$

(2)

where ${\beta }_{t}$ is a hyperparameter for the noise scheduling. Utilizing the reparameterization trick, X can be sampled at any given time step t during the forward process:

$$q\left({X}_{t}|{X}_{0}\right)={{{\mathcal{N}}}}\left({X}_{t};\sqrt{{\bar{\alpha }}_{t}\varepsilon }{X}_{0},\left(1-{\bar{\alpha }}_{t}\right)I\right),{where} \, {\alpha }_{t}=1-{\beta }_{t},\,{\bar{\alpha }}_{t}={\prod}_{s=1}^{t}{\alpha }_{s},$$

(3)

$${X}_{t}=\sqrt{\bar{\alpha }}{X}_{0}+\sqrt{1-{\bar{\alpha }}_{t}}\varepsilon,\varepsilon \, { \sim } \, {{{\mathcal{N}}}}\left({{{\bf{0}}}},\,{{{\boldsymbol{I}}}}\right)$$

(4)

In the reverse process, which is denoted as ${p}_{\theta }\left({X}_{0:T}\right)$, a data sample is reconstructed by gradually removing noise from a pure Gaussian. This reverse process is also characterized as a Markov chain and is learned using the parameter θ:

$${p}_{\theta }\left({X}_{0:T}\right)=p\left({X}_{T}\right) {\prod}_{t=1}^{T}{p}_{\theta }\left({X}_{t-1}|{X}_{t}\right),$$

(5)

$${p}_{\theta }\left({X}_{t-1}{|X}\right)={{{\mathcal{N}}}}\left({X}_{t-1};{\mu }_{\theta }({X}_{t},t),{\sigma }_{t}^{2}{{{\boldsymbol{I}}}}\right),$$

(6)

where ${p}_{\theta }\left({X}_{T}\right)={{{\mathcal{N}}}}\left({X}_{T};{{{\boldsymbol{0}}}},{{{\boldsymbol{I}}}}\right)$ denotes the standard Gaussian distribution, ${\mu }_{\theta }\left({X}_{t},t\right)$ indicates a denoising network parameterized by θ, and ${\sigma }_{t}^{2}$ is a variance depending on the time step. ${X}_{t-1}$ can be sampled as follows:

$${X}_{t-1}={\mu }_{\theta }\left({X}_{t},t\right)+{\sigma }_{t}\epsilon,\varepsilon \, { \sim } \, {{{\mathcal{N}}}}\left({{{\bf{0}}}},\,I\right)$$

(7)

Then, ${\mu }_{\theta }({X}_{t},t)$ can be simplified as following using the approach proposed by Ho et al.⁸³

$${\mu }_{\theta }\left({X}_{t},t\right)=\frac{1}{\sqrt{{\alpha }_{t}}} \left({X}_{t}-\frac{{\beta }_{t}}{\sqrt{1-{\bar{\alpha }}_{t}}}{\varepsilon }_{\sigma }\left({X}_{t},\,t\right)\right)$$

(8)

where ${\epsilon }_{\sigma }({X}_{t},t)$ is a neural network that predicts noise from the noisy input ${X}_{t}$. ${\epsilon }_{\sigma }({X}_{t},t)$ is trained aiming to minimize the mean squared error between the predicted noise and the actual noise ϵ, as follows:

$$L={{||}{\epsilon }_{\sigma }\left({X}_{t},t\right)-\epsilon {||}}_{2}^{2}$$

(9)

Meanwhile, various conditions (${c}_{b}$) was introduced into the generation process, resulting in the reverse process as formulated with ${p}_{\theta }({X}_{t-1}{|X},\,{c}_{b})$, and the final objective for the training is defined as below:

$$L={{||}{\epsilon }_{\sigma }\left({X}_{t},t,\,{c}_{b}\right)-\epsilon {||}}_{2}^{2}$$

(10)

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The datasets used in the current study have been deposited in https://figshare.com/articles/dataset/MOFFUSION_dataset/27908811?file=50813748 (for cif files) and https://drive.google.com/file/d/1Voas6pRkz3dnooQ7gaZ1R9vXD1xD21tw/view?pli=1 (for the corresponding SDFs). Source data are provided with this paper.

Code availability

A GitHub repository containing the source code and demos for the structure generation of various conditioning targets is available at https://github.com/parkjunkil/MOFFUSION, and https://doi.org/10.5281/zenodo.14090274.

References

Zhang, X. et al. Optimization of the pore structures of MOFs for record high hydrogen volumetric working capacity. Adv. Mater. 32, 1907995 (2020).
Article CAS Google Scholar
Nguyen, T., Shimizu, G. & Rajendran, A. Post-Combustion CO2 capture by vacuum swing adsorption using a hydrophobic metal-organic framework (MOF), CALF-20: Multi-objective optimization and experimental validation (2022).
Li, B., Wen, H.-M., Zhou, W., Xu, J. Q. & Chen, B. Porous metal-organic frameworks: promising materials for methane storage. Chem 1, 557–580 (2016).
Article CAS MATH Google Scholar
Wang, Q. & Astruc, D. State of the art and prospects in metal–organic framework (MOF)-based and MOF-derived nanocatalysis. Chem. Rev. 120, 1438–1511 (2019).
Article PubMed MATH Google Scholar
Doonan, C. J. & Sumby, C. J. Metal–organic framework catalysis. CrystEngComm 19, 4044–4048 (2017).
Article CAS Google Scholar
Shan, Y., Zhang, G., Shi, Y. & Pang, H. Synthesis and catalytic application of defective MOF materials. Cell Rep. Phys. Sci. 4 (2023).
Mallakpour, S., Nikkhoo, E. & Hussain, C. M. Application of MOF materials as drug delivery systems for cancer therapy and dermal treatment. Coord. Chem. Rev. 451, 214262 (2022).
Article CAS Google Scholar
Suresh, K. & Matzger, A. J. Enhanced drug delivery by dissolution of amorphous drug encapsulated in a water unstable metal–organic framework (MOF). Angew. Chem. Int. Ed. 58, 16790–16794 (2019).
Article CAS Google Scholar
Poonia, K. et al. Recent advances in Metal Organic Framework (MOF)-based hierarchical composites for water treatment by adsorptional photocatalysis: A review. Environ. Res. 222, 115349 (2023).
Article CAS PubMed MATH Google Scholar
Zhou, W. et al. MOF derived metal oxide composites and their applications in energy storage. Coord. Chem. Rev. 477, 214949 (2023).
Article CAS Google Scholar
Chung, Y. G. et al. Advances, updates, and analytics for the computation-ready, experimental metal–organic framework database: CoRE MOF 2019. J. Chem. Eng. Data 64, 5985–5998 (2019).
Article CAS MATH Google Scholar
Moghadam, P. Z. et al. Development of a Cambridge Structural Database subset: a collection of metal–organic frameworks for past, present, and future. Chem. Mater. 29, 2618–2625 (2017).
Article CAS MATH Google Scholar
Lee, S. et al. Computational screening of trillions of metal–organic frameworks for high-performance methane storage. ACS Appl. Mater. Interfaces 13, 23647–23654 (2021).
Article CAS PubMed Google Scholar
Boyd, P. G. et al. Data-driven design of metal–organic frameworks for wet flue gas CO2 capture. Nature 576, 253–256 (2019).
Article ADS CAS PubMed MATH Google Scholar
Avci, G., Velioglu, S. & Keskin, S. High-throughput screening of MOF adsorbents and membranes for H2 purification and CO2 capture. ACS Appl. Mater. interfaces 10, 33693–33706 (2018).
Article CAS PubMed PubMed Central Google Scholar
Ahmed, A. et al. Exceptional hydrogen storage achieved by screening nearly half a million metal-organic frameworks. Nat. Commun. 10, 1568 (2019).
Article ADS PubMed PubMed Central MATH Google Scholar
Colón, Y. J. & Snurr, R. Q. High-throughput computational screening of metal–organic frameworks. Chem. Soc. Rev. 43, 5735–5749 (2014).
Article PubMed MATH Google Scholar
Park, J., Lim, Y., Lee, S. & Kim, J. Computational design of metal–organic frameworks with unprecedented high hydrogen working capacity and high synthesizability. Chem. Mater. 35, 9–16 (2022).
Article MATH Google Scholar
Lim, Y., Park, J., Lee, S. & Kim, J. Finely tuned inverse design of metal–organic frameworks with user-desired Xe/Kr selectivity. J. Mater. Chem. A 9, 21175–21183 (2021).
Article CAS Google Scholar
Chung, Y. G. et al. In silico discovery of metal-organic frameworks for precombustion CO2 capture using a genetic algorithm. Sci. Adv. 2, e1600909 (2016).
Article ADS PubMed PubMed Central Google Scholar
Comlek, Y., Pham, T. D., Snurr, R. Q. & Chen, W. Rapid design of top-performing metal-organic frameworks with qualitative representations of building blocks. npj Comput. Mater. 9, 170 (2023).
Article ADS CAS Google Scholar
Ghude, S. & Chowdhury, C. Exploring Hydrogen Storage Capacity in Metal‐Organic Frameworks: A Bayesian Optimization Approach. Chem.–A Eur. J. 29, e202301840 (2023).
Article CAS Google Scholar
Taw, E. & Neaton, J. B. Accelerated discovery of CH4 uptake capacity metal–organic frameworks using bayesian optimization. Adv. Theory Simul. 5, 2100515 (2022).
Article CAS Google Scholar
Yao, Z. et al. Inverse design of nanoporous crystalline reticular materials with deep generative models. Nat. Mach. Intell. 3, 76–86 (2021).
Article MATH Google Scholar
Hawkins-Hooker, A. et al. Generating functional protein variants with variational autoencoders. PLoS Comput. Biol. 17, e1008736 (2021).
Article CAS PubMed PubMed Central Google Scholar
Dan, Y. et al. Generative adversarial networks (GAN) based efficient sampling of chemical composition space for inverse design of inorganic materials. npj Comput. Mater. 6, 84 (2020).
Article ADS CAS MATH Google Scholar
Kim, B., Lee, S. & Kim, J. Inverse design of porous materials using artificial neural networks. Sci. Adv. 6, eaax9324 (2020).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Idrees, K. B. et al. Tailoring pore aperture and structural defects in zirconium-based metal–organic frameworks for krypton/xenon separation. Chem. Mater. 32, 3776–3782 (2020).
Article CAS MATH Google Scholar
Shin, S., Yoon, H., Yoon, Y., Park, S. & Shin, M. W. Porosity tailoring of the Zn-MOF-5 derived carbon materials and its effects on the performance as a cathode for lithium-air batteries. Microporous Mesoporous Mater. 311, 110726 (2021).
Article CAS MATH Google Scholar
Batra, R., Chen, C., Evans, T. G., Walton, K. S. & Ramprasad, R. Prediction of water stability of metal–organic frameworks using machine learning. Nat. Mach. Intell. 2, 704–710 (2020).
Article MATH Google Scholar
Healy, C. et al. The thermal stability of metal-organic frameworks. Coord. Chem. Rev. 419, 213388 (2020).
Article CAS MATH Google Scholar
Liu, S. et al. Multi-modal molecule structure–text model for text-based retrieval and editing. Nat. Mach. Intell. 5, 1447–1457 (2023).
Article MATH Google Scholar
Wang, Z. et al. Instructprotein: Aligning human and protein language via knowledge instruction. arXiv Prepr. arXiv 2310, 03269 (2023).
ADS Google Scholar
Zhu, H., Xiao, T. & Honavar, V. G. 3M-Diffusion: Latent Multi-Modal Diffusion for Text-Guided Generation of Molecular Graphs. arXiv Prepr. arXiv 2403, 07179 (2024).
Google Scholar
Rombach, R., Blattmann, A., Lorenz, D., Esser, P. & Ommer, B. in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 10684–10695.
Ruan, L. et al. in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 10219-10228.
Cheng, Y.-C., Lee, H.-Y., Tulyakov, S., Schwing, A. G. & Gui, L.-Y. in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 4456–4465.
Park, J., Gill, A. P. S., Moosavi, S. M. & Kim, J. Inverse design of porous materials: a diffusion model approach. J. Mater. Chem. A (2024).
Hoogeboom, E., Satorras, V. G., Vignac, C. & Welling, M. in International conference on machine learning. 8867-8887 (PMLR).
Igashov, I. et al. Equivariant 3D-conditional diffusion model for molecular linker design. Nat. Mach. Intell., 1–11 (2024).
Yim, J. et al. Diffusion models in protein structure and docking. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 14, e1711 (2024).
CAS MATH Google Scholar
Fu, C. et al. in Learning on Graphs Conference. 29: 21–29: 17 (PMLR).
Xie, T., Fu, X., Ganea, O.-E., Barzilay, R. & Jaakkola, T. Crystal diffusion variational autoencoder for periodic material generation. arXiv Prepr. arXiv 2110, 06197 (2021).
MATH Google Scholar
Park, H. et al. Ghp-mofassemble: Diffusion modeling, high throughput screening, and molecular dynamics for rational discovery of novel metal-organic frameworks for carbon capture at scale. arXiv Prepr. arXiv 2306, 08695 (2023).
Google Scholar
Fu, X., Xie, T., Rosen, A. S., Jaakkola, T. & Smith, J. MOFDiff: Coarse-grained Diffusion for Metal-Organic Framework Design. arXiv Prepr. arXiv 2310, 10732 (2023).
ADS Google Scholar
Thompson, A. P. et al. LAMMPS-a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales. Comput. Phys. Commun. 271, 108171 (2022).
Article CAS MATH Google Scholar
Park, J. J., Florence, P., Straub, J., Newcombe, R. & Lovegrove, S. in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 165–174.
Lee, Y. & Kim, J. ShapeProt: Top-down Protein Design with 3D Protein Shape Generative Model. bioRxiv, 2023.2012. 2003.567710 (2023).
Tang, Y. M. & Ho, H. L. in mixed reality and three-dimensional computer graphics (IntechOpen, 2020).
Zhang, M. et al. Fine tuning of MOF‐505 analogues to reduce low‐pressure methane uptake and enhance methane working capacity. Angew. Chem. 129, 11584–11588 (2017).
Article ADS MATH Google Scholar
Zhou, S. et al. Asymmetric pore windows in MOF membranes for natural gas valorization. Nature 606, 706–712 (2022).
Article ADS CAS PubMed MATH Google Scholar
Suh, B. L., Hyun, T., Koh, D.-Y. & Kim, J. Rational tuning of ultramicropore dimensions in MOF-74 for size-selective separation of light hydrocarbons. Chem. Mater. 33, 7686–7692 (2021).
Article CAS MATH Google Scholar
Liang, Z., Qiu, T., Gao, S., Zhong, R. & Zou, R. Multi‐scale design of metal–organic framework‐derived materials for energy electrocatalysis. Adv. Energy Mater. 12, 2003410 (2022).
Article CAS Google Scholar
Xie, T. & Grossman, J. C. Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties. Phys. Rev. Lett. 120, 145301 (2018).
Article ADS CAS PubMed MATH Google Scholar
Van Den Oord, A. & Vinyals, O. Neural discrete representation learning. Advances in neural information processing systems 30 (2017).
Van der Maaten, L. & Hinton, G. Visualizing data using t-SNE. J. Mach. Learn. Res. 9 (2008).
He, K., Zhang, X., Ren, S. & Sun, J. in Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778 (2016).
Castelli, I. E. et al. Computational screening of perovskite metal oxides for optimal solar light capture. Energy Environ. Sci. 5, 5814–5819 (2012).
Article CAS MATH Google Scholar
Pickard, C. J. AIRSS data for carbon at 10GPa and the C+N+H+O system at 1GPa, https://archive.materialscloud.org/record/2020.0026/v1 (2020).
Altomare, A. et al. OChemDb: the free online Open Chemistry Database portal for searching and analysing crystal structure information. J. Appl. Crystallogr. 51, 1229–1236 (2018).
Article ADS CAS MATH Google Scholar
Ho, J. & Salimans, T. Classifier-free diffusion guidance. arXiv Prepr. arXiv 2207, 12598 (2022).
ADS MATH Google Scholar
David, W. I. Effective hydrogen storage: a strategic chemistry challenge. Faraday Discuss. 151, 399–414 (2011).
Article ADS CAS PubMed MATH Google Scholar
Moradi, R. & Groth, K. M. Hydrogen storage and delivery: Review of the state of the art technologies and risk and reliability analysis. Int. J. Hydrog. Energy 44, 12254–12269 (2019).
Article ADS CAS MATH Google Scholar
Kaufman, A. A., Hansen, R. O. & Kleinberg, R. L. Paramagnetism, Diamagnetism, and Ferromagnetism. Methods Geochem. Geophysics 42, 207–254 (2008).
Article MATH Google Scholar
Anderson, R. & Gómez-Gualdrón, D. A. Large-scale free energy calculations on a computational metal–organic frameworks database: toward synthetic likelihood predictions. Chem. Mater. 32, 8106–8119 (2020).
Article CAS MATH Google Scholar
Jang, J., Gu, G. H., Noh, J., Kim, J. & Jung, Y. Structure-based synthesizability prediction of crystals using partially supervised learning. J. Am. Chem. Soc. 142, 18836–18843 (2020).
Article CAS PubMed Google Scholar
Chowdhary, K. & Chowdhary, K. Natural language processing. Fund. Artific. Intell. 603–649 (2020).
Zhou, B., Yang, G., Shi, Z. & Ma, S. Natural language processing for smart healthcare. IEEE Reviews in Biomedical Engineering (2022).
Alhawiti, K. M. Natural language processing and its use in education. Int. J. Adv. Comput. Sci. Appl. 5 (2014).
Luo, Y. et al. Text-guided Diffusion Model for 3D Molecule Generation. arXiv Prepr. arXiv 2410, 03803 (2024).
Google Scholar
Beltagy, I., Lo, K. & Cohan, A. SciBERT: A pretrained language model for scientific text. arXiv Prepr. arXiv 1903, 10676 (2019).
ADS MATH Google Scholar
Vaswani, A. et al. Attention is all you need. Advances in neural information processing systems 30 (2017).
Touvron, H. et al. Llama: Open and efficient foundation language models. arXiv Prepr. arXiv 2302, 13971 (2023).
ADS Google Scholar
Bubeck, S. et al. Sparks of artificial general intelligence: Early experiments with gpt-4. arXiv Prepr. arXiv 2303, 12712 (2023).
ADS MATH Google Scholar
M. Bran, A. et al. Augmenting large language models with chemistry tools. Nat. Mach. Intell. 1–11 (2024).
Kang, Y. & Kim, J. Chatmof: An autonomous ai system for predicting and generating metal-organic frameworks. arXiv Prepr. arXiv 2308, 01423 (2023).
Google Scholar
Ansari, M., Watchorn, J., Brown, C. E. & Brown, J. S. dZiner: Rational Inverse Design of Materials with AI Agents. arXiv Prepr. arXiv 2410, 03963 (2024).
MATH Google Scholar
Liu, Z. et al. Post-Pretraining Large Language Model Enabled Reverse Design of MOFs for Hydrogen Storage. (2024).
Lugmayr, A. et al. in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 11461–11471 (2022).
Zhou, L., Du, Y. & Wu, J. in Proceedings of the IEEE/CVF international conference on computer vision. 5826–5835 (2021).
Avrahami, O., Lischinski, D. & Fried, O. in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 18208–18218 (2022).
Bucior, B. J. et al. Energy-based descriptors to rapidly predict hydrogen storage in metal–organic frameworks. Mol. Syst. Des. Eng. 4, 162–174 (2019).
Article CAS Google Scholar
Ho, J., Jain, A. & Abbeel, P. Denoising diffusion probabilistic models. Adv. neural Inf. Process. Syst. 33, 6840–6851 (2020).
Google Scholar

Download references

Acknowledgements

This project was supported by National Research Foundation of Korea (NRF) under grant No. RS-2024-00337004, RS-2024-00451160, and RS-2024-00435493.

Author information

These authors contributed equally: Junkil Park, Youhan Lee.

Authors and Affiliations

Department of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea
Junkil Park & Jihan Kim
NVIDIA Corporation, Santa Clara, CA, USA
Youhan Lee

Authors

Junkil Park
View author publications
Search author on:PubMed Google Scholar
Youhan Lee
View author publications
Search author on:PubMed Google Scholar
Jihan Kim
View author publications
Search author on:PubMed Google Scholar

Contributions

J.P. and Y.L. contributed equally to this work. J.P. and Y.L. developed the code and carried out the analysis. J.K. supervised the project. The manuscript was written with contributions from all authors.

Corresponding author

Correspondence to Jihan Kim.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Eliu Huerta and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Transparent Peer Review file

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Park, J., Lee, Y. & Kim, J. Multi-modal conditional diffusion model using signed distance functions for metal-organic frameworks generation. Nat Commun 16, 34 (2025). https://doi.org/10.1038/s41467-024-55390-9

Download citation

Received: 11 July 2024
Accepted: 05 December 2024
Published: 02 January 2025
DOI: https://doi.org/10.1038/s41467-024-55390-9