From bulk effective mass to 2D carrier mobility accurate prediction via adversarial transfer learning

Chen, Xinyu; Lu, Shuaihua; Chen, Qian; Zhou, Qionghua; Wang, Jinlan

doi:10.1038/s41467-024-49686-z

Article
Open access
Published: 25 June 2024

Xinyu Chen¹,
Shuaihua Lu¹,
Qian Chen¹,
Qionghua Zhou^1,2 &
…
Jinlan Wang ORCID: orcid.org/0000-0002-4529-874X^1,2

Nature Communications volume 15, Article number: 5391 (2024) Cite this article

An Author Correction to this article was published on 31 July 2024

This article has been updated

Abstract

Data scarcity is one of the critical bottlenecks to utilizing machine learning in material discovery. Transfer learning can use existing big data to assist property prediction on small data sets, but only if the large and small data sets are strongly correlated. To extend its applicability to scenarios with different properties and materials, here we develop a hybrid framework combining adversarial transfer learning and expert knowledge, which enables the direct prediction of carrier mobility of two-dimensional (2D) materials using the knowledge learned from bulk effective mass. Specifically, adversarial training ensures that only common knowledge between bulk and 2D materials is extracted, while expert knowledge is incorporated to further improve the prediction accuracy and generalizability. As a result, 2D carrier mobilities are predicted with an accuracy of over 90% from the crystal structure alone, and 21 2D semiconductors with carrier mobilities far exceeding that of silicon and a suitable bandgap are screened out. This work enables transfer learning in simultaneous cross-property and cross-material scenarios, providing an effective tool to predict intricate material properties with limited data.

Introduction

Data-driven machine learning (ML) has succeeded in rapidly predicting material properties for data-rich systems such as perovskites^1,2, alloys^3,4, and catalysis^5,6. Properties including formation energy⁷, stability⁸, and bandgap⁹ can be predicted almost instantaneously, significantly accelerating material discovery compared with the traditional trial-and-error approach using experiments and simulations¹⁰. ML heavily relies on the quantity and quality of training data as a data-driven approach. However, high-fidelity data for complex properties are often insufficient, compromising its prediction accuracy^11,12. In addition, data insufficiency may also cause incompleteness, which can lead to the ML model constantly suffering from overfitting and poor generalizability¹³.

Transfer learning is a machine-learning technique that can improve the performance of learners on small datasets (target domain) by transferring knowledge from different but large datasets (source domain). It has been considered a very promising approach to address the data scarcity challenge in ML-assisted material design^14,15. For example, Liu et al. successfully predicted phonon properties of bulk semiconductors by training on 1245 electronic bandgaps and fine-tuning on 124 phonon bandgaps¹⁶. Similarly, Li et al. accurately predicted the formation energy of perovskite oxides by training on 5329 spinel oxides and fine-tuning on 855 perovskite oxides¹⁷. However, current transfer learning applications are either between different properties of the same materials (cross-property) or between different materials with the same property (cross-material)^18,19,20,21,22,23,24,25. This is because the effectiveness of transfer learning depends strongly on the difference between the source and target domains: if the domain difference is too large, transfer is ineffective and may even give poorer predictions, i.e., negative transfer²⁶.

In practical applications, the problem of data scarcity becomes even more pronounced, as extensive databases typically cover only fundamental properties of widely used materials, whereas the focus is often on a particular category of materials and on their more complex properties. Carrier mobility in atomically thin 2D semiconductors is a typical example. 2D materials with a suitable bandgap and high carrier mobility are expected to facilitate continued transistor scaling^27,28. However, evaluating carrier mobility is costly and often requires extensive density functional theory calculations; as a result, the available data are very limited^29,30. In addition, 2D materials are themselves recent additions to the materials family, so data on them are also scarce. In contrast, bulk materials have been studied for much longer and have rich data available on diverse properties, among which the effective mass is believed to be closely related to carrier mobility. Naturally, we hope to utilize bulk effective mass data to enhance the prediction of 2D carrier mobility. However, owing to the diversity of 2D material structures and the complexity of their properties, simultaneous cross-material and cross-property transfer learning poses a greater challenge.

To achieve such simultaneous cross-material and cross-property transfer learning, we propose a hybrid framework that combines domain adversarial training and expert knowledge. The domain adversarial training method was first introduced in the realm of computer vision to learn common knowledge between different images³¹. Here, we employ a similar adversarial training concept to acquire common knowledge between different materials, and we incorporate a priori knowledge of chemistry to better describe the uniqueness of material properties. As a result, 2D carrier mobility can be predicted within an order of magnitude by simply inputting crystal structure files, and 21 semiconductors with ultrahigh carrier mobility (> 10⁴ cm²/V·s) and a suitable bandgap are screened out. This successful knowledge transfer across different materials and properties shows the potential to fully utilize existing data, and it may be an effective tool for material design with limited data.

Results

Hybrid transfer learning framework

Our transfer learning framework consists of two main components. The first part utilizes adversarial transfer learning (ATL) to extract shared features from both bulk materials and 2D materials, as shown in Fig. 1(a). The ATL model is composed of three multi-layer perceptron (MLP) models: a feature extractor, an effective mass regressor, and a data source classifier. The feature extractor transforms the initial input features, generated with the materials agnostic platform for informatics and exploration (MAGPIE)³², into a low-dimensional vector. This extractor can be applied to both bulk and 2D materials; initially, without any constraint, its output is essentially random. The extracted features are used to train the bulk effective mass regressor, and the regression loss is backpropagated to optimize the feature extractor. At this stage, the feature extractor learns the knowledge of effective mass and provides features closely related to it. In contrast to the standard approach, we train not only an effective mass regressor but also an additional data source classifier. This classifier is designed to determine whether the features are extracted from bulk or 2D materials, and thereby guides the feature extractor toward the features common to both types of materials. We achieve this by backpropagating the reversed classification loss to the feature extractor and iteratively training both extractor and classifier until the classifier can no longer identify the data source. During the training iterations, the feature extractor is trained to fool the classifier about the data source, while the classifier is trained to discriminate between the two sources. Hence, this process is referred to as adversarial training.
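The interplay of the three losses can be illustrated with a deliberately tiny sketch. It is not the authors' implementation (which uses three MLPs in PyTorch); it assumes a toy model in which the extractor, regressor, and classifier are each a single scalar weight, and the names `adversarial_step`, `lr`, and `lam` (the strength of the reversed gradient) are hypothetical. The only point it shows is the sign convention: the classifier descends its own loss, while the extractor descends the regression loss and ascends the classification loss.

```python
import math


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def adversarial_step(params, bulk, two_d, lr=0.05, lam=1.0):
    """One gradient step of a toy domain-adversarial model with scalar weights.

    params: dict with 'w' (feature extractor), 'a' (effective mass regressor),
            'c' (data source classifier); all hypothetical scalars.
    bulk:   list of (x, effective_mass) pairs from the source domain.
    two_d:  list of x values from the target domain (no mobility labels used).
    """
    w, a, c = params["w"], params["a"], params["c"]

    # Regression loss 0.5 * (a * w * x - y)^2, averaged over bulk samples.
    g_a = g_w_reg = 0.0
    for x, y in bulk:
        feature = w * x
        err = a * feature - y
        g_a += err * feature / len(bulk)
        g_w_reg += err * a * x / len(bulk)

    # Binary cross-entropy of the source classifier (label 1 = bulk, 0 = 2D).
    samples = [(x, 1.0) for x, _ in bulk] + [(x, 0.0) for x in two_d]
    g_c = g_w_cls = 0.0
    for x, label in samples:
        feature = w * x
        delta = sigmoid(c * feature) - label  # dLoss/dlogit
        g_c += delta * feature / len(samples)
        g_w_cls += delta * c * x / len(samples)

    return {
        # Extractor: follow the regression gradient, REVERSE the classifier one.
        "w": w - lr * (g_w_reg - lam * g_w_cls),
        "a": a - lr * g_a,
        "c": c - lr * g_c,  # classifier keeps trying to separate the sources
    }
```

With `lam=0.0` the step reduces to ordinary supervised training on the bulk data, which corresponds to the "without adversarial training" baseline discussed in the text.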

**Fig. 1: Schematic of adversarial transfer learning from bulk effective mass to 2D carrier mobility.**

The second component of our model involves the embedding of expert knowledge and provides a direct prediction of 2D carrier mobility, as illustrated in Fig. 1(b). The necessity of incorporating expert knowledge lies in the fact that the adversarial approach only ensures the extraction of common knowledge, but it lacks the description of the uniqueness of target materials and their properties. This is particularly critical in cases like 2D materials, where many interesting properties stem from their unique structures. Therefore, in addition to the features extracted by transfer learning, we add features from lattice symmetry, crystal geometry, and electronic properties to describe the unique 2D structures and their electronic behavior. From the perspective of deformation potential theory, we speculate that these features are closely related to carrier mobility. For example, the symmetry can affect phonon vibration mode, while the thickness is related to elastic modulus and reflects the strength of electron-phonon coupling, thus contributing to carrier mobility. The electronic features, such as electronegativity and valence electron distribution, are also regarded as important for carrier transport. The full feature list and description are presented in Supplementary Table 2. It is worth noting that these added features can be directly taken from the structure files without additional density functional theory (DFT) calculation, which is critical for quick-and-direct prediction.

To demonstrate the impact of adversarial training and expert knowledge on the performance of cross-material transfer learning, we conducted comparative tests, as shown in Fig. 2(a). When transfer learning is applied without adversarial training, the extracted features perform worse than MAGPIE (baseline), which is a typical negative transfer. This indicates that although the extracted features work in the source domain, they may not necessarily be helpful in the target domain, especially when the source and target domains are different materials with different properties. However, with the help of adversarial training, common knowledge between bulk and 2D materials is captured, and negative transfer is alleviated. It is also essential to recognize that many appealing properties of 2D materials stem from their unique structure. Therefore, additional features based on expert knowledge describe their special structures, which complements the knowledge acquired from the bulk materials and leads to more accurate predictions. This demonstrates the effectiveness and importance of leveraging adversarial training and expert knowledge to enhance the transfer learning performance.

**Fig. 2: Effective knowledge transfer enabled by adversarial transfer learning.**

To further investigate how adversarial transfer learning works, we utilized t-distributed stochastic neighbor embedding (t-SNE) to visualize the output differences between our transfer learning model with and without adversarial training, as shown in Fig. 2(b). In the input feature space, 2D and bulk materials are separated, which implies that their overlap is small and makes it challenging to transfer knowledge from bulk to 2D materials. Most of the bulk and 2D materials remain separated after transfer learning without adversarial training. After incorporating adversarial training, however, the extracted features no longer distinguish between bulk and 2D materials, and the two types of materials are mixed together in the t-SNE plot. This indicates that the features extracted by adversarial transfer learning are shared by both bulk and 2D materials, which improves the effectiveness of cross-material transfer learning.

Carrier mobility prediction and model interpretation

Our hybrid transfer learning framework has been applied to three tasks related to 2D carrier mobility. Figure 3(a) displays the prediction accuracy of carrier mobility under deformation potential theory (DPT), with R² values of 0.88 and 0.90 for the average electron and hole mobility, respectively, and a MAE of 0.19 for both mobilities. Importantly, our trained model only requires the crystal structure file as input when making carrier mobility predictions. This streamlined approach ensures the usability and efficiency. Compared to DFT-based mobility calculation, our approach is five orders of magnitude faster as shown in Supplementary Fig. 10. These results demonstrate that our models can provide accurate and efficient estimation of overall carrier mobility. In addition, feature importance analysis in Fig. 3(d) reveals that the most important predictors for average carrier mobility are ATL features, closely related to effective mass. Furthermore, electronic features such as valence electron distribution, which characterize the electronic distribution of materials, are also crucial predictors. Our models also accurately predict mobility anisotropy, as evidenced by the R² scores of 0.89 and MAE of 0.11 and 0.13 for electron and hole, respectively, as shown in Fig. 3(b). Symmetry features, such as space group and mirror symmetry, play a more prominent role in determining the mobility anisotropy, as shown in Fig. 3(e), compared to that of ATL features. These results indicate that mobility anisotropy is mainly determined by the symmetry of the material.
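To make the reported error figures easier to read, the following sketch computes R² and MAE in the usual way and converts a log-scale error into a multiplicative factor. It assumes, as the Methods suggest, that the MAE is measured on the log₁₀-transformed targets; under that assumption an MAE of 0.19 corresponds to a typical deviation of roughly a factor of 10^0.19 ≈ 1.5 in mobility. The function names are illustrative only.

```python
def mae(y_true, y_pred):
    """Mean absolute error between two equally long sequences."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def r2_score(y_true, y_pred):
    """Coefficient of determination, 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def log_error_to_factor(log10_error):
    """Translate an error in log10 units into a multiplicative factor."""
    return 10.0 ** log10_error
```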

**Fig. 3: Model performance and interpretation.**

Recently, a more accurate method to estimate carrier mobility was developed by solving the electron-phonon coupling (EPC) matrix, which provides a valuable re-evaluation of the carrier mobility of common 2D materials and gives results more consistent with experimental measurements³³. We further tested our method on predicting the EPC mobilities of transition metal dichalcogenides (TMDs), for which it shows high prediction accuracy, as illustrated in Fig. 3(c). The R² score reaches 0.95 for electrons and 0.89 for holes, which is similar to that of DPT mobility prediction. In contrast to the DPT case, the feature importance in Fig. 3(f) suggests that ATL features depicting effective mass are less important than electronic features and symmetries. This is in line with the finding that effective mass shows no obvious correlation with carrier mobility for 2D TMDs³⁴. Although the limited range of structure types hinders extrapolation, our method still gives accurate predictions, and it may become more useful as more high-accuracy data become available. This consistency and maintained accuracy indicate that our method can provide robust predictions with good generalization ability.

With the well-trained model, we can screen out 2D candidates for high carrier mobility and proper bandgap that is comparable to silicon. As shown in Fig. 4(a), 9115 2D materials were collected from two open-source databases, and a de-duplication process was carried out based on their formula and space group. Then we removed all metals, leaving 4266 semiconductors, of which 3109 are thermodynamically stable. Considering that silicon has a bandgap of around 1 eV, we selected 869 semiconductors with similar bandgaps ranging from 0.5 eV to 1.5 eV. Finally, the trained model was applied to predict their average carrier mobility $\bar{\mu }$, and 21 materials with electron or hole mobility higher than 10⁴cm²/V·s were screened out. We further validate the accuracy of our ML model by DFT calculations based on effective mass approximation and deformation potential approximation, which provide reliable estimations of carrier mobility at an acceptable computational cost. As shown in Supplementary Fig. 9, for both electron and hole mobility, our ML model gives consistent predictions with the R² score above 0.82, and the MAE values below 0.22, demonstrating the great predictive ability of our ML model. Note that among the screened 21 materials, some have already been synthesized or even been experimentally validated to have high mobility, such as In₄Se₃ and Nb₂SiTe₄^35,36,37.
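The screening funnel described above can be sketched as a sequence of filters. This is a minimal illustration, not the authors' script: the `Candidate` record, its field names, and the convention that a zero bandgap marks a metal are all assumptions, and the conversion of the 10⁴ cm²/V·s threshold into the log-normalized target simply divides by the silicon mobilities quoted in the Methods.

```python
import math
from dataclasses import dataclass

MU_SI_ELECTRON = 1331.0  # cm^2 V^-1 s^-1, silicon value quoted in Methods
MU_SI_HOLE = 283.5       # cm^2 V^-1 s^-1, silicon value quoted in Methods


@dataclass(frozen=True)
class Candidate:
    """Hypothetical record for one database entry."""
    formula: str
    space_group: str
    bandgap_ev: float   # assumption: 0.0 means metallic
    stable: bool        # thermodynamic stability flag from the database
    mu_bar_e: float     # predicted log10-normalized electron mobility
    mu_bar_h: float     # predicted log10-normalized hole mobility


def screen(candidates, gap_range=(0.5, 1.5), mu_threshold=1e4):
    """Apply de-duplication, metal/stability/bandgap filters, then mobility."""
    thr_e = math.log10(mu_threshold / MU_SI_ELECTRON)
    thr_h = math.log10(mu_threshold / MU_SI_HOLE)
    seen = set()
    hits = []
    for cand in candidates:
        key = (cand.formula, cand.space_group)
        if key in seen:          # de-duplicate on formula + space group
            continue
        seen.add(key)
        if cand.bandgap_ev <= 0.0 or not cand.stable:
            continue             # drop metals and unstable entries
        if not gap_range[0] <= cand.bandgap_ev <= gap_range[1]:
            continue             # keep silicon-like bandgaps only
        if cand.mu_bar_e > thr_e or cand.mu_bar_h > thr_h:
            hits.append(cand)    # high electron OR hole mobility
    return hits
```

Because the electron and hole targets are normalized by different silicon mobilities, the same absolute threshold maps to two different cut-offs in the model's output space (about 0.88 for electrons and 1.55 for holes under the assumption above).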

**Fig. 4: Rational discovery of 2D semiconductors with high carrier mobility.**

Figure 4(b) shows the element and crystal system distribution of selected materials with high carrier mobility. The most frequent elements belong to the p-block, and the most common crystal syngonies are parallelogram, orthorhombic, and rhombic. To gain a deeper understanding of why these materials possess high carrier mobility, we conducted partial dependence analysis based on Shapley additive explanation (SHAP) values, as shown in Fig. 4(c–f). Regarding the elemental features, an increase in the p-valence electron fraction is positively correlated with carrier mobility (Fig. 4(c)), consistent with the element distribution results in Fig. 4(b). Moreover, Fig. 4(d) shows that the smaller the difference in electronegativity, the more positive the contribution to carrier mobility. This may be because a smaller electronegativity difference tends to facilitate the formation of covalent bonds, in which the electrons are freer and move more easily, resulting in higher mobility. Structural features such as mirror symmetries are also found to correlate with carrier mobility. As shown in Fig. 4(e–f), materials with three in-plane mirror operations, such as TMDs, generally exhibit lower electron mobility, while materials with out-of-plane mirror operations, such as hexagonal boron nitride, have higher electron mobility. This may be because the mirror symmetry restricts some specific phonon vibration modes, which weakens the electron-phonon coupling and results in higher carrier mobility³⁸. It is worth noting that although the trend regarding symmetry is clear, the overall impact of symmetry on the model is not as significant as that of elemental features according to SHAP values. Therefore, in materials with different compositions, the effect of symmetry can be easily masked. Moreover, research on the correlation between symmetry and mobility is still limited, and further study based on phonon vibration modes is needed to provide deeper insights.

Figure 4(g, h) presents the distribution of carrier mobility in 2D semiconductors using t-SNE. Space groups including P6/mcc, P4/nmm, P4₁2₂, and some low-symmetry systems are observed to have high carrier mobility. A significant trend is that the majority of 2D semiconductors exhibit higher electron mobility than hole mobility, except for some low-symmetry structures where the hole mobility is higher; the t-SNE plot of hole mobility is shown in Supplementary Fig. 7. Further DFT calculations show that the effective masses of electrons and holes in these systems are similar, while the deformation potentials differ significantly, as listed in Table 1 and Supplementary Table 3. Therefore, the high hole mobility in these low-symmetry structures may be influenced by different phonon scattering mechanisms for electrons and holes. Although the materials selected in this study exhibit higher carrier mobility than traditional 2D semiconductors such as MoS₂ and black phosphorus, a considerable fraction of them have bandgaps that are either too large or too small for application in the semiconductor industry. However, these materials may still have potential applications in fields such as catalysis and photovoltaics. Figure 4(i) presents the electron and hole mobility of the screened materials with a proper bandgap of around 1 eV. Most of these materials have high mobility for only one carrier type, which makes them promising candidates for p-type or n-type semiconductors. Notably, BiAs and BiSb possess both high electron and hole mobility, making them compelling choices for complementary logic devices.

Table 1 Calculated carrier mobility, effective mass and deformation potential for the top 10 materials with the highest carrier mobility

2D semiconductors with high carrier mobility

To gain insight into the mechanisms underlying the high mobility of these 2D semiconductors, we performed an electronic structure analysis of two representative structures (group V AB and group IV-V AB₂), as shown in Fig. 5. We also summarized their carrier mobility, bandgap, effective mass and deformation potential in Table 1 and Supplementary Table 3. The group V AB semiconductors have a structure similar to that of black phosphorus, as shown in Fig. 5(a)³⁹. However, with different elements substituted, their band edges shift from the Γ point to a point within the Y-Γ high-symmetry path. Moreover, the shifted band edges are sharper, indicating smaller effective masses and higher carrier mobilities, as shown in Fig. 5(b, c). These findings suggest that bandgap engineering through elemental substitution can be an effective approach to enhance carrier transport, and the elements within the p-block may be good choices according to the above partial dependence analysis. The group IV-V AB₂ semiconductors, shown in Fig. 5(d), exhibit strong structural anisotropy, which leads to remarkable electronic anisotropy, as shown in Fig. 5(e, f). Local magnifications of the conduction band minimum and valence band maximum reveal obvious differences in effective mass along orthogonal directions. Specifically, the effective mass is smaller along the a lattice direction; interestingly, however, the deformation potential is far smaller along the b direction than along the a direction (see Table 1). Consequently, the mobility is higher along the b direction, which indicates that the electron-phonon interaction plays a decisive role in this system. In addition, the a and b directions differ in symmetry: the a direction exhibits mirror symmetry, while the b direction does not, further implying a potential correlation between symmetry and mobility. This highlights the importance of further investigation into how symmetry affects carrier mobility, as well as the potential for modulating carrier mobility through symmetry-protecting or symmetry-breaking structural engineering.

**Fig. 5: Representative 2D materials with high carrier mobility.**

Discussion

In summary, we have developed a hybrid transfer learning method that combines adversarial training and expert knowledge to enable effective knowledge transfer across different materials and different properties. As a demonstration, this method has been applied to 2D materials and achieved rapid and accurate predictions of carrier mobility by utilizing the big data of bulk effective mass. Notably, such mobility prediction requires only crystal structures as input, yet maintains accuracy comparable to DFT calculations at a speed five orders of magnitude faster. Moreover, 21 2D semiconductors with ultrahigh carrier mobility far exceeding that of silicon have been screened out from 4266 candidates. The success of this method lies in the incorporation of adversarial training and expert knowledge, which effectively captures the similarity among diverse materials while also characterizing the distinctive attributes of target materials and properties. It therefore facilitates simultaneous cross-material and cross-property transfer learning, enhancing the predictive capability and reliability of the model. This study provides a widely applicable strategy for addressing data scarcity in ML-assisted material design.

Nevertheless, the effectiveness of this approach for systems with higher degrees of dissimilarity, such as from ordered crystals to disordered materials like alloys, remains untested, and the approach may not be as successful there. Such cases may require improved adversarial training approaches or borrowing generative adversarial network methods from inverse design^40,41,42,43. Another significant challenge in this field is how to select the most appropriate source-domain task from the various options available. This is especially important since the amount and diversity of materials data are constantly expanding. Further research is therefore needed to uncover the correlations between different materials and properties, which would guide source task selection while also improving the interpretability of transfer learning.

Methods

Machine learning: The transfer learning framework is composed of three parts: a feature extractor, a property regressor, and a data source discriminator. All of these parts utilize multi-layer perceptron (MLP) models, which are built and trained under the PyTorch⁴⁴ framework. To optimize the MLP hyperparameters, including the number of layers and neurons per layer, we employed a random search method, as depicted in Supplementary Figs. 3 and 4. Then, the extracted features were fed into a gradient boosting tree (XGBoost) model to predict carrier mobility and its anisotropy. Other models, such as kernel ridge regression (KRR) and the least absolute shrinkage and selection operator (LASSO), were also tested, and the tree model under the XGBoost framework gave the best performance, as illustrated in Supplementary Fig. 5. The optimized hyperparameters are given in Supplementary Table 1.

Model interpretation: To interpret the machine learning models, we used Shapley additive explanation (SHAP⁴⁵), which is based on the game-theoretic Shapley value introduced by Lloyd Shapley. This method produces SHAP values that reflect the positive or negative impact of each feature in each sample on the prediction results, providing deeper interpretation capabilities for complex ML models. In this study, we assessed feature importance using the mean absolute SHAP values, which describe the average impact of each feature. Because the relatively small dataset may lead to increased randomness, we used 20-fold cross-validation to assess the model performance.
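The two procedures named here can be sketched without any ML library. The example below assumes that a matrix of per-sample SHAP values has already been produced elsewhere (for instance by a SHAP explainer); the feature names in the usage note are placeholders, and the interleaved, unshuffled fold assignment is one simple choice rather than the authors' documented split.

```python
def mean_abs_shap(shap_values, feature_names):
    """Rank features by mean |SHAP| over samples (rows = samples)."""
    n_samples = len(shap_values)
    scores = {
        name: sum(abs(row[j]) for row in shap_values) / n_samples
        for j, name in enumerate(feature_names)
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


def k_fold_indices(n_samples, k=20):
    """Yield (train_indices, test_indices) for k interleaved folds."""
    folds = [list(range(start, n_samples, k)) for start in range(k)]
    for test_idx in folds:
        held_out = set(test_idx)
        train_idx = [i for i in range(n_samples) if i not in held_out]
        yield train_idx, test_idx
```

For example, `mean_abs_shap([[0.2, -0.1], [-0.4, 0.1]], ["atl_feature", "symmetry_feature"])` ranks the first placeholder feature above the second, and `k_fold_indices(178, 20)` splits the 178 labelled materials into folds of 8 or 9 test samples each.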

High-throughput calculations: All high-throughput calculations were performed using a self-developed Python script within the framework of density functional theory (DFT) as implemented in the Vienna ab initio Simulation Package (VASP⁴⁶). Specifically, the electron-electron interactions were handled using the generalized gradient approximation as parameterized by Perdew, Burke, and Ernzerhof (PBE⁴⁷). In addition, based on the effective mass approximation, the mobility was computed using the deformation potential theory^48,49 for 2D systems, which is expressed as Formula 1:

$$\mu_{\mathrm{2D}}=\frac{e\hbar^{3}C_{\mathrm{2D}}}{k_{\mathrm{B}}T\,m^{*}m_{\mathrm{l}}^{*}\left(E_{\mathrm{l}}^{\mathrm{i}}\right)^{2}}$$

(1)

where $m^{*}$ is the average effective mass in the two transport directions, $m_{\mathrm{l}}^{*}$ and $E_{\mathrm{l}}^{\mathrm{i}}$ are the effective mass and deformation potential constant along the transport direction, $C_{\mathrm{2D}}$ is the 2D elastic modulus, $e$ is the elementary charge, $\hbar$ is the reduced Planck constant, $k_{\mathrm{B}}$ is the Boltzmann constant, and $T$ is the temperature. The automatic calculation workflow is demonstrated in Supplementary Fig. 8, and more computational details can be found in the Supplementary Information (SI).
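Formula 1 can be evaluated directly once the units are made explicit. The sketch below is an illustration under stated assumptions rather than the authors' workflow: the function name, the choice of input units (elastic modulus in N/m, masses in electron masses, deformation potential in eV), and the default temperature of 300 K are not taken from the paper. The physical constants are CODATA 2018 values.

```python
E_CHARGE = 1.602176634e-19  # elementary charge, C
HBAR = 1.054571817e-34      # reduced Planck constant, J s
K_B = 1.380649e-23          # Boltzmann constant, J/K
M_E = 9.1093837015e-31      # electron rest mass, kg


def dpt_mobility_2d(c2d, m_avg, m_l, e_def, temperature=300.0):
    """Deformation-potential mobility of a 2D system (Formula 1).

    c2d:         2D elastic modulus in N/m (equivalently J/m^2)
    m_avg:       average effective mass, in units of the electron mass
    m_l:         effective mass along the transport direction, same units
    e_def:       deformation potential constant in eV
    temperature: in K (300 K is an assumed default)
    Returns the mobility in cm^2 V^-1 s^-1.
    """
    e_def_joule = e_def * E_CHARGE
    numerator = E_CHARGE * HBAR ** 3 * c2d
    denominator = (K_B * temperature * (m_avg * M_E) * (m_l * M_E)
                   * e_def_joule ** 2)
    mobility_si = numerator / denominator  # m^2 V^-1 s^-1
    return mobility_si * 1e4               # convert m^2 to cm^2
```

The expression makes the sensitivities easy to see: the mobility scales linearly with the elastic modulus but with the inverse square of the deformation potential, so halving the deformation potential quadruples the mobility.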

Data collection and processing: The initial carrier mobility data for 178 2D materials were collected from published literature and are provided as Supplementary Data. These data were used as both training and testing sets. Meanwhile, we utilized two open-source 2D material databases, C2DB^50,51 and 2Dmatpedia⁵², as prediction sets. The source properties, i.e., bulk effective masses, were acquired from the Materials Project⁵³. Their element distributions can be seen in Supplementary Figs. 1 and 2. Given the large range of mobility values and the difficulty in uniformly defining the carrier transport direction for different lattices, we employed two dimensionless quantities to describe the carrier mobility: the average carrier mobility $\bar{\mu}$ and the mobility anisotropy $A$, as defined in Formulas 2 and 3.

$$\bar{\mu}=\log_{10}\left(\frac{\sqrt{\sum_{\mathrm{i}}\mu_{\mathrm{i}}^{2}}}{\mu_{\mathrm{Si}}}\right)$$

(2)

$$A=\log_{10}\left(\frac{\max\left\{\mu_{\mathrm{i}}\right\}}{\min\left\{\mu_{\mathrm{i}}\right\}}\right)$$

(3)

where i indexes the two orthogonal transport directions and $\mu_{\mathrm{Si}}$ represents the carrier mobility of silicon. These two quantities can describe both electron and hole mobility, denoted $\bar{\mu}_{\mathrm{e}}$, $\bar{\mu}_{\mathrm{h}}$, $A_{\mathrm{e}}$ and $A_{\mathrm{h}}$. The electron and hole mobilities in silicon⁵⁴ are 1331 cm² V⁻¹ s⁻¹ and 283.5 cm² V⁻¹ s⁻¹, respectively. To avoid the model placing too much emphasis on samples with large mobility or anisotropy, we logarithmically transformed all data.
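Formulas 2 and 3 translate into a few lines of code. The sketch below uses the silicon mobilities quoted above; the function and dictionary names are illustrative choices, not identifiers from the authors' repository.

```python
import math

# Silicon reference mobilities quoted in the text, in cm^2 V^-1 s^-1.
MU_SI = {"electron": 1331.0, "hole": 283.5}


def average_mobility(mu_directions, carrier):
    """Formula 2: log10 of the root-sum-square mobility over mu_Si."""
    root_sum_square = math.sqrt(sum(mu * mu for mu in mu_directions))
    return math.log10(root_sum_square / MU_SI[carrier])


def mobility_anisotropy(mu_directions):
    """Formula 3: log10 of the ratio of largest to smallest mobility."""
    return math.log10(max(mu_directions) / min(mu_directions))
```

Two properties follow from the definitions. Because Formula 2 uses the root-sum-square over directions, a material whose electron mobility equals that of silicon in both directions has $\bar{\mu}_{\mathrm{e}} = \log_{10}\sqrt{2} \approx 0.15$ rather than 0. An isotropic material has $A = 0$, and each unit of $A$ corresponds to one order of magnitude between the two directions.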

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The carrier mobility data generated in this study are provided in the manuscript file, the Supplementary Information files, and Source Data files.

The data of 2D materials and bulk effective mass used in this study are available at public websites, C2DB^50,51 (https://cmr.fysik.dtu.dk/c2db/c2db.html), 2Dmatpedia⁵² (http://www.2dmatpedia.org) and MP⁵³ (https://materialsproject.org/). The carrier mobility data for model training are provided in Supplementary Data 2. Source data are provided with this paper.

Code availability

The codes to perform adversarial transfer learning and predict 2D carrier mobility are provided as Supplementary Code and are also available at https://github.com/XinYu-Chen98/Hybrid-ATL-and-expert-knowledge-for-materials-design⁵⁵.

Change history

31 July 2024
A Correction to this paper has been published: https://doi.org/10.1038/s41467-024-50561-0

References

Lu, S. H. et al. Accelerated discovery of stable lead-free hybrid organic-inorganic perovskites via machine learning. Nat. Commun. 9, 3405 (2018).
Lu, S., Zhou, Q., Ma, L., Guo, Y. & Wang, J. Rapid discovery of ferroelectric photovoltaic perovskites and material descriptors via machine learning. Small Methods 3, 1900360 (2019).
Hart, G. L. W., Mueller, T., Toher, C. & Curtarolo, S. Machine learning for alloys. Nat. Rev. Mater. 6, 730–755 (2021).
Rao, Z. et al. Machine learning–enabled high-entropy alloy discovery. Science 378, 78–85 (2022).
Mai, H., Le, T. C., Chen, D., Winkler, D. A. & Caruso, R. A. Machine learning for electrocatalyst and photocatalyst design and discovery. Chem. Rev. 122, 13478–13515 (2022).
Zahrt, A. F. et al. Prediction of higher-selectivity catalysts by computer-driven workflow and machine learning. Science 363, eaau5631 (2019).
Kirklin, S. et al. The open quantum materials database (OQMD): assessing the accuracy of DFT formation energies. NPJ Comput. Mater. 1, 15010 (2015).
Higgins, K., Ziatdinov, M., Kalinin, S. V. & Ahmadi, M. High-throughput study of antisolvents on the stability of multicomponent metal halide perovskites through robotics-based synthesis and machine learning approaches. J. Am. Chem. Soc. 143, 19945–19955 (2021).
Chen, C., Zuo, Y., Ye, W., Li, X. & Ong, S. P. Learning properties of ordered and disordered materials from multi-fidelity data. Nat. Comput. Sci. 1, 46–53 (2021).
Butler, K. T., Davies, D. W., Cartwright, H., Isayev, O. & Walsh, A. Machine learning for molecular and materials science. Nature 559, 547–555 (2018).
Article ADS CAS PubMed Google Scholar
Xu, P., Ji, X., Li, M. & Lu, W. Small data machine learning in materials science. NPJ Comput. Mater. 9, 42 (2023).
Article ADS Google Scholar
Chen, X. et al. Accurate property prediction with interpretable machine learning model for small datasets via transformed atom vector. Phys. Rev. Mater. 6, 123803 (2022).
Article CAS Google Scholar
Zhou, Q. H., Lu, S. H., Wu, Y. L. & Wang, J. L. Property-oriented material design based on a data-driven machine learning technique. J. Phys. Chem. Lett. 11, 3920–3927 (2020).
Article CAS PubMed Google Scholar
Jha, D. et al. Enhancing materials property prediction by leveraging computational and experimental data using deep transfer learning. Nat. Commun. 10, 5316 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Yamada, H. et al. Predicting materials properties with little data using shotgun transfer learning. ACS Cent. Sci. 5, 1717–1730 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lu, S. H. et al. Coupling a crystal graph multilayer descriptor to active learning for rapid discovery of 2D ferromagnetic semiconductors/half-metals/metals. Adv. Mater. 32, 2002658 (2020).
Article CAS Google Scholar
Li, Y., Zhu, R., Wang, Y., Feng, L. & Liu, Y. Center-environment deep transfer machine learning across crystal structures: from spinel oxides to perovskite oxides. NPJ Comput. Mater. 9, 109 (2023).
Article ADS Google Scholar
Liu, Z. Y., Jiang, M. & Luo, T. F. Leverage electron properties to predict phonon properties via transfer learning for semiconductors. Sci. Adv. 6, eabd1356 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Ma, R., Colón, Y. J. & Luo, T. Transfer learning study of gas adsorption in metal–organic frameworks. ACS Appl. Mater. Interfaces 12, 34041–34048 (2020).
Article CAS PubMed Google Scholar
Ju, S. H. et al. Exploring diamondlike lattice thermal conductivity crystals via feature-based transfer learning. Phys. Rev. Mater. 5, 053801 (2021).
Article CAS Google Scholar
Liu, Z., Jiang, M. & Luo, T. Leveraging low-fidelity data to improve machine learning of sparse high-fidelity thermal conductivity data via transfer learning. Mater. Today Phys. 28, 100868 (2022).
Article Google Scholar
Chen, C. & Ong, S. P. AtomSets as a hierarchical transfer learning framework for small and large materials datasets. NPJ Comput. Mater. 7, 173 (2021).
Article ADS Google Scholar
Gupta, V. et al. Cross-property deep transfer learning framework for enhanced predictive analytics on small materials data. Nat. Commun. 12, 6595 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Kolluru, A. et al. Transfer learning using attentions across atomic systems with graph neural networks (TAAG). J. Chem. Phys. 156, 184702 (2022).
Article ADS CAS PubMed Google Scholar
Kiyohara, S., Hinuma, Y. & Oba, F. Band alignment of oxides by learnable structural-descriptor-aided neural network and transfer learning. J. Am. Chem. Soc. 146, 9697–9708 (2024).
Article CAS PubMed PubMed Central Google Scholar
Zhuang, F. et al. A comprehensive survey on transfer learning. Proc. IEEE 109, 43–76 (2021).
Article Google Scholar
Liu, Y. et al. Promises and prospects of two-dimensional transistors. Nature 591, 43–53 (2021).
Article ADS CAS PubMed Google Scholar
Ng, H. K. et al. Improving carrier mobility in two-dimensional semiconductors with rippled materials. Nat. Electron. 5, 489–496 (2022).
Article CAS Google Scholar
Poncé, S., Li, W., Reichardt, S. & Giustino, F. First-principles calculations of charge carrier mobility and conductivity in bulk semiconductors and two-dimensional materials. Rep. Prog. Phys. 83, 036501 (2020).
Article ADS MathSciNet PubMed Google Scholar
Poncé, S., Margine, E. R. & Giustino, F. in Towards predictive many-body calculations of phonon-limited carrier mobilities in semiconductors. Phys. Rev. B 97, 121201 (2018).
Article ADS Google Scholar
Ganin, Y. & Lempitsky, V. Unsupervised domain adaptation by backpropagation, Int. Conf. Mach. Learn. 37, 1180-1189 (2015).
Ward, L., Agrawal, A., Choudhary, A. & Wolverton, C. A general-purpose machine learning framework for predicting properties of inorganic materials. NPJ Comput. Mater. 2, 16028 (2016).
Article Google Scholar
Cheng, L., Zhang, C. & Liu, Y. Why two-dimensional semiconductors generally have low electron mobility. Phys. Rev. Lett. 125, 177701 (2020).
Article ADS MathSciNet CAS PubMed Google Scholar
Cheng, L. & Liu, Y. What limits the intrinsic mobility of electrons and holes in two dimensional metal dichalcogenides? J. Am. Chem. Soc. 140, 17895–17900 (2018).
Article CAS PubMed Google Scholar
Wang, F. et al. Anisotropic infrared response and orientation-dependent strain-tuning of the electronic structure in Nb2SiTe4. ACS Nano 16, 8107–8115 (2022).
Article CAS PubMed Google Scholar
Vorobeva, N. S. et al. Anisotropic properties of Quasi-1D In₄Se₃: mechanical exfoliation, electronic transport, and polarization-dependent photoresponse. Adv. Funct. Mater. 31, 2106459 (2021).
Article CAS Google Scholar
Zhao, M. et al. Nb₂SiTe₄: A stable narrow-gap two-dimensional material with ambipolar transport and mid-infrared response. ACS Nano 13, 10705–10710 (2019).
Article CAS PubMed Google Scholar
Zheng, S. et al. Symmetry-guaranteed high carrier mobility in quasi-2D thermoelectric semiconductors. Adv. Mater. 35, 2210380 (2023).
Article CAS Google Scholar
Qiao, J., Kong, X., Hu, Z.-X., Yang, F. & Ji, W. High-mobility transport anisotropy and linear dichroism in few-layer black phosphorus. Nat. Commun. 5, 4475 (2014).
Article ADS CAS PubMed Google Scholar
Lu, S., Zhou, Q., Chen, X., Song, Z. & Wang, J. Inverse design with deep generative models: next step in materials discovery. Natl Sci. Rev. 9, nwac111 (2022).
Article PubMed PubMed Central Google Scholar
Wang, J. & Chen, Y. Adversarial Transfer Learning. In Introduction to Transfer Learning. Machine Learning: Foundations, Methodologies, and Applications, 163–174 (Springer, Singapore, 2023). https://doi.org/10.1007/978-981-19-7584-4_10.
Deng, Z., Zhang, L. J., Vodrahalli, K., Kawaguchi, K. & Zou, J. Adversarial training helps transfer learning via better representations. Adv. Neural Inf. Process. Syst. 34, 25179–25191 (2021).
Google Scholar
Gupta, V. et al. Structure-aware graph neural network based deep transfer learning framework for enhanced predictive analytics on diverse materials datasets. NPJ Comput. Mater. 10, 1 (2024).
Article ADS CAS Google Scholar
Paszke, A. et al. In PyTorch: An Imperative Style, High-Performance Deep Learning Library, 33rd Conference on Neural Information Processing Systems, (NIPS 2019).
Lundberg, S. M. et al. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2, 56–67 (2020).
Article PubMed PubMed Central Google Scholar
Kresse, G. & Furthmüller, J. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B 54, 11169 (1996).
Article ADS CAS Google Scholar
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 78, 1396 (1997). 1396.
Article ADS CAS Google Scholar
Long, M. Q., Tang, L., Wang, D., Wang, L. J. & Shuai, Z. G. Theoretical predictions of size-dependent carrier mobility and polarity in graphene. J. Am. Chem. Soc. 131, 17728–17729 (2009).
Article CAS PubMed Google Scholar
Bardeen, J. & Shockley, W. Deformation potentials and mobilities in non-polar crystals. Phys. Rev. 80, 72–80 (1950).
Article ADS CAS Google Scholar
Haastrup, S. et al. The computational 2D materials database: high-throughput modeling and discovery of atomically thin crystals. 2D Mater. 5, 042002 (2018).
Article CAS Google Scholar
Gjerding, M. N. et al. Recent progress of the computational 2D materials database (C2DB). 2D Mater. 8, 044002 (2021).
Article CAS Google Scholar
Zhou, J. et al. 2DMatPedia, an open computational database of two-dimensional materials from top-down and bottom-up approaches. Sci. Data 6, 86 (2019).
Article PubMed PubMed Central Google Scholar
Ricci, F. et al. An ab initio electronic transport database for inorganic materials. Sci. Data 4, 170085 (2017).
Article CAS PubMed PubMed Central Google Scholar
Arora, N. D., Hauser, J. R. & Roulston, D. J. Electron and hole mobilities in silicon as a function of concentration and temperature. IEEE Trans. Electron Devices 29, 292–295 (1982).
Article ADS Google Scholar
Chen X. From bulk effective mass to 2D carrier mobility accurate prediction via adversarial transfer learning, XinYu-Chen98/Hybrid-ATL-and-expert-knowledge-for-materials-design: v1.0.0, https://doi.org/10.5281/zenodo.11387808 (2024).

Download references

Acknowledgements

This work is supported by the National Key Research and Development Program of China (2022YFB3807200, 2022YFA5000703), the Natural Science Foundation of China (22033002, T2321002, 22373013), the Natural Science Foundation of Jiangsu Province, Major Project (BK20232012, BK20222007), the Jiangsu Provincial Scientific Research Center of Applied Mathematics (BK20233002), and the Fundamental Research Funds for the Central Universities. The authors acknowledge the computational resources provided by the Big Data Computing Center of SEU and the National Supercomputing Center in Tianjin.

Author information

Authors and Affiliations

Key Laboratory of Quantum Materials and Devices of Ministry of Education, School of Physics, Southeast University, Nanjing, China
Xinyu Chen, Shuaihua Lu, Qian Chen, Qionghua Zhou & Jinlan Wang
Suzhou Laboratory, Suzhou, China
Qionghua Zhou & Jinlan Wang

Authors

Xinyu Chen
Shuaihua Lu
Qian Chen
Qionghua Zhou
Jinlan Wang

Contributions

Q.Z. and J.W. conceived this work. X.C. proposed a hybrid transfer learning framework and wrote the code with guidance from S.L., Q.Z., and J.W. X.C. performed DFT calculations with guidance from Q.Z. and Q.C. X.C., Q.Z., and J.W. analyzed the data and co-wrote the manuscript, with input from the other authors.

Corresponding authors

Correspondence to Qionghua Zhou or Jinlan Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Code

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Chen, X., Lu, S., Chen, Q. et al. From bulk effective mass to 2D carrier mobility accurate prediction via adversarial transfer learning. Nat Commun 15, 5391 (2024). https://doi.org/10.1038/s41467-024-49686-z

Received: 18 July 2023
Accepted: 10 June 2024
Published: 25 June 2024
Version of record: 25 June 2024
DOI: https://doi.org/10.1038/s41467-024-49686-z
