Abstract
Many technological applications depend on the response of materials to electric fields, but available databases of such responses are limited. Here, we explore the infrared, piezoelectric, and dielectric properties of inorganic materials by combining high-throughput density functional perturbation theory and machine learning approaches. We compute Γ-point phonons, infrared intensities, Born-effective charges, piezoelectric, and dielectric tensors for 5015 non-metallic materials in the JARVIS-DFT database. We find 3230 and 1943 materials with at least one far and mid-infrared mode, respectively. We identify 577 high-piezoelectric materials, using a threshold of 0.5 C/m2. Using a threshold of 20, we find 593 potential high-dielectric materials. Importantly, we analyze the chemistry, symmetry, dimensionality, and geometry of the materials to find features that help explain variations in our datasets. Finally, we develop high-accuracy regression models for the highest infrared frequency and maximum Born-effective charges, and classification models for maximum piezoelectric and average dielectric tensors to accelerate discovery.
Similar content being viewed by others
Introduction
The Materials Genome Initiative (MGI)1 has revolutionized the development of new technologically important materials, which historically has been a time-consuming task that was mainly dominated by trial and error strategies. Since MGI launched in 20111,2, high-throughput computational3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23 and experimental12,24,25,26,27 techniques have become a foundation for the “materials by design” paradigm. Moreover, the synergy among multidisciplinary sciences, along with rapid advancements in the electronic-structure methods, computational resources, and experimental techniques has made the process of materials design faster and far more efficient in some cases. As a result, the community is gradually migrating towards systematic computation-driven materials selection paradigms9,10,13,14,15,16,17,18,19,28,29,30,31,32,33,34,35, where functional materials are screened by establishing a direct link between the macroscopic functionality and the atomic-scale nature of the material. We are in a data-rich, modeling-driven era where trial and error approaches are gradually being replaced by rational strategies9,36,37,38, which couple predictions not only from specific electronic-structure calculations of a given property but also by learning from the existing data using machine learning. Subsequent targeted experimental synthesis and validation provide a means of rapid iteration to verify and improve computational models.
Applications of these computational techniques in a high-throughput manner have led to several databases of computed geometries and many physicochemical properties, AFLOW4, Materials-project3, Khazana17, Open Quantum Materials Database (OQMD)5, NOMAD7, Computational Materials Repository (CMR)39, NIMS databases40, and our NIST-JARVIS databases6,8,21,22,23,41,42,43,44,45,46,47. Despite a few systematic experimental databases of IR data (such as https://webbook.nist.gov/chemistry/vib-ser/), a systematic investigation of IR for inorganic materials is still lacking. Similarly, there have been only a few systematic databases developed for piezoelectric (PZ) and dielectric (DL) materials such as by De Jong et al.48, Petousis et al.49,50, Roy et al.51, and Choudhary et al.41. In this work, we significantly expand the scope of these previous efforts by developing systematic databases for infrared (IR) absorption spectra, piezoelectric tensors, and dielectric tensors.
Vibrational spectroscopy based on infrared (IR), Raman, and neutron scattering are ubiquitous methods to probe the chemical bonding, and thus the electronic structure of a material. Infrared frequencies are classified in three categories: far (30–400 cm−1), mid (400–4000 cm−1), and near (4000–14,000 cm−1) IR frequencies. Traditionally, IR spectroscopy was only used to probe organic materials, but with the availability of instruments capable to detect frequencies 600 cm−1, IR spectroscopy has also been proven successful in distinguishing phases of inorganic compounds52,53,54, thermal imaging55, infrared astronomy56, and food quality control57.
The piezoelectric tensor (PZ)58 describes the change in the polarization of a material due to mechanical stress or strain, or conversely, the change in stress or strain due to an external electric field. Similarly, the dielectric tensor59,60 describes the change in polarization due to an electric field. A related quantity, the Born-effective charge (BEC) tensor, describes first-order response of the atomic positions to an electric field. All of the PZ tensor-components are zero for materials that have inversion symmetry, leaving 138 space groups with non-zero PZ response while DL tensors and BEC tensors can have non-zero components regardless of symmetry. However, even when non-zero values are allowed, symmetry still strongly restricts the structure of these tensors.
While IR data can be used in infrared-detector design, the associated PZ61, DL62, and Born-effective charge data can be utilized for designing sensors, actuators, and capacitors62,63,64. In addition, there is a significant interest in finding lead-free piezoelectric materials to replace lead zirconate titanate (PZT) in various applications. All the above quantities can be obtained from density functional perturbation theory (DFPT) calculations65,66,67,68,69. The DFPT is a well-known technique to effectively calculate the second derivative of the total energy with respect to atomic displacement. The computational phonon-spectroscopy method has been recently shown to be an effective means for characterizing materials, as discussed by Skelton et al.70 and Kendrick and Burnett71. The information obtained using the DFPT method can also be utilized for predicting piezoelectric coefficients, static dielectric matrix, and Born-effective charges, as described by Gonze68 and Wu et al.66.
To validate our calculations, we compare them to the handful of available experimental data and show the uncertainty in predictions. We also identify materials that we predict have exceptional IR, PZ, or DL properties, which may be good candidates for experimental synthesis and characterization. Finally, we develop high accuracy supervised machine learning models based on the classification and regression methods to pre-screen high-performance materials without performing additional first-principles calculations. This work is a continuation of our previous datasets for exfoliability6, elastic23, optoelectronic41, topological21,22, solar-cell efficiency21, and thermoelectric43 property, scanning tunneling microscopy image8 datasets. The complete datasets for IR, PZ, BEC, and DL properties are provided on the JARVIS-DFT website (https://www.ctcms.nist.gov/~knc6/JVASP.html). The JARVIS-DFT is a part of the Materials Genome Initiative (MGI) at NIST.
Results and discussion
We use DFPT to predict the infrared, dielectric, and piezoelectric response of insulating materials. Out of 38,000 3D materials in the JARVIS-DFT database, we select materials with OptB88vdW bandgaps >0.1 eV41 and energy above convex hull <0.5 eV/atom72 and in their corresponding conventional cell representation. This leads to 10,305 materials. Note that the OptB88vdW gaps are generally underestimated compared to experiments but are useful in pre-screening. The convex hull helps enumerate the thermodynamic stability at 0 K. We further narrow down the list of candidates with the number of atoms in the simulation cell <20 atoms leading to 7230 systems. We have completed calculations for 5015 materials so far and other calculations are still ongoing. After the DFPT calculations, we obtain the phonon-frequencies at Г-point as well. We predict infrared, piezoelectric, and dielectric properties of systems only with all positive phonon-frequencies because the materials with negative frequencies are dynamically unstable. Dynamically unstable materials are usually the high symmetry structure of a material that undergoes a symmetry-lowering structural phase transition at low temperatures, such as a ferroelectric transition, but further calculations are necessary to understand these behaviors. For the dynamically stable materials, we also train machine learning models to quickly pre-screen materials for the next set of DFPT calculations. Such machine-learning models have not been reported before to the best of our knowledge. A brief- flow-chart of the whole process is given in Fig. 1. Now we discuss in detail individual components of this work below.
Infrared intensity
As mentioned above, the infrared intensity is important for thermal-imaging, infrared-astronomy, and food-quality control. Using Eqs. (1)–(7) (see “Methods” section), we calculate the IR intensities of 3411 materials with bandgap >0.1 eV and positive phonon frequencies at Γ-point. We compare nine experimental (Exp.) IR modes with the DFPT results to estimate the data-uncertainty. The comparison is listed in supporting information (see Supplementary Table S1). Based on the data in Supplementary Table S1, the difference in IR mode location between our calculations and experiments has a mean absolute deviation of 8.36 cm−1. This difference is small compared to the total range of IR frequencies, from 0 to 1000 cm−1. Several previous studies also report the difference between DFPT computed frequencies and measured peaks in this range73,74,75. In addition to intrinsic limitations in the DFT-exchange-correlation function, this small discrepancy may be caused by the fact that our calculations are carried out at 0 K while experiments are typically done at 300 K, or that experimental systems can contain defects and impurities, which are absent in our calculations. Given the strong agreement between theory and experiment, we expect that our carefully curated database will be useful for materials characterization.
To analyze the overall trend in the IR data, we plot the results for all materials together in Fig. 2. We observe that most of the modes are less than 1500 cm−1 which is consistent with the fact that inorganic materials generally have much lower IR frequencies than organic and soft materials, which typically contain covalently bonded light elements, leading to higher vibrational frequencies. Moreover, a close look at the dataset suggests that 3230 materials have at least one far-IR mode, and 1943 materials have at least one mid-IR mode. As expected, we could not find any material with near-IR modes, as the largest frequency we find is 3764 cm−1 for Mg5(HO3)2 (JVASP-13093). We find 41 elemental systems such as crystalline nitrogen (JVASP-25051), krypton (JVASP-907), xenon (JVASP-25276), etc. have no IR peaks. This can be attributed to the acoustic sum rule for Born-effective charges, which forces the total Born-effective charge of a system to add up to zero. From the dataset, we also observe that 1426 materials have only far-IR, 1804 both far and mid-IR, and 139 mid-IR peaks. Some of the materials with the lowest IR frequencies are from the halide family, such as CuBr (JVASP-5176), AgI (JVASP-12023), TlTe3Pt2 (JVASP-4627), while some of the highest frequencies are obtained for O- or OH-containing compounds, such as Mg5(HO3)2 (JVASP-13093). All the materials above 2078 cm−1 contain hydrogen and the next highest set of frequencies are obtained for compounds with C–N bonds, which is reasonable, as ω ~ m−1/2 (where ω is frequency and m is the atomic mass). These trends may be useful as a starting point in identifying new materials for infrared-related devices, like detectors, sensors, and lenses.
a IR peaks for all the materials in the DFPT database, b comparison of finite-difference (FDM) and DFPT phonon frequencies for conventional cells, c space-group distributions of materials with at least one far (blue) and mid (green) IR peaks, d Venn-diagram of the chemistry of materials containing chalcogenides, halides, or pnictides materials, e minimum frequency vs density of the system, and f dimensionality analysis of the far-IR materials.
Next, we compare the DFPT and finite-displacement method (FDM) phonon frequencies (obtained from our elastic-constant database23) for 2926 materials and 72,624 phonon modes (as shown in Fig. 2b). We find that the DFPT and FDM compare very well (Pearson correlation coefficient of 0.99), and significant differences are only found for a few molecular systems such as crystalline H2, N2, and a few lanthanide oxides (such as ErBiO, JVASP-49979). This consistency suggests an overall high quality of our computational dataset.
In Fig. 2c, we analyze the space-group variation of materials with at least one far-IR (blue) and mid-IR (green) peaks. Some of the high-symmetry space-groups with a high number of materials with far-IR peaks are 36, 80, 129, 166, and 216. As shown in Venn diagram in Fig. 2d, we find that a large number of materials with far-IR peaks are chalcogenides (O, S, Se, Te-based compounds); however, our database of inorganic materials is generally biased towards oxides (39% are chalcogenides, 14% are halogens, 19% pnictides, and 28% others), and we find many examples of halides and pnictides with far-IR peaks as well. Halides such as MgF2, LiF, etc. have been used in several astronomical telescopes such as Hubble telescope76 for infrared astronomy, and non-oxide chalcogenides such as CdTe have been used for infrared night vision cameras. We find that denser materials generally have lower phonon frequencies, which can be observed in Fig. 2e. This can be explained by the fact that the denser materials have heavier atoms leading to lower phonon frequencies for fixed spring constants. Conversely, high IR modes seem to require low-density materials. Finally, we analyzed how the dimensionality23 of materials affect the far-IR peaks (Fig. 2f). While we find that most of the far-IR-peak materials have 3D bonding, we also determine that a significant fraction (20.9%) are low-dimensional, which is slightly higher than their overall representation in the database of 17.2%. The low-dimensional/vdW-bonding is determined based on the bond-topology or lattice constant criteria23. Several examples of materials with different dimensionality with low and high-IR modes are shown in Tables 1 and 2. The complete list is provided in the Supplementary Information (see “Data availability” section). Identifying low dimensional dielectric materials is important for designing ultrathin flexible electronic devices.
Piezoelectric properties
The piezoelectric effect is a reversible process where materials exhibit electrical polarization resulting from an applied mechanical stress, or conversely, a strain due to an applied electric field. Common applications for piezoelectricity include medical applications, energy harvesting devices, actuators, sensors, motors, atomic force microscopes, and high voltage power generation58. As discussed in the “Methods” section, PZ responses can be measured under constant strain, giving the piezoelectric stress tensor eij or constant stress, giving the piezoelectric strain tensor dij. In Supplementary Table S2, we compare DFT computed PZ stress tensors (maximum eij) to experimental ones for several classes of materials, such as oxides, nitrides, and sulfides, and in several crystal structures. We find that the mean absolute deviation in max (eij) is 0.79 C/m2, which is reasonable at least for initial screening purposes. The relaxed-ion contribution to the piezoelectric tensor is proportional to the inverse dynamical matrix (see Eq. (5) in the “Methods” section), which makes this contribution very sensitive to low-frequency IR modes. These modes are often very sensitive to temperature changes and defects, especially in ferroelectric materials, which makes a direct comparison between theory and experiment challenging77. In Fig. 3a, we show a histogram of the distribution of the maximum absolute value of the eij PZ tensor component across the dataset, which shows a peak at zero and a long-tailed distribution. Most of the materials have max eij less than 1.0, but some of the high PZ materials have max. eij above 4 C/m2. We choose a 0.5 C/m2 threshold to select whether a material is a good PZ material, giving rise to 577 screened materials. The 0.5 C/m2 threshold value is used in Fig. 3d–f. In Fig. 3b, we plot a similar histogram for the maximum component of the dij tensor, the more commonly measured piezoelectric coefficient. The value of d33 is generally below 50 × 10−12 C/N as shown in Fig. 3b. Examples of the predicted lead-free high PZ materials, which we do not find reported in literature are MoO3 (JVASP-30103), YWN3(JVASP-38813), W(BrO)2 (JVASP-30364), InBiO3(JVASP-34653), NbNO (JVASP-52492), MgTiSiO5 (JVASP-9481), MgCN2 (JVASP-7814), Ca2SnS4 (JVASP-40157), ZnTa2O6 (JVASP-9231), and Bi3TaO7 (JVASP-13350). The highest PZ values naturally require a combination of high-Born-effective charges and low-frequency IR modes, in combination with a crystal symmetry that allows for the non-zero piezoelectric response. This confluence of factors makes strong piezoelectricity difficult to predict and optimize.
As depicted in Fig. 3c we compare the stress and strain-based PZ. We find that there is no obvious relation between the stress and strain PZ tensors, implying that accurate compliance tensor is essential to predict the dij values. It is important to note that PZ strain tensors are the components which are generally measured during the experiments. Next, in Fig. 3d we analyze the chemistry of high PZ materials (using 0.5 C/m2 as a threshold). Like the IR data, the high PZ materials are dominated by chalcogenides, with very few halides. We find most of the materials with high PZ are 3D, with low-dimensional materials under-represented. Some examples of 3D and low-dimensional materials are given in Table 3. Analyzing the crystal systems of the high-PZ materials, we observe that orthorhombic systems are the most represented crystal system (Fig. 3e), while space group 216 is the most common space group. As the PZ is a tensor quantity, we analyze the tensor component distribution for the whole dataset in Fig. 4a, b. All the piezoelectric tensor components have similar distributions with outlier materials with high coefficients. The e33 component has the largest number of high-value piezoelectric response materials. This stands in contrast to other tensors, like the elastic tensor, where diagonal components tend to dominate, and many components are almost always near zero. Our database can also be used to identify unusual piezoelectric mechanisms, such as negative piezoelectricity78, but such detailed analysis is beyond the scope of current work.
Dielectric properties
Dielectric materials are important components in many electronic devices such as capacitors, field-effect transistors computer memory (DRAM), sensors and communication circuits59,60. Both high and low-value DL materials have applications in different technologies. To evaluate the uncertainty in the data, we compare the dielectric constant of 16 materials with respect to experiments and the linear optics methods. The comparison is listed in Supplementary Table S3. We find the DFPT total DL has a mean-absolute deviation (MAD) of 2.46, which is lower than MAD of 2.78 for linear optics-based Tran–Blaha modified Becke–Johnson (MBJ) potential with respect to experiments. Next, we analyze the dielectric constant data obtained using the DFPT method. In Fig. 5a, we plot the distribution of the average of the diagonal values of the dielectric tensor (εavg). We find a peak value of 8.9 and a long tail towards high values. Setting our threshold for high dielectric constant materials at 20, we identify 441 materials. Some examples of previously unreported low- εavg materials are CClN(JVASP-14343), GeF4(JVASP-22445) and Mg(CN)2 (JVASP-29285), BCl3 (JVASP-164), PHF4 (JVASP-25550) while high-k materials are ZnAgF3 (JVASP-7792), CaZrN2 (JVASP-79475), Ta2ZnO6 (JVASP-9231), YWN3 (JVASP-38113), KCaCl3 (JVASP-36962), NbBi3O7 (JVASP-13286), and RuC (JVASP-36402). In Fig. 5b, we compare the dielectric constants from this work to the low-frequency limit of the linear optics method we used to calculate the frequency-dependent dielectric function in our previous work41. The electronic part of the DFPT dielectric constant is in close agreement with that obtained from the linear optics method. Importantly, the ionic contribution to the static dielectric constant is frequently larger than the electronic contribution. Our dataset of electronic and ionic components can guide experimentalists to choose a material with either high and/or low ionic-contribution compounds. As expected, the electronic part of the dielectric constant and the bandgaps have a qualitative inverse relationship79, as shown in Fig. 5c. This relationship, combined with the underestimation of bandgaps by semilocal DFT functionals, can lead to an overestimation of electronic dielectric constants. Electronic bandgap properties are already discussed in detail in our previous work41. The dataset suggests that most of the high εavg materials are 3D, with again a high number of chalcogenides (Fig. 5d). However, the low-dimensional dielectric materials could be of great technological importance, because the capacitance of a layer is generally inversely proportional to its thickness, potentially allowing for ultrathin vdW-bonded devices. Examples of 3D/2D/1D/0D bulk dimensional materials with a high and low-dielectric constant are shown in Tables 4 and 5. We find similar crystal system trends for high DL and PZ materials. In Fig. 5e, f, we show the crystal system and dimensionality trends of the screened materials. We observe that trigonal and tetragonal crystal systems are highly favored for the high DL materials. Similar to the PZ data, most of the high PZ value materials in our database are 3D. The electronic part of the dielectric constant can be directly used to estimate other physical properties such as the refractive index and the birefringence, which will be analyzed in detail in a follow-up work in the future.
a Dielectric constant distribution, b dielectric constant comparison from linear optics and DFPT methods, c dielectric constant vs bandgaps of materials, d dimensionality analysis of high dielectric constant materials, e crystal system distribution of high dielectric constant materials, and f dimensionality (3D/2D/1D/0D) analysis of high DL materials.
Dynamically unstable materials
Our analysis finds many materials with imaginary phonon frequencies at Г-point, which we call dynamically unstable materials, and which we exclude from our main analysis. In order to be observed experimentally, these materials must be stabilized by finite temperature contributions to the free energy, which are neglected in this analysis. Unstable phonon modes typically indicate that materials will go through phase transitions, and are therefore of interest for applications as ferroelectrics, antiferroelectrics, ferroelastics, etc. We find 1061 materials with at least one imaginary phonon frequency at Г-point. We observed that most unstable materials have high crystal symmetry, and would likely go through symmetry-lowering phase transitions. Many of these systems belong to oxide families such as ABO3 (examples: BaTiO3 (JVASP-110), KIO3 (JVASP-22568), CoBiO3 (JVASP-29444), ZrPbO3 (JVASP-7966), MgZrO3 (JVASP-36637)), AO2 (examples: NbO2(JVASP-12003), BiO2(JVASP-12066), FeO2 (JVASP-18430)). In addition to the oxide family, we also find several other classes, including halides, hydroxides, and chalcogenides. Related high-throughput searches of unstable materials have recently been carried out by Garrity80. A full analysis of dynamically unstable materials would require finding all of the low energy phases of an unstable material, as well as how they are related. This goes beyond the scope of this work, but may be the topic of future work. A list of dynamically unstable materials is given in the Supplementary Information (see “Data availability” section).
Born-effective charge analysis
In Figs. 6 and 7, we show some of the general trends in the PZ, IR, and dielectric data. The relaxed ion contributions to all the properties considered in this work depend on the Born-effective charges, making understanding trends in the BEC important to understanding the entire dataset. In Fig. 6a the overall distribution of the maximum BEC in a material and in Fig. 6b the absolute difference in maximum BEC and the maximum formal charges81 in a material. While most materials have BEC near their formal charges, there are significant outliers with anomalous BEC, which can represent some of the best candidate materials for piezoelectric and dielectrics and is commonly associated with ferroelectrics. For example, Fig. 6c, d portrays the maximum BEC charge of a material and the minimum frequency versus the piezoelectric and ionic dielectric constant, and it is evident that almost all the high-response materials have high BEC and low-frequency modes at Γ-point. We note that while anomalous BECs are often a signature of ferroelectric materials, we remove materials with unstable phonon modes at Γ in this work.
a High IR peak (having at least one peak >400 cm−1), b high piezoelectric (>1 C/m2), c high dielectric (>20) materials, and d high maximum Born-effective charge in a system (>5). The elements in a material are weighed 1 or 0 if the material has high or low-values. Then the percentage probability of finding the element in a high-value material is calculated.
Chemical trends
Next, we depict the trends across the periodic table of the properties investigated in this work in Fig. 7. In order to understand the contribution of various elements to a given property, we weigh an element in a material one or zero depending on whether the material has a maximum-IR for far-IR peak (Fig. 7a), has PZ value greater than 0.5 C/m2 (Fig. 7b), has dielectric constant more than 10 (Fig. 7c), and has BEC more than 5 or not (Fig. 7d). After such weighing for all the materials in our dataset, we calculate the probability that an element is part of a high-value material using the threshold defined above. For example, suppose there are x number of Se-containing materials and y of them have a property over the threshold, then the percentage probability (p) for Se is calculated as \(p = \frac{y}{x} \times 100\%\). We find that high-IR peaks generally have the light elements H, B, C, N, or O, as discussed earlier. The highest PZ, DL, and BEC most commonly have Ti, Zr, Hf, Nb, Ta, or Bi. The first five elements are commonly found in 4+ and 5+ oxidation states in insulators, which tends to result in high BECs, and these elements are also present in many ferroelectric perovskite oxides. Most of the transition metals after d5 have low IR/PZ/DL and BEC values that can be related to the partial-filling of d-orbitals. Similar behavior for other materials properties such as elastic properties has been observed23. Somewhat surprisingly, our database shows the Bi has a higher average piezoelectric response than neighboring lone-pair element Pb, which is commonly associated with a high-piezoelectric response, especially in PZT, suggesting that Bi may be effective as a Pb-free piezoelectric substitute.
Machine learning
Recent advances in data infrastructure, statistics, machine learning, and computational methods have led to an explosion of computed data in the field of materials science25,26,82,83,84,85,86. Here, we apply the machine learning classification model using classical force-field inspired descriptors (CFID)46, gradient-boosting decision trees (GBDT)87,88 to our dataset consisting of 3954 dynamically stable materials. We partition the complete dataset of highest IR frequency, highest Born-effective charge in a system, highest PZ tensor value, and average dielectric tensor in 90–10% train–test divisions. To obtain optimized parameters (number of trees, number of leaves, and learning rate) for each model, we carry out hyper-parameter optimization with 5-fold cross-validation on the 90% training data for each model. First, we train a regression model for predicting the highest IR frequency mode of a material. The mean absolute error (MAE) and r2 on the 10% held set for such a model are 67.8 cm−1 and 0.96, respectively. A scatter plot of the performance on the held set is shown in Fig. 8a. While the MAE of the ML model of 67.8 cm−1 is about 8 times higher than the MAE of DFPT with respect to experiments, it is small with respect to the whole range of investigated values. The high r2 value and obvious trend in Fig. 8a shows that the maximum IR frequency can be at least roughly predicted using M. Furthermore, a 5-fold cross-validation shows a MAE score of 77.84 ± 34.67 cm−1 supports such a conclusion. The corresponding learning curve (Supplementary Fig. S1) shows that the model error decreases as we add more data, suggesting that this model will improve as our database increased.
GBDT inherently allows accessing the importance of all the features of the models. For this model, feature importance analysis shows that some of the most important descriptors are radial distribution function data, especially at 1.5 and 5.9 Å, electron affinity, and atomic radii of elements. The high importance of short bond-lengths can be easily understood as it indicates the presence of H-containing compounds, which tend to have high IR modes as discussed earlier.
Similarly, we train a regression model for maximum BEC in materials. Predicting for the test dataset, we find the MAE and r2 in BEC as 0.6 and 0.76, respectively, as shown in Fig. 8c, showing that a significant portion of the BEC can be predicted using ML. A 5-fold cross-validation shows MAE of 0.69 ± 0.31, therefore supporting the above conclusion. Given the non-trivial relationship between the BEC and the formal charge, shown in Fig. 8b, it may be difficult to improve this model. Some of the most important features are the radial distribution function at 3.4 Å, 6.2 Å, angle distribution at 143°, dihedral angle at 95°, atomic radii, and heat of fusion of elements. We notice that the standard deviation for cross-validation in both the above two regression models is about half of the actual MAE values (IR: 34.67 vs 77.84 cm−1 and BEC: 0.31 vs 0.69). We provide the learning curves for these regression models in supporting information (see Supplementary Fig. S1). Again, it shows that the models would improve if we add more training data.
We try similar regression models for a maximum value of the PZ tensor (MAE: 0.47 C/m2) and average DL tensor (MAE: 4.91), but these regression models perform poorly, with r2 < 0.2 in both cases. Therefore, we train classification models for predicting the high PZ and high dielectric models using a threshold of 1 C/m2 for PZ and 10 for the dielectric constant, instead of trying to predict exact values, as for IR and BEC. In classification models, the accuracies are evaluated in terms of the area under curve (AUC) of the receiver operating characteristics (ROC) curves (Fig. 8b, d). The ROC curve illustrates the model’s ability to differentiate between high and low-performance materials. We find ROC AUCs of 0.86 and 0.93 for the high PZ and high DL materials. For each property, the 5-fold cross-validation result is shown with the gray region around the ROC curve. We find a very small deviation due to the swapping of datasets during cross-validation, indicating the model is fairly agnostic to the choice of test data. For the above models, in addition to the structural descriptors, chemical descriptors such as heat of fusion and maximum oxidation states of constituent elements are important. We note that the structural features are more important for the above DFPT-based properties than quantities such as formation, exfoliation energies, bandgaps where chemical descriptors were more important as shown in our previous work46. This highlights the higher difficulty in predicting the DFPT-based properties studied in this work, as these properties depend strongly on the structure of the material, rather than mostly on just the type of elements and bonding.
All these models, along with the data, are publicly available at https://www.ctcms.nist.gov/jarvisml/ to predict the performance of new compounds. These ML models can be used to identify materials worth performing the next set of DFPT calculations. For instance, we apply the classification models for high DL values on 1,193,972 materials taken from AFLOW4, Materials-project3, Open Quantum Materials Database (OQMD)5, Crystallography Open Database (COD)89, and JARVIS-DFT databases combined. We convert the structures from these databases into CFID and then we can easily apply the trained models. As the predictions in ML are very fast, we quickly pre-screen 32,188 high DL materials. Now, these new materials can be prioritized in the next set of DFT calculations in our database workflow. We have applied such workflows previously for various quantities such as exfoliation energies, solar-cell efficiency, and thermoelectric with appreciable success.
In summary, we perform a multi-step high-throughput computational screening study for infrared, piezoelectric, and dielectric materials, using DFPT on 5015 inorganic materials. These data constitute one of the largest datasets of infrared, dielectric, and piezoelectric properties. Creating such a repository is a necessary step for data-driven material-design. We verify our workflow by comparing our computational results with several experimental measurements and alternative computational techniques like finite-differences, finding strong agreement. Using this database, we analyze the trends in these properties in terms of the dimensionality of materials (0D/1D/2D/3D), space-group, and chemical constituents, and we find various correlations that quantitively confirm and match some known chemical and physical trends and may help designing improved materials. We have identified several candidate compounds for high-performing infrared-detectors, piezoelectrics, and dielectrics that have not been studied experimentally yet, to the best of our knowledge. To summarize, some of them are listed in Tables 1–5. We observe that many hydroxides and halides have high and low-IR modes, respectively, as shown in Tables 1 and 2, which are trends that can be used for the rational design of IR-detectors. We identify several candidates with high piezoelectric stress and strain coefficients, and notice that they are generally oxides and belong to orthorhombic crystal system. Somewhat surprisingly, we note that PZ-stress and PZ-strain coefficients are not strongly correlated. We verify the inverse relationship between bandgap and dielectric constants at a large scale and predict several new materials with unusually low and high constants (shown in Fig. 5c). We find that most of the halides have low while oxides have high dielectric constants. Using the phonon data at Г-point, we identify many dynamically unstable materials, which may include new functional materials like ferroelectrics. Most of the high piezoelectric, dielectric, and Born-effective charge materials contain elements such as Ti, Zr, Hf, Nb, Ta, and Bi. Most of the high-performance materials in the above classes are three-dimensional (i.e., no vdW bonding) in nature, but there are also several candidates of 2D, 1D, and 0D materials that can be useful for flexible and unique electronics applications. In addition to finding trends and correlations, we use our data to train machine learning surrogate models to expedite the computational screening processes. We achieve high accuracy models for predicting Born-effective charges, maximum IR mode, and classifying high piezoelectric and dielectric materials. We find bond-lengths, electron-affinity, and atomic radii as some of the most important features in the machine learning models which are also intuitive and helpful for physical understanding. We believe that our results, workflow, and tools can act as a guide for the experimental synthesis and characterization of various next-generation materials.
Methods
Density functional perturbation theory (DFPT) calculations
The DFT calculations were carried out using the Vienna Ab-initio simulation package (VASP)90,91. The entire study was managed, monitored, and analyzed using the modular workflow, which we have made available on our GitHub page (https://github.com/usnistgov/jarvis). Please note that commercial software is identified to specify procedures. Such identification does not imply recommendation by the National Institute of Standards and Technology. We use the projected augmented wave method92,93 and OptB88vdW functional94, which gives accurate lattice parameters for both and non-vdW (3D-bulk) solids6,23. In this work, a material is defined as low-dimensional if it contains vdW-bonding in one (2D-bulk), two (1D-bulk), or three (0D-bulk) crystallographic directions23. Both the internal atomic positions and the lattice constants are allowed to relax until the maximal residual Hellmann–Feynman forces on atoms are smaller than 0.001 eV Å−1. The k-point mesh and plane-wave cut-off were converged for each materials using the automated procedure in the JARVIS-DFT47. We assume that achieving absolute convergence in energy is sufficient for obtaining reasonable DFPT results, and this assumption is supported by the agreement between frozen-phonon/finite-difference/finite-displacement method and DFPT as well as linear-optics and DFPT results (discussed later). We also carry out K-point convergence for three materials: Si, AlN, MgF2 as test cases to show that our converged K-points are sufficient to predict the DFPT related data as shown in Supplementary Table S4. We carry out the DFPT calculation on the standard conventional cell for each material. DFPT calculations, as implemented in the VASP code, were used to determine the Born-effective charge tensors and the phonon eigenvectors were determined using phonopy code75.
Given an insulating system with N atoms per cell, with cell volume Ω0, atomic displacements um (m = (1…3N)), homogenous strain ηj (j = (1…6)), homogenous electric fields \({\cal{E}}_\alpha\) (α = (x, y, z)), energy E, the force-constant matrix (Kmn), internal strain tensor (Λmj), the dielectric susceptibility (χαβ), Born dynamical effective charge (Zma), piezoelectric stress tensor (eα,j), and piezoelectric strain tensor (dα,j) are calculated as follows (SI units are used throughout)65,66,67,68,69:
The dielectric constant can be derived from the dielectric susceptibility using:
In Eqs. (4) and (5), the first term represents the electronic contribution and the second term the ionic contribution for DL and PZ constants, respectively.
The PZ is a 3 × 6 tensor, the DL 3 × 3, and the BEC N × 3 × 3 tensor. The IR intensity of phonon modes is calculated using:
where eβ(s, n) is the normalized vibrational eigenvector of the nth phonon mode of the sth atom in the unit cell, and α, β are the cartesian coordinates. Zαβ(s) is the Born-effective charge tensor of sth atom (here we explicitly write both the cartesian indices of Z). These approaches are universal and have been already applied to various material classes. More details about the DFPT formalism can be found in elsewhere65,66.
Machine learning training
Our machine-learning models were trained using GBDT87,88 and CFID46 using a 5-fold cross-validation grid search on the 90% training set. Using the best model found during the grid search, we test the model on the 10% held set and report the performance. The accuracy of the regression models and classification models were evaluated using MAE and ROC curves, respectively. The principal idea behind the GBDT algorithm is to build new base learners to be maximally correlated with the negative gradient of the loss function associated with the whole ensemble. The CFID approach gives a unique representation of a material using structural (such as radial, angle, and dihedral distributions), chemical, and charge properties for a total of 1557 descriptors. We trained machine learning regression models to predict the highest IR frequency and maximum BEC of a material and classification models to predict whether a material has high PZ coefficient (>0.5 C/m2) and dielectric constant (>20).
Data availability
The electronic structure data is available at the JARVIS-DFT website: https://www.ctcms.nist.gov/~knc6/JVASP.html and http://jarvis.nist.gov. The dataset is also available at the Figshare repository: https://doi.org/10.6084/m9.figshare.11916720.
Code availability
Python-language-based codes with examples are given at https://github.com/usnistgov/jarvis.
References
Materials Genome Initiative for Global Competitiveness. https://www.mgi.gov/sites/default/files/documents/materials_genome_initiative-final.pdf (2011).
The First Five Years of the Materials Genome Initiative. https://www.mgi.gov/sites/default/files/documents/mgi-accomplishments-at-5-years-august-2016.pdf (2016).
Jain, A. et al. Commentary: The Materials Project: a materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013).
Curtarolo, S. et al. AFLOWLIB. ORG: a distributed materials properties repository from high-throughput ab initio calculations. Comput. Mater. Sci. 58, 227–235 (2012).
Kirklin, S. et al. The Open Quantum Materials Database (OQMD): assessing the accuracy of DFT formation energies. npj Comput. Mater. 1, 15010 (2015).
Choudhary, K., Kalish, I., Beams, R. & Tavazza, F. High-throughput identification and characterization of two-dimensional materials using density functional theory. Sci. Rep. 7, 5179 (2017).
Draxl, C. & Scheffler, M. J. The NOMAD laboratory: from data sharing to artificial intelligence. J. Phys.: Mater. 2, 036001 (2019).
Choudhary, K. et al. Density functional theory and deep-learning to accelerate data analytics in scanning tunneling microscopy. https://arxiv.org/abs/1912.09027 (2019).
Sharma, V. et al. Rational design of all organic polymer dielectrics. Nat. Commun. 5, 4845 (2014).
Montoya, J. H. & Persson, K. A. A high-throughput framework for determining adsorption energies on solid surfaces. npj Comput. Mater. 3, 14 (2017).
Greeley, J., Jaramillo, T. F., Bonde, J., Chorkendorff, I. & Nørskov, J. K. Materials For Sustainable Energy: A Collection of Peer-Reviewed Research and Review Articles from Nature Publishing Group 280–284 (World Scientific, 2011).
Chen, W. et al. Understanding thermoelectric properties from high-throughput calculations: trends, insights, and comparisons with experiment. J. Mater. Chem. C 4, 4414–4426 (2016).
Carrete, J., Li, W., Mingo, N., Wang, S. & Curtarolo, S. Finding unprecedentedly low-thermal-conductivity half-Heusler semiconductors via high-throughput materials modeling. Phys. Rev. X 4, 011019 (2014).
Cheng, L. et al. Accelerating electrolyte discovery for energy storage with high-throughput screening. J. Phys. Chem. Lett. 6, 283–291 (2015).
Huan, T. D. et al. A polymer dataset for accelerated property prediction and design. Sci. Data 3, 160012 (2016).
Woods-Robinson, R. et al. Assessing high-throughput descriptors for prediction of transparent conductors. Chem. Mater. 30, 8375–8389 (2018).
Tabor, D. P. et al. Accelerating the discovery of materials for clean energy in the era of smart automation. Nat. Rev. Mater. 3, 5 (2018).
Dagdelen, J., Montoya, J., de Jong, M. & Persson, K. Computational prediction of new auxetic materials. Nat. Commun. 8, 323 (2017).
Bassman, L. et al. Active learning for accelerated design of layered materials. npj Comput. Mater. 4, 74 (2018).
Saal, J. E., Kirklin, S., Aykol, M., Meredig, B. & Wolverton, C. Materials design and discovery with high-throughput density functional theory: The Open Quantum Materials Database (OQMD). JOM 65, 1501–1509 (2013).
Choudhary, K. et al. Accelerated discovery of efficient solar-cell materials using quantum and machine-learning methods. Chem. Mater. 31, 5900–5908 (2019).
Choudhary, K., Garrity, K. F., Jiang, J., Pachter, R. & Tavazza, F. Computational search for magnetic and non-magnetic 2D topological materials using unified spin–orbit spillage screening. npj Comput. Mater. 6, 49 (2020).
Choudhary, K., Cheon, G., Reed, E. & Tavazza, F. Elastic properties of bulk and low-dimensional materials using van der Waals density functional. Phys. Rev. B 98, 014107 (2018).
Chung, Y. G. et al. Computation-ready, experimental metal–organic frameworks: a tool to enable high-throughput screening of nanoporous crystals. Chem. Mater. 26, 6185–6192 (2014).
Green, M. L. et al. Fulfilling the promise of the materials genome initiative with high-throughput experimental methodologies. J. Appl. Phys. Rev. 4, 011105 (2017).
Hattrick-Simpers, J. R., Gregoire, J. M. & Kusne, A. G. Perspective: Composition–structure–property mapping in high-throughput experiments: turning data into knowledge. APL Mater. 4, 053211 (2016).
Zakutayev, A. et al. An open experimental database for exploring inorganic materials. Sci. Data 5, 180053 (2018).
Kang, K., Meng, Y. S., Bréger, J., Grey, C. P. & Ceder, G. Electrodes with high power and high capacity for rechargeable lithium batteries. Science 311, 977–980 (2006).
Huang, X., Ackland, G. J. & Rabe, K. M. Crystal structures and shape-memory behaviour of NiTi. Nat. Mater. 2, 307 (2003).
Yan, F. et al. Design and discovery of a novel half-Heusler transparent hole conductor made of all-metallic heavy elements. Nat. Commun. 6, 7308 (2015).
Sharma, V. et al. Effects of moisture on (La, A)MnO3 (A = Ca, Sr, and Ba) solid oxide fuel cell cathodes: a first-principles and experimental study. J. Mater. Chem. A 4, 5605–5615 (2016).
Nayak, S. K. et al. Insight into point defects and impurities in titanium from first principles. npj Comput. Mater. 4, 11 (2018).
Kumar, P., Sharma, V., Reboredo, F. A., Yang, L.-M. & Pushpa, R. Tunable magnetism in metal adsorbed fluorinated nanoporous graphene. Sci. Rep. 6, 31841 (2016).
Stampfl, C., Kreuzer, H., Payne, S., Pfnür, H. & Scheffler, M. First-principles theory of surface thermodynamics and kinetics. Phys. Rev. Lett. 83, 2993 (1999).
Reuter, K. & Scheffler, M. First-principles atomistic thermodynamics for oxidation catalysis: surface phase diagrams and catalytically interesting regions. Phys. Rev. Lett. 90, 046103 (2003).
Mannodi-Kanakkithodi, A. et al. Scoping the polymer genome: a roadmap for rational polymer dielectrics design and beyond. Mater. Today 21, 785–796 (2018).
Hautier, G., Fischer, C. C., Jain, A., Mueller, T. & Ceder, G. Finding nature’s missing ternary oxide compounds using machine learning and density functional theory. Chem. Mater. 22, 3762–3767 (2010).
Pankajakshan, P. et al. Machine learning and statistical analysis for materials science: stability and transferability of fingerprint descriptors and chemical insights. Chem. Mater. 29, 4190–4201 (2017).
Castelli, I. E. et al. New light‐harvesting materials using accurate and efficient bandgap calculations. Adv. Energy Mater. 5, 1400915 (2015).
NIMS Database. https://mits.nims.go.jp/index_en.html.
Choudhary, K. et al. Computational screening of high-performance optoelectronic materials using OptB88vdW and TB-mBJ formalisms. Sci. Data 5, 180082 (2018).
Choudhary, K., Garrity, K. F. & Tavazza, F. High-throughput discovery of topologically non-trivial materials using spin-orbit spillage. Sci. Rep. 9, 8534 (2019).
Choudhary, K., Garrity, K. & Tavazza, F. Data-driven discovery of 3D and 2D thermoelectric materials. https://arxiv.org/abs/1906.06024 (2019).
Choudhary, K. et al. Evaluation and comparison of classical interatomic potentials through a user-friendly interactive web-interface. Sci. Data 4, 160125 (2017).
Choudhary, K. et al. High-throughput assessment of vacancy formation and surface energies of materials using classical force-fields. J. Phys.: Condens. Matter 30, 395901 (2018).
Choudhary, K., DeCost, B. & Tavazza, F. Machine learning with force-field-inspired descriptors for materials: fast screening and mapping energy landscape. Phys. Rev. Mater. 2, 083801 (2018).
Choudhary, K. & Tavazza, F. Convergence and machine learning predictions of Monkhorst-Pack k-points and plane-wave cut-off in high-throughput DFT calculations. Comput. Mater. Sci. 161, 300–308 (2019).
De Jong, M., Chen, W., Geerlings, H., Asta, M. & Persson, K. A. A database to enable discovery and design of piezoelectric materials. Sci. Data 2, 150053 (2015).
Petousis, I. et al. High-throughput screening of inorganic compounds for the discovery of novel dielectric and optical materials. Sci. Data 4, 160134 (2017).
Petousis, I. et al. Benchmarking density functional perturbation theory to enable high-throughput screening of materials for dielectric constant and refractive index. Phys. Rev. B 93, 115151 (2016).
Roy, A., Bennett, J. W., Rabe, K. M. & Vanderbilt, D. Half-Heusler semiconductors as piezoelectrics. Phys. Rev. Lett. 109, 037602 (2012).
Nyquist, R. A. & Kagel, R. O. Handbook of Infrared and Raman Spectra of Inorganic Compounds and Organic Salts: Infrared Spectra of Inorganic Compounds, Vol. 4 (Academic Press, 2012).
Giannozzi, P. & Baroni, S. Vibrational and dielectric properties of C60 from density‐functional perturbation theory. J. Chem. Phys. 100, 8537–8539 (1994).
Porezag, D. & Pederson, M. R. Infrared intensities and Raman-scattering activities within density-functional theory. Phys. Rev. B 54, 7830 (1996).
Vollmer, M. & Möllmann, K.-P. Infrared Thermal Imaging: Fundamentals, Research and Applications (John Wiley & Sons, 2017).
Glass, I. S. Handbook of Infrared Astronomy, Vol. 1 (Cambridge University Press, 1999).
Black, C., Haughey, S. A., Chevallier, O. P., Galvin-King, P. & Elliott, C. T. A comprehensive strategy to detect the fraudulent adulteration of herbs: the oregano approach. J. Food Chem. 210, 551–557 (2016).
Safari, A. & Akdogan, E. K. Piezoelectric and Acoustic Materials for Transducer Applications (Springer Science & Business Media, 2008).
Hao, X. J. A review on the dielectric materials for high energy-storage application. J. Adv. Dielectr. 3, 1330001 (2013).
Kotecki, D. E. A review of high dielectric materials for DRAM capacitors. J. Integr. Ferroelectr. 16, 1–19 (1997).
Gautschi, G. Piezoelectric Sensorics 73–91 (Springer, 2002).
Nalwa, H. S. Handbook of Low and High Dielectric Constant Materials and Their Applications, Two-Volume Set (Elsevier, 1999).
Uchino, K. Piezoelectric Actuators and Ultrasonic Motors, Vol. 1 (Springer Science & Business Media, 1996).
Kasap, S. O. Principles of Electronic Materials and Devices. Vol. 2, McGraw-Hill: New York, 2006.
Gonze, X. & Lee, C. Dynamical matrices, Born effective charges, dielectric permittivity tensors, and interatomic force constants from density-functional perturbation theory. Phys. Rev. B 55, 10355 (1997).
Wu, X., Vanderbilt, D. & Hamann, D. Systematic treatment of displacements, strains, and electric fields in density-functional perturbation theory. Phys. Rev. B 72, 035105 (2005).
Baroni, S., Giannozzi, P. & Testa, A. Green’s-function approach to linear response in solids. Phys. Rev. Lett. 58, 1861 (1987).
Gonze, X. Perturbation expansion of variational principles at arbitrary order. Phys. Rev. A 52, 1086 (1995).
Xie, L. & Zhu, J. The electronic structures, Born effective charges, and interatomic force constants in BaMO3 (M = Ti, Zr, Hf, Sn): a comparative first-principles study. J. Am. Ceram. Soc. 95, 3597–3604 (2012).
Skelton, J. M. et al. Lattice dynamics of the tin sulphides SnS2, SnS and Sn2S3: vibrational spectra and thermal transport. Phys. Chem. Chem. Phys. 19, 12452–12465 (2017).
Kendrick, J. & Burnett, A. D. PDielec: the calculation of infrared and terahertz absorption for powdered crystals. J. Comput. Chem. 37, 1491–1504 (2016).
De Jong, M. et al. Charting the complete elastic properties of inorganic crystalline compounds. Sci. Data 2, 150009 (2015).
Beams, R. et al. Characterization of few-layer 1T′ MoTe2 by polarization-resolved second harmonic generation and Raman scattering. ACS Nano 10, 9626–9636 (2016).
Molina-Sanchez, A. & Wirtz, L. Phonons in single-layer and few-layer MoS2 and WS2. Phys. Rev. B 84, 155413 (2011).
Togo, A. & Tanaka, I. First principles phonon calculations in materials science. Scr. Mater. 108, 1–5 (2015).
Sirianni, M. et al. The photometric performance and calibration of the Hubble Space Telescope Advanced Camera for Surveys. Publ. Astron. Soc. Pac. 117, 1049 (2005).
Roberts, S. Dielectric and piezoelectric properties of barium titanate. Phys. Rev. 71, 890 (1947).
Liu, S. & Cohen, R. Origin of negative longitudinal piezoelectric effect. Phys. Rev. Lett. 119, 207601 (2017).
Herve, P. & Vandamme, L. General relation between refractive index and energy gap in semiconductors. Infrared Phys. Technol. 35, 609–615 (1994).
Garrity, K. F. High-throughput first-principles search for new ferroelectrics. Phys. Rev. B 97, 024115 (2018).
Ong, S. P. et al. Python Materials Genomics (pymatgen): a robust, open-source python library for materials analysis. Comput. Mater. Sci. 68, 314–319 (2013).
Mueller, T., Kusne, A. G. & Ramprasad, R. Machine learning in materials science: recent progress and emerging applications. Rev. Comput. Chem. 29, 186–273 (2016).
Ward, L. & Wolverton, C. Atomistic calculations and materials informatics: a review. Curr. Opin. Solid State Mater. Sci. Eng. A 21, 167–176 (2017).
Ramprasad, R., Batra, R., Pilania, G., Mannodi-Kanakkithodi, A. & Kim, C. Machine learning in materials informatics: recent applications and prospects. npj Comput. Mater. 3, 54 (2017).
Schmidt, J., Marques, M. R., Botti, S. & Marques, M. A. Recent advances and applications of machine learning in solid-state materials science. npj Comput. Mater. 5, 83 (2019).
Vasudevan, R. K. et al. Materials science in the artificial intelligence age: high-throughput library generation, machine learning, and a pathway from correlations to the underpinning physics. MRS Commun. 9, 821–838 (2019).
Friedman, J, Hastie, T. & Tibshirani, R. The elements of statistical learning. Springer Series in Statistics. Vol. 1, Springer: New York, 2001.
Ke, G. et al. Advances in Neural Information Processing Systems 3146–3154 (2017).
Gražulis, S. et al. Crystallography Open Database (COD): an open-access collection of crystal structures and platform for world-wide collaboration. Nucleic Acids Res. 40, D420–D427 (2011).
Kresse, G. & Furthmüller, J. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B 54, 11169 (1996).
Kresse, G. & Furthmüller, J. Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set. Comput. Mater. Sci. 6, 15–50 (1996).
Blöchl, P. E. Projector augmented-wave method. Phys. Rev. B 50, 17953 (1994).
Kresse, G. & Joubert, D. From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B 59, 1758 (1999).
Klimeš, J., Bowler, D. R. & Michaelides, A. Chemical accuracy for the van der Waals density functional. J. Phys.: Condens. Matter 22, 022201 (2009).
Acknowledgements
K.C., K.F.G., and F.T. thank National Institute of Standards and Technology for funding, computational, and data-management resources. K.C. also thank the computational support from XSEDE computational resources under allocation number TG-DMR 190095. VS acknowledges the XSEDE computational resources through allocation grant number TG-DMR160051 and the Advanced Computer Facility (ACF) of the University of Tennessee for computational resources.
Author information
Authors and Affiliations
Contributions
K.C. and K.F.G. jointly developed the workflow. K.C. carried out the high-throughput DFT calculations. K.C., K.F.G., and F.T. analyzed the DFT data with help from V.S. A.J.B. and A.R.H.W. helped in the experimental validation section. All authors contributed in writing the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Choudhary, K., Garrity, K.F., Sharma, V. et al. High-throughput density functional perturbation theory and machine learning predictions of infrared, piezoelectric, and dielectric responses. npj Comput Mater 6, 64 (2020). https://doi.org/10.1038/s41524-020-0337-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41524-020-0337-2
This article is cited by
-
Representing Born effective charges with equivariant graph convolutional neural networks
Scientific Reports (2025)
-
Unified differentiable learning of electric response
Nature Communications (2025)
-
Universal machine learning interatomic potentials are ready for phonons
npj Computational Materials (2025)
-
Accurate piezoelectric tensor prediction with equivariant attention tensor graph neural network
npj Computational Materials (2025)
-
Dielectric tensor prediction for inorganic materials using latent information from preferred potential
npj Computational Materials (2024)