SuperBand: an Electronic-band and Fermi surface structure database of superconductors

Zhang, Tengdong; Suo, Chenyu; Wu, Yanling; Xu, Xiaodan; Liu, Yong; Yao, Dao-Xin; Li, Jun

doi:10.1038/s41597-025-05015-7

Data Descriptor
Open access
Published: 06 May 2025

Scientific Data volume 12, Article number: 744 (2025)

Abstract

In comparison to simpler data such as chemical formulas and lattice structures, electronic band structure data provide a more fundamental and intuitive insight into superconducting phenomena. In this work, we generate lattice structure files for superconductors that are optimized for density functional theory (DFT) calculations. Through DFT, we obtain electronic band data for superconductors, including band structures, density of states (DOS), and Fermi surface data. Additionally, we outline efficient methodologies for acquiring structure data, establish high-throughput DFT computational protocols, and introduce tools for extracting these data from large-scale DFT calculations. As an example, we have curated a dataset containing information on 1,362 superconductors along with their experimentally determined superconducting transition temperatures (T_c), as well as 1,112 experimentally verified non-superconducting materials, which is well-suited for machine learning applications. This dataset is constructed with a focus on data quality, accessibility, and usability for machine learning models aimed at predicting superconducting properties.

Background & Summary

Superconductivity, the phenomenon of zero electrical resistance in a material, is of profound scientific and practical significance. This unique state allows electric current to flow through a material without any energy dissipation, making it an essential field of study with numerous potential applications. However, the practical use of superconductors is often constrained by the requirement for extremely low temperatures or high pressures. Since its initial discovery in 1911, the quest for superconductors that function at higher temperatures has been a major focus, as such advancements would enable a wider range of technological applications.

Superconductivity is a well-documented phenomenon, with over 10,000 superconductors identified to date^1,2. Prominent examples include cuprate³, iron-based⁴ and nickel-based superconductors⁵, which highlight the typical progression in the field: experimental physicists first synthesize new superconductors, followed by theoretical physicists who seek to unravel the fundamental mechanisms of superconductivity through a variety of models and theoretical frameworks. Despite the existence of numerous theories in condensed matter physics that attempt to explain superconductivity, predicting new high-temperature superconductors remains one of the greatest challenges in the field.

In condensed matter theory, energy-band theory serves as a cornerstone for understanding the electronic properties of materials. First-principles calculations based on density functional theory (DFT) play a crucial role in this regard, offering detailed insights into a material’s electronic band structure and density of states (DOS). These elements are instrumental in determining the electrical properties of a material⁶. Since superconductivity is inherently an electrical property, it follows that the energy-band theory derived from DFT should be applicable for explaining and predicting superconducting behavior⁷.

Theoretically, the electronic band structure obtained from DFT calculations provides essential parameters for understanding superconducting behavior. These parameters are critical for elucidating both conventional superconductors, such as those explained by BCS theory (e.g., the superconducting gap and electron-phonon coupling constants⁸), and unconventional superconductors, where strong correlations⁹ and spin fluctuations¹⁰ play a pivotal role.

For instance, the electron-phonon coupling constant in ambient-pressure BCS superconductors like MgB₂, which boasts a relatively high transition temperature¹¹, can be extracted through DFT calculations. Likewise, DFT has been instrumental in identifying the key parameters in high-pressure hydrogen-rich superconductors^12,13,14. Furthermore, two-dimensional carbon-based materials^15,16, nonlinear phonon properties in YBa₂Cu₃O_6.5¹⁷, and magnetic interactions in iron-based superconductors¹⁰ are examples where DFT has significantly contributed to understanding unconventional superconductivity.

Moreover, DFT has provided insights into the tight-binding model parameters¹⁸, electronic Coulomb correlation terms in iron-based superconductors⁹, spin-orbit coupling in heavy fermion systems¹⁹, and interlayer interactions in bilayer twisted graphene²⁰. It also helps illuminate the σ-bonds in high-pressure nickel-based superconductors²¹, the superconducting pairing symmetry in bilayer silicene²², and the unconventional pairing mechanisms in two-dimensional carbon materials^23,24. These examples demonstrate the broad applicability of DFT in aiding our understanding of both conventional and unconventional superconductivity.

In contrast to simpler data, such as chemical formulas and lattice structures, electronic band structure data provides a more fundamental and intuitive perspective on superconducting phenomena. This deeper insight is particularly relevant in the context of recent advancements in big data processing techniques, including machine learning (ML) approaches²⁵. The potential of ML to analyze complex electronic properties highlights the need for a comprehensive database of electronic band structures. Such a database would enable large-scale analyses, fostering the discovery of new superconductors and enhancing the understanding of their underlying mechanisms. The development of this resource is essential for advancing both theoretical and experimental research in the field of superconductivity.

In this paper, we introduce SuperBand, a comprehensive electronic band and Fermi surface structure database for superconductors, as depicted in Fig. 1. We generate lattice structure files optimized for DFT calculations and, through these calculations, obtain crucial electronic band data for experimentally realized superconductors. This dataset includes the electronic band structure, DOS, and Fermi surface information. Additionally, we outline methods for the efficient acquisition of structural data, provide high-throughput DFT calculation protocols, and offer programs designed to extract the aforementioned data from large-scale DFT computations. In SuperBand, we have compiled a dataset of 1,362 superconductors, including their experimentally determined superconducting transition temperatures (T_c), and 1,112 experimentally verified non-superconducting materials, which is ideal for ML applications.

Methods

The chemical formulas and T_c data for superconductors presented in this paper are mainly sourced from the 2022 edition of the SuperCon database¹. This extensive database contains information on 33,458 materials, including 7,190 non-superconducting compounds and 26,268 superconductors with experimentally measured T_c values. To ensure the most up-to-date dataset, we supplemented these materials with superconductors newly identified after 2022 by reviewing publicly available literature.

As depicted in Fig. 2, the crystal structure data utilized in this study are primarily obtained from the Materials Project (MP)²⁶, with additional contributions from the Open Quantum Materials Database (OQMD)²⁷. Since a significant proportion of superconductors are derived through doping parent compounds with various elements, we adopt the 3DSC methodology² to deal with lattice doping. To handle the complexities of doped structures, supercell processing is applied, replacing doped atoms and generating ordered crystallographic information files (CIFs) compatible with density functional theory (DFT) calculations.

Data Cleaning

The SuperCon database¹ contains numerous duplicate data entries, necessitating a rigorous data cleaning process. A key distinction of this work, compared to previous studies, lies in the determination of ordered crystal lattices for superconductors suitable for DFT calculations. The initial phase involved retrieving CIFs for lattice structures from relevant databases, including the MP and the OQMD. It should be noted that CIFs obtained from these public sources often contain disordered structures. In cases where CIFs were unavailable, we constructed some disordered lattice structure files manually.

To resolve this disorder, we employ an order transformation method that retains only the ordered structures with the lowest Ewald energy²⁸. This method efficiently standardizes lattice structures with co-occupying atoms to generate ordered configurations. However, the method encounters difficulties when applied to materials with multiple-element co-occupations or a large number of co-occupied atomic sites. Consequently, 14 materials remained for which disorder could not be resolved, including K₂RbC₆₀ (ID 15960), TiVNbTa (ID 16063), and Cu_0.65La_1.83Ni_0.35Sr_0.17O₄ (ID 17788). These materials were excluded from further DFT calculations due to unresolved structural complexities.

Subsequently, we applied the 3DSC methodology² to handle chemical formulas, including its definitions of exact matching, similarity, doping, and unmatched cases. We applied this methodology to the SuperCon database¹ to determine whether the chemical formulas could be matched with the ordered structures collected. For materials in the SuperCon database accompanied by space group information, we also performed a space group matching analysis on the relevant CIFs to identify the most closely corresponding material structure.

When a fully matching or similar CIF could not be identified, we searched for materials with chemically doped formulas. If the doping concentration exceeds 0.75, the doped atoms are replaced directly. For doping concentrations exceeding 0.45 (0.29, 0.19, 0.1), supercell expansions of 1 × 1 × 2 (1 × 1 × 3, 2 × 2 × 1, 2 × 2 × 2) are performed to accommodate the doped atoms. The doped atoms are then replaced while preserving the lattice symmetry as much as possible. This process is repeated until the expanded and substituted supercell achieves chemical similarity with the given chemical formula.
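The threshold rule above can be written as a small lookup. The following is a minimal sketch and not the authors' code: the names `SUPERCELL_RULES` and `choose_supercell` are hypothetical, direct substitution above 0.75 is represented here as a 1 × 1 × 1 cell, and returning `None` at or below 0.1 is an assumption based on the parent-compound treatment described later in this section.

```python
from typing import Optional, Tuple

# (exclusive lower bound on doping concentration, supercell expansion).
# Values are taken from the text; the 1x1x1 entry stands for direct substitution.
SUPERCELL_RULES = [
    (0.75, (1, 1, 1)),
    (0.45, (1, 1, 2)),
    (0.29, (1, 1, 3)),
    (0.19, (2, 2, 1)),
    (0.10, (2, 2, 2)),
]


def choose_supercell(concentration: float) -> Optional[Tuple[int, int, int]]:
    """Return the supercell expansion for a given doping concentration.

    Returns None when the concentration does not exceed 0.1, i.e. when the
    dopant is assumed to be neglected and the parent structure is used.
    """
    if not 0.0 <= concentration <= 1.0:
        raise ValueError("doping concentration must lie in [0, 1]")
    for lower_bound, expansion in SUPERCELL_RULES:
        if concentration > lower_bound:
            return expansion
    return None


if __name__ == "__main__":
    for x in (0.8, 0.5, 0.33, 0.25, 0.125, 0.05):
        print(x, choose_supercell(x))
```

Each expansion gives the smallest cell in which replacing one site approximates the target concentration: one site in two cells corresponds to 0.5, one in three to about 0.33, one in four to 0.25, and one in eight to 0.125.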

It is important to note that the introduction of doping does not necessarily alter the T_c of a material. In some cases, the incorporation of dopants has little to no discernible effect on the superconducting properties. For such doped superconductors, it is sufficient to disregard minor dopants that do not significantly impact superconductivity, as seen in SiV₃-based superconductors. A threshold of 0.2 is thus established to differentiate between doping and similarity for these materials.

However, for certain other systems, such as iron-based superconductors, even a small amount of elemental doping can substantially shift the Fermi level or modify DOS near the Fermi surface. These changes can markedly enhance or suppress superconductivity, often accompanied by a significant shift in T_c. For such materials, a more stringent threshold of 0.1 is applied to distinguish between doping and similarity, given the pronounced sensitivity of their superconducting properties to minor doping modifications.

Under these circumstances, we generated a limited set of CIFs representing the parent compounds of each doping series with doping below 0.1. Taking the YBCO system as an example²⁹, we used the YBa₂Cu₃O₇ CIF to represent 344 distinct doped variant superconductors, assigning the maximum observed T_c of 95 K as the training label, thereby indicating the highest T_c achievable through doping modifications of the YBa₂Cu₃O₇ parent compound. Indeed, the identification of novel parent materials for superconductivity through neural network algorithms presents both significant challenges and remarkable potential.
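The labeling rule described above, in which one parent CIF stands for all of its lightly doped variants and carries the maximum observed T_c, can be sketched as a simple group-by. This is an illustration only: the function name `label_parents`, the record layout, and the doped formulas and T_c values in the usage example are hypothetical, except for the 95 K maximum quoted in the text.

```python
from typing import Dict, Iterable, Tuple


def label_parents(records: Iterable[Tuple[str, str, float]]) -> Dict[str, float]:
    """Map each parent structure to the highest T_c among its doped variants.

    Each record is (parent_id, doped_formula, tc_kelvin). The doped formula
    is kept in the record for traceability but does not affect the label.
    """
    best: Dict[str, float] = {}
    for parent_id, _formula, tc in records:
        if parent_id not in best or tc > best[parent_id]:
            best[parent_id] = tc
    return best


if __name__ == "__main__":
    # Hypothetical entries for illustration; not taken from the dataset.
    demo = [
        ("YBa2Cu3O7", "variant-A", 60.0),
        ("YBa2Cu3O7", "variant-B", 95.0),
        ("YBa2Cu3O7", "variant-C", 88.0),
    ]
    print(label_parents(demo))
```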

Following the matching process with CIFs, we obtained data for 8,590 materials with non-duplicate chemical formulas, including 6,780 superconductors. Notably, compared to the reports on superconductors, there is a significant scarcity of reports on non-superconducting materials. Although the number of non-superconducting materials likely far exceeds that of superconductors, research on superconductivity often omits such materials from published studies.

In the realm of ML and big data research, this lack of data on non-superconducting materials hinders the reliability of predictions related to superconductivity. Non-superconducting materials are just as critical to the study of superconductivity, as they offer valuable insight into the boundaries of superconducting behavior. Therefore, we also provide data for 1,780 materials that have been experimentally verified to lack superconducting properties.

Notably, a significant portion of the 6,780 superconductors are represented by the same CIF. We identified a total of 1,763 unique CIFs. It is inappropriate to classify a material as a distinct superconductor based on a minor doping of 0.01 of another element. As a result, the CIF itself is used as the definitive criterion for identifying unique superconductors in this study. Therefore, the subsequent sections of this paper focus exclusively on the 1,763 superconductors corresponding to these unique CIFs, as depicted in Fig. 3.

DFT calculation

The projector-augmented wave (PAW) method, implemented in the Vienna Ab initio Simulation Package (VASP), is employed to carry out our DFT calculations³⁰. The generalized gradient approximation (GGA) with the Perdew-Burke-Ernzerhof (PBE) functional is used to treat the electron exchange-correlation potential. High-throughput DFT calculations are facilitated by the Atomate open-source package³¹, with parameter settings derived from the MIT High-Throughput Project³². For workflow automation, we employ the FireWorks package³³, which efficiently manages the task flow for structure optimization, static calculations, and non-self-consistent field calculations.

The plane-wave cut-off energy is set to 520 eV. In structure optimization, a Monkhorst-Pack k-point mesh with a spacing of 2π × 0.04 Å⁻¹ is employed and the self-consistent convergence threshold is set to 5 × 10⁻⁵ eV. In static calculations, we employ a Monkhorst-Pack k-point mesh with a spacing of 2π × 0.02 Å⁻¹ and set the self-consistent convergence threshold to 1 × 10⁻⁵ eV.
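To make the k-point spacing concrete, the sketch below converts a spacing into Monkhorst-Pack subdivisions for a given cell. It assumes the common convention that each subdivision is the reciprocal-lattice vector length divided by the spacing, rounded up, with the 2π factor cancelling on both sides. The function name `kmesh_from_spacing` is hypothetical, and the rounding used by the authors' Atomate workflow may differ.

```python
import math

import numpy as np


def kmesh_from_spacing(lattice, spacing):
    """Return Monkhorst-Pack subdivisions (n1, n2, n3) for a target spacing.

    lattice: 3x3 array whose rows are the cell vectors in Angstrom.
    spacing: k-point spacing in 1/Angstrom, quoted without the 2*pi factor
             (e.g. 0.04 for a spacing of 2*pi x 0.04 1/Angstrom).
    """
    lattice = np.asarray(lattice, dtype=float)
    # Rows of inv(lattice).T are the reciprocal vectors without the 2*pi factor.
    reciprocal = np.linalg.inv(lattice).T
    lengths = np.linalg.norm(reciprocal, axis=1)
    # The small tolerance avoids rounding up on floating-point noise.
    return tuple(max(1, math.ceil(length / spacing - 1e-9)) for length in lengths)


if __name__ == "__main__":
    cubic = np.eye(3) * 4.0  # hypothetical 4 Angstrom cubic cell
    print(kmesh_from_spacing(cubic, 0.04))  # structure optimization spacing
    print(kmesh_from_spacing(cubic, 0.02))  # static calculation spacing
```

Halving the spacing roughly doubles the number of subdivisions along each axis, which is why the static calculations are considerably more expensive than the structure optimization.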

Collinear magnetism is consistently incorporated in all calculations. Transition metal elements are automatically assigned magnetic moments. For example, Mn atoms are generally set to 5 μB, while Mn³⁺ and Mn⁴⁺ ions are assigned 4 μB and 3 μB, respectively, and Fe atoms are configured with 5 μB; other magnetic atoms follow standard magnetic moment settings. The GGA+U approach is systematically implemented across all calculations, with Hubbard U parameters assigned to specific transition metals following established computational protocols: for example, Ag is set to U = 1.5 eV and Co to U = 3.4 eV. Advanced methods such as spin-orbit coupling (SOC), dynamical mean-field theory (DMFT), HSE06, or GW calculations are more accurate for capturing strong correlation effects in systems like cuprates or nickelates. However, because such methods are computationally intractable for high-throughput workflows, we deliberately omit them from all simulations to optimize computational resource utilization. Our approach prioritizes scalability and consistency, acknowledging that DFT+U and collinear magnetism serve as a pragmatic first step for large-scale electronic structure analysis.

The band structure and DOS data from non-self-consistent field calculations are extracted for analysis. We use the Pymatgen package^26,27 to plot the band structure and DOS. For Fermi surface generation, analysis, and visualization, the IFermi package³⁴ is used, enabling detailed examination of electronic properties crucial for understanding superconducting mechanisms.

Data standardization

The availability and standardization of data are critical prerequisites for the development of ML models aimed at predicting material properties. In our DFT calculations, the electronic bands of different materials show significant variations due to the MIT-initialized DFT parameter settings³². Initially, lattice symmetry is considered to reduce computational costs, but the equivalent k-point values differ across space groups. Moreover, the k-space mesh density must be adjusted based on the number of atoms and lattice dimensions in each unit cell to enhance the accuracy of the calculations.

To normalize the k-space band data, we employ the IFermi package³⁴ to standardize the k-space by considering only symmetry-equivalent k-points. Interpolation is then applied to map the k-space mesh coordinates onto a uniform k grid of 32 × 32 × 32, ensuring consistency across materials for ML applications.

After completing the standardization process, the number of electronic bands varies among different materials. In constructing a standardized dataset for ML, one could theoretically pad the training set tensors with zero tensors to maintain uniformity. However, this approach wastes computational resources and diminishes the efficiency of the calculations. Studies on both conventional and unconventional superconductors have demonstrated that the DOS near the Fermi surface has a substantial impact on superconductivity, while bands far from the Fermi surface contribute minimally. Therefore, focusing on the electronic bands in close proximity to the Fermi surface is more computationally efficient and enhances the relevance of the dataset for predicting superconducting properties.

Accordingly, we limit our analysis to the 18 electronic bands around the Fermi surface. Each band is mapped onto a 32 × 32 × 32 grid, yielding band data with dimensions of 18 × 32 × 32 × 32. This targeted approach ensures that our dataset efficiently captures the most relevant features for predicting superconducting properties. These data can be systematically augmented through various techniques: lattice orientation variations can be achieved through simple dimensional permutations, while lattice geometry modifications can be implemented via transformation matrices. For instance, primitive and conventional cell representations in face-centered cubic systems can be interconverted. Additionally, repeated selective sampling of bands near the Fermi level enables effective simulation of band structure folding in supercells. Because storage is limited, we balance storage efficiency against computational precision by prioritizing the critical band structure information near the Fermi surface. Importantly, the incorporation of data augmentation techniques is essential for enhancing predictive accuracy in AI training, as demonstrated in our companion paper³⁵.
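The band truncation and the axis-permutation augmentation described above can be illustrated with NumPy. This is a sketch under stated assumptions rather than the procedure used to build SuperBand: the text does not specify how the 18 bands are chosen, so the window here is centered on the bands whose mean energy straddles the Fermi level, and energies are shifted so that the Fermi level is zero. The names `select_bands_near_fermi` and `permute_axes` are hypothetical.

```python
import numpy as np

N_BANDS = 18  # number of bands kept, as stated in the text


def select_bands_near_fermi(energies, e_fermi, n_keep=N_BANDS):
    """Keep a contiguous window of n_keep bands around the Fermi level.

    energies: array of shape (n_bands, nx, ny, nz), ordered by band index.
    Returns an array of shape (n_keep, nx, ny, nz) referenced to e_fermi.
    """
    energies = np.asarray(energies, dtype=float)
    n_bands = energies.shape[0]
    if n_bands < n_keep:
        raise ValueError("fewer bands available than requested")
    # Count bands whose mean energy lies below the Fermi level (assumed criterion).
    band_means = energies.reshape(n_bands, -1).mean(axis=1)
    n_below = int(np.sum(band_means < e_fermi))
    # Center the window on the Fermi level and clamp it to the valid range.
    start = min(max(n_below - n_keep // 2, 0), n_bands - n_keep)
    return energies[start:start + n_keep] - e_fermi


def permute_axes(bands, order=(1, 0, 2)):
    """Relabel the three k-axes, leaving the band axis in place."""
    axes = (0,) + tuple(axis + 1 for axis in order)
    return np.transpose(bands, axes)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Hypothetical data: 40 bands on the 32 x 32 x 32 grid used in the text.
    raw = np.sort(rng.normal(size=(40, 32, 32, 32)), axis=0)
    window = select_bands_near_fermi(raw, e_fermi=0.0)
    print(window.shape, permute_axes(window).shape)
```

For a cubic grid the permutation leaves the tensor shape unchanged, so augmented samples can be fed to the same model without further reshaping.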

Data Records

The DFT calculations yield band structure data for 1,362 distinct superconductors as well as 1,112 experimentally verified non-superconducting materials.

Data Organization

These band structure data, hosted on Science Data Bank³⁶ and combined with the experimentally reported T_c values, form the basis of an ML training set. The dataset is stored in HDF5 format, providing a platform-independent, efficient means of accessing scientific and engineering data. In addition to the normalized band structure data, we also include several critical features for ML: orbital-resolved DOS data, chemical formulas, space group symmetries, lattice constants, atomic species, and atomic positions.

Within the HDF5 architecture, all data pertaining to a specific superconductor are organized into a Group (analogous to a directory). Each Group encapsulates critical metadata within the Group’s Attribute, including experimentally reported T_c, chemical formula, space group system, space group number, and cell volume. The Group further comprises multiple datasets categorized as follows:

1. Crystallographic Parameters:
- Atomic species data
- Unit cell vectors (in Ångström units)
- Atomic coordinates (in Ångström units)
2. Electronic Structure Data:
- Fermi surface data
- DOS data partitioned by orbital contributions (s, p, d, f), normalized to a uniform length of 2001 data points
- Reciprocal space coordinates
3. Normalized Energy Band Data: Standardized energy band datasets are structured as four-dimensional tensors (18 × 32 × 32 × 32), encoding band indices and momentum-space sampling.
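As an illustration of how a DOS curve can be brought to the uniform length of 2001 points listed above, the sketch below resamples one orbital-projected DOS onto a fixed energy grid by linear interpolation. The energy window of −10 eV to 10 eV, the zero fill outside the computed range, and the function name `resample_dos` are assumptions made for this example; the text does not state the window or the interpolation scheme actually used.

```python
import numpy as np


def resample_dos(energies, dos, e_min=-10.0, e_max=10.0, n_points=2001):
    """Linearly interpolate a DOS curve onto a uniform energy grid.

    energies: increasing 1D array of energies (eV, relative to the Fermi level).
    dos: 1D array of DOS values at those energies.
    Returns (grid, dos_on_grid), each of length n_points. Grid points outside
    the range of the input energies are filled with zero.
    """
    energies = np.asarray(energies, dtype=float)
    dos = np.asarray(dos, dtype=float)
    grid = np.linspace(e_min, e_max, n_points)
    return grid, np.interp(grid, energies, dos, left=0.0, right=0.0)


if __name__ == "__main__":
    # Hypothetical Gaussian-shaped DOS on an irregular number of points.
    e = np.linspace(-12.0, 12.0, 777)
    d_orbital = np.exp(-0.5 * (e / 2.0) ** 2)
    grid, resampled = resample_dos(e, d_orbital)
    print(grid.shape, resampled.shape)
```

Applying the same grid to the s, p, d, and f projections yields arrays that can be stacked into a single fixed-size feature per material.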

Data Summary

We compiled a summary of the literature documenting the initial experimental synthesis of these superconductors. For 159 materials, no corresponding references were found; for the remaining 1,604 superconductors, relevant publications were identified. As illustrated in the left panel of Fig. 4, the proportion of superconductors with T_c below 30 K remains consistent across various periods, suggesting that the discovery of new superconductors is largely stochastic. Additionally, the number of superconductors is inversely related to T_c, except for those with T_c < 2 K. The 1980s saw the advent of superconductors with T_c > 30 K, most notably with the 1986 discovery of cuprate superconductors³, which triggered a surge in high-temperature superconductor research in the years that followed.

The use of CIFs enables precise characterization of material properties via Pymatgen's structure tools, as shown in the right panel of Fig. 4. Among superconductors, the most prevalent crystal system is tetragonal, which appears in 453 distinct cases. This is followed closely by cubic symmetry in 439 cases, with the fewest occurrences noted for monoclinic (112 instances) and triclinic (27 instances). The tendency of superconductors to favor high-symmetry structures aligns with Matthias' hypothesis regarding the correlation between symmetry and superconductivity. However, for materials with T_c > 10 K, a significant decline in the proportion of cubic superconductors is observed, coinciding with a marked increase in orthorhombic superconductors, which exhibit lower symmetry.

For superconductors with T_c values greater than 40 K, the majority of unconventional superconductors that surpass the McMillan limit tend to have either tetragonal or orthorhombic symmetry. This shift suggests that structures with lower symmetry may play a key role in high-temperature superconductivity, especially in systems where conventional electron-phonon interactions are insufficient to explain the observed T_c.
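Crystal-system tallies such as those discussed above can be reproduced from the space group numbers stored with each entry. The sketch below uses the standard International Tables ranges for the seven crystal systems; the function names `crystal_system` and `count_crystal_systems` are hypothetical, and the space group numbers in the usage example are illustrative inputs rather than dataset contents.

```python
from collections import Counter
from typing import Dict, Iterable

# Standard space-group number ranges (inclusive) for the seven crystal systems.
CRYSTAL_SYSTEM_RANGES = [
    (1, 2, "triclinic"),
    (3, 15, "monoclinic"),
    (16, 74, "orthorhombic"),
    (75, 142, "tetragonal"),
    (143, 167, "trigonal"),
    (168, 194, "hexagonal"),
    (195, 230, "cubic"),
]


def crystal_system(space_group_number: int) -> str:
    """Return the crystal system for an international space group number."""
    for first, last, name in CRYSTAL_SYSTEM_RANGES:
        if first <= space_group_number <= last:
            return name
    raise ValueError("space group number must be between 1 and 230")


def count_crystal_systems(space_group_numbers: Iterable[int]) -> Dict[str, int]:
    """Tally how many entries fall into each crystal system."""
    return dict(Counter(crystal_system(n) for n in space_group_numbers))


if __name__ == "__main__":
    # 191 (P6/mmm), 47 (Pmmm), 139 (I4/mmm), 225 (Fm-3m) as example inputs.
    print(count_crystal_systems([191, 47, 139, 139, 225]))
```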

Technical Validation

During the collection of superconductor crystal structures, we made every effort to establish one-to-one correspondence between CIF and T_c. For superconductors whose original research papers provided information such as space group, lattice constants, or specific crystal structures, we ensured that the collected CIF strictly matched those specified in the publications. However, for doped superconductors, we could only employ the supercell expansion method mentioned previously to maintain maximum consistency in their chemical formulas. Our dataset was enhanced by incorporating superconductors reported after 2022 beyond the SuperCon database, thereby ensuring comprehensive coverage of materials. This completeness is demonstrated in Fig. 4, which presents statistics regarding the discovery timeline of superconductors.

Figure 5 presents the energy band data for three representative superconductors in SuperBand: the BCS superconductor MgB₂ (mp-763) with a hexagonal system¹¹, the cuprate superconductor YBa₂Cu₃O₇ (mp-22215) with an orthorhombic system²⁹, and the iron-based superconductor KFe₂Se₂ (mp-1070735) with a tetragonal system³⁷. The electronic band structures of these three materials exhibit excellent consistency with data from the MP database, demonstrating the accuracy of our DFT calculations.

For the technical validation and initial training of this dataset, we employed the 3D-Vision Transformer model^35,38 and compared the predicted T_c with the experimental values. We use a set of optimal hyperparameters P × Q × F × D = 18 × 8 × 8 × 8, Ld = 534, De = 0.127, Hd = 64, Dm = 0.197, Md = 1038, and Lt = 3 in the 3D-Vision Transformer model. We employ the mean squared error (MSE) between the predicted outputs and the rescaled log(T_c + 1) values of the training set as the loss function. In training, we use stochastic gradient descent (SGD) with a learning rate of 0.001, momentum of 0.9, weight decay of 10⁻⁵, and batch size of 32.
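The rescaled training target and the MSE loss described above can be sketched in a few lines of NumPy. The base of the logarithm is not stated in the text, so the natural logarithm is assumed here, and the helper names `rescale_tc`, `restore_tc`, and `mse_loss` are hypothetical. The authors' actual training code is the one in their repository.

```python
import numpy as np


def rescale_tc(tc_kelvin):
    """Map T_c (K) to log(T_c + 1); natural log assumed. T_c = 0 maps to 0."""
    return np.log1p(np.asarray(tc_kelvin, dtype=float))


def restore_tc(rescaled):
    """Invert rescale_tc, returning T_c in kelvin."""
    return np.expm1(np.asarray(rescaled, dtype=float))


def mse_loss(predicted_rescaled, tc_kelvin):
    """Mean squared error between model outputs and the rescaled targets."""
    target = rescale_tc(tc_kelvin)
    diff = np.asarray(predicted_rescaled, dtype=float) - target
    return float(np.mean(diff ** 2))


if __name__ == "__main__":
    tc = np.array([0.0, 39.0, 95.0])  # non-superconductor, and two example T_c values
    outputs = rescale_tc(tc) + 0.1    # hypothetical model outputs
    print(mse_loss(outputs, tc), restore_tc(outputs))
```

The logarithmic rescaling compresses the wide range of T_c values and maps non-superconducting materials (T_c = 0) to a target of exactly zero, which lets both classes share one regression target.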

The goodness of fit between the predicted and experimental T_c values is quantified using the coefficient of determination, R², defined by:

$$R^{2}=1-\frac{S_{\rm Res}}{S_{\rm Tot}}=1-\frac{\sum_{i}(T_{i}-\widehat{T}_{i})^{2}}{\sum_{i}(T_{i}-\bar{T})^{2}},\tag{1}$$

where T_i represents the experimental T_c values, ${\widehat{T}}_{i}$ denotes the corresponding predicted T_c values, and $\bar{T}$ is the average experimental T_c. The deep learning model's predictions, illustrated in Fig. 6, agree well with the experimental values, giving R² = 0.976. Our training code is provided in our GitHub repository (https://github.com/ljcj007/SuperBand)³⁹. As a preliminary demonstration, our dataset exhibits promising potential for application in neural network algorithms. Beyond the band structures we primarily used, the dataset encompasses diverse material properties, such as DOS and Fermi surface data, that can serve as comprehensive training features for machine learning models.
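Eq. (1) can be evaluated directly from paired arrays. The sketch below follows the standard convention for the coefficient of determination, taking T_i as the experimental values, T̂_i as the predictions, and T̄ as the experimental mean. The function name `r_squared` and the numbers in the usage example are hypothetical and unrelated to the reported R² = 0.976.

```python
import numpy as np


def r_squared(t_experimental, t_predicted):
    """Coefficient of determination, R^2 = 1 - S_Res / S_Tot."""
    t_exp = np.asarray(t_experimental, dtype=float)
    t_pred = np.asarray(t_predicted, dtype=float)
    if t_exp.shape != t_pred.shape:
        raise ValueError("inputs must have the same shape")
    # Residual sum of squares: experiment minus prediction.
    s_res = np.sum((t_exp - t_pred) ** 2)
    # Total sum of squares: experiment minus its own mean.
    s_tot = np.sum((t_exp - t_exp.mean()) ** 2)
    if s_tot == 0.0:
        raise ValueError("R^2 is undefined when all experimental values are equal")
    return float(1.0 - s_res / s_tot)


if __name__ == "__main__":
    # Hypothetical values for illustration only.
    print(r_squared([1.0, 2.0, 3.0, 4.0], [1.1, 1.9, 3.2, 3.8]))
```

A perfect prediction gives R² = 1, while predicting the experimental mean for every material gives R² = 0.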

Usage Notes

We publicly provide the full SuperBand dataset on Science Data Bank³⁶. The code used to generate figures, a tool for ingesting new data into this database, code for accessing and reading the HDF5 file, and a neural network model that can be trained on this dataset are provided in our GitHub repository³⁹. For ease of use, a CSV file containing superconductivity-related data is included in our GitHub repository, along with the corresponding CIFs for the crystal structures.

High-pressure hydride superconductors (e.g., LaH₁₀, ID 15969; YH₉, ID 18619) are excluded due to their reliance on extreme pressure conditions and the absence of ambient-pressure structural data. Their chemical formulas are listed in the CSV file in the GitHub repository³⁹ for reference.

While pairing mechanisms differ between conventional and unconventional superconductors, our dataset aims to provide a broad foundation for ML exploration. Subclass-specific models may yield higher accuracy. However, the inclusion of diverse materials enables cross-class feature discovery, which is critical for identifying universal trends. We encourage users to subset the data by material class for targeted analyses.

The current dataset does not account for pressure-dependent properties, which limits its applicability to high-pressure systems like hydrides. Future extensions will address this gap through targeted collaborations and advanced computational protocols.

Code availability

Code and data are available free of charge. We publicly provide the full SuperBand dataset on Science Data Bank³⁶. The code base is available in our GitHub repository³⁹, as noted above.

References

Center for Basic Research on Materials. Mdr supercon datasheet ver.240322. https://doi.org/10.48505/nims.4487 (2024).
Sommer, T., Willa, R., Schmalian, J. & Friederich, P. 3DSC - a dataset of superconductors including crystal structures. Scientific Data 10, 816, https://doi.org/10.1038/s41597-023-02721-y (2023).
Bednorz, J. G. & Müller, K. A. Possible high Tc superconductivity in the Ba-La-Cu-O system. Zeitschrift für Physik B Condensed Matter 64(2), 189–193, https://doi.org/10.1007/BF01303701 (1986).
Stewart, G. R. Superconductivity in iron compounds. Reviews of Modern Physics 83(4), 1589–1652, https://doi.org/10.1103/RevModPhys.83.1589 (2011).
Sun, H. et al. Signatures of superconductivity near 80 K in a nickelate under high pressure. Nature 621(7979), 493–498, https://doi.org/10.1038/s41586-023-06408-7 (2023).
Martin, R. M. Electronic Structure: Basic Theory and Practical Methods https://doi.org/10.1017/CBO9780511805769 (Cambridge University Press, Cambridge, 2004).
Lüders, M. et al. Ab initio theory of superconductivity. I. Density functional formalism and approximate functionals. Physical Review B 72(2), 024545, https://doi.org/10.1103/PhysRevB.72.024545 (2005).
Giustino, F. Electron-phonon interactions from first principles. Reviews of Modern Physics 89(1), 015003, https://doi.org/10.1103/RevModPhys.89.015003 (2017).
Aichhorn, M., Biermann, S., Miyake, T., Georges, A. & Imada, M. Theoretical evidence for strong correlations and incoherent metallic state in FeSe. Physical Review B 82(6), 064504, https://doi.org/10.1103/PhysRevB.82.064504 (2010).
Graser, S. et al. Spin fluctuations and superconductivity in a three-dimensional tight-binding model for BaFe₂As₂. Physical Review B 81(21), 214503, https://doi.org/10.1103/PhysRevB.81.214503 (2010).
Bohnen, K. P., Heid, R. & Renker, B. Phonon Dispersion and Electron-Phonon Coupling in MgB₂ and AlB₂. Physical Review Letters 86(25), 5771–5774, https://doi.org/10.1103/PhysRevLett.86.5771 (2001).
Xie, S. R. et al. Machine learning of superconducting critical temperature from Eliashberg theory. npj Computational Materials 8(1), 1–8, https://doi.org/10.1038/s41524-021-00666-7 (2022).
Cerqueira, T. F. T. et al. Searching materials space for hydride superconductors at ambient pressure. Advanced Functional Materials 34, 2404043, https://doi.org/10.1002/adfm.202404043 (2024).
Saha, S. et al. Mapping superconductivity in high-pressure hydrides: The Superhydra project. Physical Review Materials 7(5), 054806, https://doi.org/10.1103/PhysRevMaterials.7.054806 (2023).
Li, J. & Yao, D. X. Superconductivity in octagraphene. Chinese Physics B 31(1), 017403, https://doi.org/10.1088/1674-1056/ac40fa (2022).
Si, C., Liu, Z., Duan, W. & Liu, F. First-Principles Calculations on the Effect of Doping and Biaxial Tensile Strain on Electron-Phonon Coupling in Graphene. Physical Review Letters 111(19), 196802, https://doi.org/10.1103/PhysRevLett.111.196802 (2013).
Mankowsky, R. et al. Nonlinear lattice dynamics as a basis for enhanced superconductivity in YBa₂Cu₃O_6.5. Nature 516(7529), 71–73, https://doi.org/10.1038/nature13875 (2014).
Cao, C., Hirschfeld, P. J. & Cheng, H. P. Proximity of antiferromagnetism and superconductivity in LaFeAsO_1−xF_x: Effective Hamiltonian from ab initio studies. Physical Review B 77(22), 220506, https://doi.org/10.1103/PhysRevB.77.220506 (2008).
Samokhin, K. V., Zijlstra, E. S. & Bose, S. K. CePt₃Si: An unconventional superconductor without inversion center. Physical Review B 69(9), 094514, https://doi.org/10.1103/PhysRevB.69.094514 (2004).
Carr, S., Fang, S., Jarillo-Herrero, P. & Kaxiras, E. Pressure dependence of the magic twist angle in graphene superlattices. Physical Review B 98(8), 085144, https://doi.org/10.1103/PhysRevB.98.085144 (2018).
Luo, Z., Hu, X., Wang, M., Wu, W. & Yao, D. X. Bilayer Two-Orbital Model of La₃Ni₂O₇ under Pressure. Physical Review Letters 131(12), 126001, https://doi.org/10.1103/PhysRevLett.131.126001 (2023).
Liu, F., Liu, C. C., Wu, K., Yang, F. & Yao, Y. $d+id'$ chiral superconductivity in bilayer silicene. Physical Review Letters 111(6), 066804, https://doi.org/10.1103/physrevlett.111.066804 (2013).
Li, J., Jin, S., Yang, F. & Yao, D. X. Electronic structure, magnetism, and high-temperature superconductivity in multilayer octagraphene and octagraphite. Physical Review B 102(17), 174509, https://doi.org/10.1103/PhysRevB.102.174509 (2020).
Ye, J., Li, J., Zhong, D. & Yao, D. X. Possible Superconductivity in Biphenylene. Chinese Physics Letters 40(7), 077401, https://doi.org/10.1088/0256-307X/40/7/077401 (2023).
Carleo, G. et al. Machine learning and the physical sciences. Reviews of Modern Physics 91(4), 045002, https://doi.org/10.1103/RevModPhys.91.045002 (2019).
Jain, A. et al. Commentary: The Materials Project: A materials genome approach to accelerating materials innovation. APL Materials 1, 011002, https://doi.org/10.1063/1.4812323 (2013).
Kirklin, S. et al. The Open Quantum Materials Database (OQMD): assessing the accuracy of DFT formation energies. npj Computational Materials 1, 15010, https://doi.org/10.1038/npjcompumats.2015.10 (2015).
Ong, S. P. et al. The Materials Application Programming Interface (API): A simple, flexible and efficient API for materials data based on REpresentational State Transfer (REST) principles. Computational Materials Science 97, 209–215, https://doi.org/10.1016/j.commatsci.2014.10.037 (2015).
Collocott, S. J., Driver, R., Welsh, H. K. & Andrikidis, C. The heat capacity of YBa₂Cu₃O₇ and YBa₂Cu₃O₆ in the range 0.4 to 20 K: Evidence for an intrinsic T-term. Physica C: Superconductivity 152(5), 401–407, https://doi.org/10.1016/0921-4534(88)90044-5 (1988).
Kresse, G. & Hafner, J. Ab initio molecular dynamics for liquid metals. Physical Review. B, Condensed Matter 47(1), 558–561, https://doi.org/10.1103/physrevb.47.558 (1993).
Mathew, K. et al. Atomate: A high-level interface to generate, execute, and analyze computational materials science workflows. Computational Materials Science 139, 140–152, https://doi.org/10.1016/j.commatsci.2017.07.030 (2017).
Jain, A. et al. A high-throughput infrastructure for density functional theory calculations. Computational Materials Science 50(8), 2295–2310, https://doi.org/10.1016/j.commatsci.2011.02.023 (2011).
Jain, A. et al. FireWorks: a dynamic workflow system designed for high-throughput applications. Concurrency and Computation: Practice and Experience 27(17), 5037–5059, https://doi.org/10.1002/cpe.3505 (2015).
Ganose, A. M., Searle, A., Jain, A. & Griffin, S. M. IFermi: A python library for Fermi surface generation and analysis. Journal of Open Source Software 6(59), 3089, https://doi.org/10.21105/joss.03089 (2021).
Li, J. et al. A deep learning approach to search for superconductors from electronic bands. arXiv preprint arXiv:2409.07721 https://doi.org/10.48550/arXiv.2409.07721 (2024).
Li, J., Zhang, T. & Suo, C. SuperBand: Superconductor’s energy band. Science Data Bank https://doi.org/10.57760/sciencedb.16728 (2024).
Ying, T. P. et al. Observation of superconductivity at 30–46 K in A_xFe₂Se₂ (A = Li, Na, Ba, Sr, Ca, Yb and Eu). Scientific Reports 2(1), 426, https://doi.org/10.1038/srep00426 (2012).
Dosovitskiy, A. et al. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 https://doi.org/10.48550/arXiv.2010.11929 (2020).
Li, J. SuperBand. https://github.com/ljcj007/SuperBand (2024).

Acknowledgements

This work is supported by the National Natural Science Foundation of China (Grants No. 12204400, No. 12494591, No. 92165204), the Natural Science Foundation of Hebei Province (Grants No. A2022203010, No. A2024203011), the Innovation Capability Improvement Project of Hebei Province (Grant No. 22567605H), the National Key R&D Program of China (Grant No. 2022YFA1403301), the Guangdong Fundamental Research Center for Magnetoelectric Physics, and the Guangdong Provincial Quantum Science Strategic Initiative (Grant No. GDZX2401010).

Author information

Authors and Affiliations

State Key Laboratory of Metastable Materials Science and Technology, Hebei Key Laboratory of Microstructural Material Physics, School of Science, Yanshan University, Qinhuangdao, 066004, China
Tengdong Zhang, Chenyu Suo, Yanling Wu, Xiaodan Xu, Yong Liu & Jun Li
State Key Laboratory of Optoelectronic Materials and Technologies, Guangdong Provincial Key Laboratory of Magnetoelectric Physics and Devices, School of Physics, Sun Yat-Sen University, Guangzhou, 510275, China
Dao-Xin Yao

Contributions

J. Li and T. Zhang wrote the main draft. T. Zhang and C. Suo performed the literature search and data cleaning. Y. Wu and X. Xu prepared the figures. J. Li and Y. Liu performed the DFT calculations. J. Li and D.-X. Yao guided the workflow, tools, and analysis. All authors participated in the discussions and revised the manuscript.

Corresponding authors

Correspondence to Yong Liu, Dao-Xin Yao or Jun Li.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

About this article

Cite this article

Zhang, T., Suo, C., Wu, Y. et al. SuperBand: an Electronic-band and Fermi surface structure database of superconductors. Sci Data 12, 744 (2025). https://doi.org/10.1038/s41597-025-05015-7

Received: 31 October 2024
Accepted: 15 April 2025
Published: 06 May 2025
Version of record: 06 May 2025
DOI: https://doi.org/10.1038/s41597-025-05015-7
