Table 2 Comparison of DFT calculated datasets that can be used to train the neural network potential.

From: Towards universal neural network potential for material discovery applicable to arbitrary combination of 45 elements

Dataset

Systems

Structures

# of

 

Molecule

Bulk

Cluster

Slab

Adsorption

Disorder

Opt.

Vib.

MD

TS

Elements

Data

Materials Project18

 

✓

 

✓

  

✓

   

Unlimited

> 1 × 105

OQMD53

 

✓

    

✓

   

Unlimited

8 × 105

NOMAD54

 

✓

    

✓

   

Unlimited

> 5 × 107

Jarvis-DFT55

 

✓

    

✓

   

Unlimited

> 4 × 105

AFLOW56

 

✓

✓

   

✓

   

Unlimited

> 3 × 106(*1)

QM912,13

✓

     

✓

   

5

1 × 105

PubChemQC57

✓

     

✓

   

30

> 3 × 106(*2)

MD1758

✓

      

✓

  

4

9 × 106

SN2 reactions59

✓

     

✓

 

✓

✓

6

4 × 105

ANI-114

✓

     

✓

✓

✓

 

5

2 × 107

ANI-2x15

✓

     

✓

✓

✓

 

7

9 × 106

COMP6v215

✓

     

✓

✓

✓

 

7

2 × 105

tensor-mol 0.1 water51

✓

       

✓

 

2

4 × 105

tensor-mol 0.1 spider51

✓

       

✓

 

4

3 × 106

TeaNet60

✓

    

✓

  

✓

 

18

3 × 105

OC2019,20

    

✓

 

✓

✓

✓

 

56(*3)

1 × 108

PFP molecular dataset (ours)

✓

     

✓

✓

✓

 

9

6 × 106

PFP crystal dataset (ours)

✓

✓

✓

✓

✓

✓

✓

✓

✓

 

45

3 × 106

  1. (*1): The number is checked on May 24, 2021. (*2): The number is taken from 57, and is updated weekly. (*3): The number was checked using only the training dataset of version 1.