Table 1 Molecular descriptor statistics for the QM9 and AICures molecules in the GEOM dataset.
From: GEOM, energy-annotated molecular conformations for property prediction and molecular generation
AICures drug dataset (N = 304,466) | |||
---|---|---|---|
Mean | Standard deviation | Maximum | |
Number of atoms | 44.4 | 11.3 | 181 |
Number of heavy atoms | 24.9 | 5.7 | 91 |
Molecular weight (amu) | 355.4 | 80.4 | 1549.7 |
Number of rotatable bonds | 6.5 | 3.0 | 53 |
Stereochemistry (specified) | 45,712 | — | — |
Stereochemistry (all) | 83,326 | — | — |
QM9 dataset (N = 133,258) | |||
Mean | Standard deviation | Maximum | |
Number of atoms | 18.0 | 3.0 | 29 |
Number of heavy atoms | 8.8 | 0.51 | 9 |
Molecular weight (amu) | 122.7 | 7.6 | 152.0 |
Number of rotatable bonds | 2.2 | 1.6 | 8 |
Stereochemistry (specified) | 95,734 | — | — |
Stereochemistry (all) | 95,734 | — | — |