Fig. 1
From: mdCATH: A Large-Scale MD Dataset for Data-Driven Computational Biophysics

Exclusion criteria and the number of domains remaining after each step, starting from the 14,433 domains in the S20 homology set of CATH release 4.2.0 and ending with the 5,398 domains included in the mdCATH dataset presented in this work.