Fig. 5: Comparison of chemical space and data coverage by the original and production models.

a Principal Component Analysis (PCA) plot. The PCA plot demonstrates an expanded coverage of chemical space by both the original and production models. The orange and blue dots correspond to the coverage of the original and production model, respectively, while the grey dots represent the 13,000 known polymers in our database. b Dataset comparison. A comparison between the original and production models reveals an incorporation of diverse data types. The production model integrates experimental and simulation data for permeability, diffusivity, and solubility properties.