Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Transferable enantioselectivity models from sparse data

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Abstract

Identifying a catalyst class to optimize the enantioselectivity of a new reaction, either involving a different combination of known substrate types or an entirely unfamiliar class of compounds, is a formidable challenge. Statistical models trained on a reported set of reactions can help predict out-of-sample transformations1–5 but often face two challenges: (1) only sparse data are available i.e., limited information on catalyst–substrate interactions, and (2) simple stereoelectronic parameters may fail to describe mechanistically complex transformations.6,7 Here we report a descriptor generation strategy that accounts for changes in the enantiodetermining step with catalyst or substrate identity, allowing us to model reactions involving distinct ligand and substrate types. As validating case studies, we collected data on enantioselective nickel-catalyzed C(sp3)-couplings8 and trained statistical models with features extracted from the transition states and intermediates proposed to be involved in asymmetric induction. These models allow for the optimization of poorly performing examples reported in a substrate scope and are applicable to unseen ligands and reaction partners. This approach offers the opportunity to streamline catalyst and reaction development, quantitatively transferring knowledge learned on sparse data to novel chemical spaces.

This is a preview of subscription content, access via your institution

Access options

Rent or buy this article

Prices vary by article type

from$1.95

to$39.95

Prices may be subject to local taxes which are calculated during checkout

Similar content being viewed by others

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Abigail G. Doyle or Matthew S. Sigman.

Supplementary information

Supplementary Information

This file contains Supplementary Information, including the following eight sections: 1. General Information; 2. Computational Workflow and Benchmarking; 3. Case Study 1; 4. Case Study 2; 5. Case Study 3; 6. Additional Synthesis and Characterization of Ligands; 7. NMR Spectra; and 8. References.

Peer Review File

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gallarati, S., Bucci, E.M., Doyle, A.G. et al. Transferable enantioselectivity models from sparse data. Nature (2026). https://doi.org/10.1038/s41586-026-10239-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1038/s41586-026-10239-7

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing