Fig. 1

Extraction pipeline and example. Top panel: Schematic representation of the standard text mining pipeline: (i) scrape papers in markup format from the major publishers; (ii) identify and classify synthesis sections; (iii) extract key information including materials, amounts, sequenced operations, and conditions; (iv) store synthesis procedures into the database for future data mining. Bottom panel: Example of a codified procedure extracted from a synthesis paragraph.