Fig. 1: Overview of the data retrieval for this study. | Nature Communications

Fig. 1: Overview of the data retrieval for this study.

From: Scientific literature on carbon dioxide removal revealed as much larger through AI-enhanced systematic mapping

Fig. 1

Squares symbolise documents, a coloured square a document with labels, either assigned by hand (solid colour) or automatically (faded colour). Red documents are excluded, blue ones included. Step 1: 70,000 documents were retrieved from databases using search queries. Step 2: Of these about 6000 documents are sorted (=coded) by hand into being on CDR (relevant, blue squares) or being not on CDR (irrelevant, red squares). Documents on CDR were additionally described with CDR options, see Fig. 2, and other categories. Steps 3 and 4: The relevance labels and additional categories were used to train machine learning classifiers. Step 5: The trained classifiers were used to extend all labels to the unseen ~64,000 documents. Detailed information on methods can be found in the “Method” Section and the Supplementary Methods 3 and 4.

Back to article page