Table 5 Comparison of retrosynthesis recently published methods for retrosynthesis prediction on USPTO-50 k.

From: State-of-the-art augmented NLP transformer models for direct and single-step retrosynthesis

Model

Top-1

Top-2

Top-5

Top-10

Ref. #

Comments

Seq2Seq

37.4

 

57.0

61.7

12

40/5/5 split; splitting any reactions with multiple products into multiple single product and removal of trivial products

Transformer (3*6)

42.7

52.5

69.8

13

45/5 split: no validation set was used

Transformer (6*8), (self corrected)

43.7

 

65.2

68.7

19

40/5/5 split, reagents from reactants are removed

Transformer, augmentation

44.8

57.1

57.7

79.4

32

Same as in ref. 12.

Similarity-based

37.3

 

63.3

74.1

20

Same as in ref. 12.

Graph Logic Network

52.5

 

75.6

83.7

24

Same as in refs. 12,19.

G2Gs

48.9

 

72.5

75.5

25

Same as in ref. 12.

ATa

53.5

69.4

81

85.7

 

Same as in ref. 13.

AT

53.2

68.1

80.5

85.2

 

Only 40 k samples were used as training set to match the other results

AT MaxFragb

58.5

73

85.4

90

 

Same as in ref. 13.

AT MaxFrag

58

73.4

84.8

89.1

 

Only 40 k samples were used as training set to match the other results

  1. aThe results of the reference model applied to x100 augmented dataset using beam size = 10.
  2. bThe classical retro-synthesis accuracy was estimated as accuracy for prediction of the largest fragment (MaxFrag).