Fig. 1: The workflow of this work.

In this work, we generated 10 billion reactions using RDChiral [38] and used them to pre-train a model based on the Llama2 [33] architecture, strengthening its acquisition of chemical reaction knowledge. We then applied reinforcement learning from artificial intelligence feedback (RLAIF) so that the model captures the relationships among products, reactants, and templates, which allows RSGPT to operate as a template-free model.
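
To make the feedback step more concrete, the sketch below shows one way a consistency signal among product, reactants, and template could be computed. It is an illustration under stated assumptions, not the reward used in this work: the function names `reactant_set`, `consistency_reward`, and `toy_apply_template` are hypothetical, the binary 0/1 reward is an assumption, and the template engine is replaced by a lookup table so that the example runs without any cheminformatics dependency.

```python
from typing import Callable, FrozenSet, List


def reactant_set(smiles: str) -> FrozenSet[str]:
    """Split a dot-separated SMILES string into an order-independent set.

    Assumption: plain string comparison stands in for real SMILES
    canonicalization, which a cheminformatics toolkit would provide.
    """
    return frozenset(part for part in smiles.split(".") if part)


def consistency_reward(
    product: str,
    predicted_reactants: str,
    predicted_template: str,
    apply_template: Callable[[str, str], List[str]],
) -> float:
    """Return 1.0 if applying the predicted template to the product
    reproduces the predicted reactants, and 0.0 otherwise.

    `apply_template` is a hypothetical callable that takes
    (template, product) and returns candidate reactant strings.
    """
    try:
        outcomes = apply_template(predicted_template, product)
    except Exception:
        # An unusable template earns no reward.
        return 0.0
    target = reactant_set(predicted_reactants)
    return 1.0 if any(reactant_set(o) == target for o in outcomes) else 0.0


def toy_apply_template(template: str, product: str) -> List[str]:
    """Toy stand-in for a template engine: a fixed lookup table."""
    table = {("T_ester", "CC(=O)OC"): ["CC(=O)O.CO"]}
    return table.get((template, product), [])


# Reactant order does not matter; a mismatched prediction scores zero.
consistent = consistency_reward("CC(=O)OC", "CO.CC(=O)O", "T_ester", toy_apply_template)
inconsistent = consistency_reward("CC(=O)OC", "CCO.CC(=O)O", "T_ester", toy_apply_template)
```

In practice, the lookup table would be replaced by a real template-application tool, and the reactant strings would be canonicalized before comparison.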