Fig. 4: Illustration of the CANN model.
From: Faithful novel machine learning for predicting quantum properties

Global data is simply appended to each token atom, and successive layers of attention and simply 3-layer feed-forward networks are applied in succession.