Extended Data Fig. 1: BERT has a moral direction.
From: Large pre-trained language models contain human-like biases of what is right and wrong to do

The direction is defined by a PCA computed on BERT-based sentence embeddings. The top PC, the moral direction m, is dividing the x axis into Dos and Don’ts. The displayed verbs were used to compute the PCA.