Fig. 4: Number of benchmark-related predictions by Word2Vec and BERT model in 20 benchmark alternative categories.

In each benchmark category, the benchmark-related results of Word2Vec model (in black) and BERT model (in red) were counted. The BERT model herein is a summation of the six filling mask models.