Fig. 8: Histogram of PL exponents and Log Alpha Norm for weight matrices from models of different sizes in the GPT2 architecture series. | Nature Communications

Fig. 8: Histogram of PL exponents and Log Alpha Norm for weight matrices from models of different sizes in the GPT2 architecture series.

From: Predicting trends in the quality of state-of-the-art neural networks without access to training or testing data

Search

Advanced search

Quick links

Explore articles by subject
Find a job
Guide to authors
Editorial policies