Table 3 Technical details of GatorTron models.
Model | # Layers | # Hidden size | # Attention heads | # Parameters |
---|---|---|---|---|
GatorTron-base | 24 | 1024 | 16 | 345 million |
GatorTron-medium | 48 | 2560 | 40 | 3.9 billion |
GatorTron-large | 56 | 3584 | 56 | 8.9 billion |
Model | # Layers | # Hidden size | # Attention heads | # Parameters |
---|---|---|---|---|
GatorTron-base | 24 | 1024 | 16 | 345 million |
GatorTron-medium | 48 | 2560 | 40 | 3.9 billion |
GatorTron-large | 56 | 3584 | 56 | 8.9 billion |