Table 5 Estimated training time per epoch.
From: A deep sentiment model combining ALBERT-driven context and EHO-optimized architecture
Dataset | Multihead (4 heads) | Multihead (8 heads) | Proposed |
|---|---|---|---|
SST-5 | 410 | 643 | 472 |
366 | 457 | 388 | |
Laptop14 | 417 | 600 | 473 |
Restaurant14 | 819 | 1082 | 886 |
Restaurant15 | 822 | 1116 | 893 |
Restaurant16 | 822 | 1120 | 907 |