Table 3 Summarizes the architecture design for the different findings.

From: Critical evaluation of artificial intelligence as a digital twin of pathologists for prostate cancer pathology

Architecture
Finding	Block type	Depth	Width	Junction	Short path	C₁	K₁	Apply crop center on input (256 × 256)	Initial filter factor per path	Global pooling	No. of Ch. for 1^st FC layer	Output function (no. of cat.)	Parameter capacity	Supervised contrastive learning⁵³	Transformer⁴⁸ (no. of blocks. header number)
Prostate Cancer	Inception	7	2	3	1	32	5 × 5	No	2	Max	96	Softmax (2)	178,342	No	No
Gleason pattern 3	Inception	4	2	3	1	32	5 × 5	No	2	Max	60	Softmax (2)	58,879	No	No
Gleason pattern 3	Inception	4	2	3	1	32	5 × 5	No	2	Max	60	Softmax (2)	58,879	No	No
Gleason pattern 4	Inception⁺	4	2	3	1	6	5 × 5	No	2	Avg	60	Sigmoid (1)	49,122	Yes	No
	Inception⁺	4	2	3	1	6	5 × 5	No	2	Avg	60	Sigmoid (1)	49,122	Yes	No
	Inception⁺	4	2	3	1	32	5 × 5	No	2	Avg	60	Softmax (2)	50,201	No	No
	Soft attention	6	2	3	1	8	3 × 3	Yes	2	Max	20	Softmax (2)	51,732	No	No
	Soft attention	6	2	3	1	8	3 × 3	Yes	2	Max	20	Softmax (2)	51,732	No	No
ResNet	5	2	3	1	16	5 × 5	No	4	Avg	44	Softmax (2)	206,638	No	No
Gleason pattern 5	Soft attention	5	2	3	1	16	5 × 5	No	2	Avg	24	Softmax (2)	48,590	No	No
Gleason pattern 5	Soft attention	5	2	3	1	16	5 × 5	No	2	Avg	24	Softmax (2)	48,590	No	No
Ductal morphology	Inception	4	2	3	1	16	5 × 5	No	2	Max	60	Softmax (2)	53,683	No	No
Cribriform pattern	Inception	3	2	3	1	32	5 × 5	No	2	Max	48	Softmax (2)	34,033	No	No
HGPIN	Inception	5	4	2	2	16	5 × 5	No	4	Max	160	Softmax (2)	412,322	No	No
Vessel	Inception⁺	5	2	3	1	16	5 × 5	No	2	Max	72	Softmax (2)	183,448	No	Yes (3,4)
Nerve	Inception⁺	5	2	3	1	16	5 × 5	No	2	Max	72	Softmax (2)	171,997	No	Yes (3,4)
Inflammatory cell infiltration	Inception⁺	5	2	3	1	16	5 × 5	No	2	Max	72	Softmax (2)	171,277	No	Yes (3,4)
Total											1012		1,878,587

By combining all the models listed here, the novel architecture design achieved a total parameter capacity markedly lower than the parameter capacities of a single ResNet-18 model (~ 11 million) or the 2nd version of a single MobileNet model (~ 2.0 million parameters)⁵⁰. These models combined a total of 1,012 features in the fully connected layers (in comparison, a single RestNet-18 model had 512 features in the fully connected layers). Given the compactness of our models, we could assign all 17 models to a single GPU card (VRAM 24 GB) for the pathology report generation task. For each Gleason pattern, we applied an ensemble model that weighted the predictions of the models equally. The weighted prediction scores ranged between 0 and 1. Note: The variation in the parameter capcity despite having the same hyperparameter configuration is due to the variation in the block number (depth) of the short path. A number of models with different configurations were tested to achieve the final model configuration via the grid search and trial-and-error approaches.
Ch. Channel, conv. Convolution, C₁ channel number of the first convolution layer, K₁ kernel size of the first convolution layer, act. activation, no. number, cat. Category, FC fully connected; + Layer normalization⁵¹ is applied instead of batch normalization⁵². Transfomer blocks were added prior to global pooling; since the output of the convolutional layer was batch size × height × width × channel, we reshaped the output to batch size × height ∗ width × channel before feeding into the transformer block; the output from the transformer⁴⁸ was reshaped back to the dimension of the convolutional layer before global pooling was applied. For all the models, a patch dimension of 512 × 512 × 3 is applied.
Significant values are in bold.

Back to article page

Table 3 Summarizes the architecture design for the different findings.

Search

Quick links