Deep learning models have been trained on The Cancer Genome Atlas to predict numerous features directly from histology, including survival, gene expression patterns, and driver mutations. Here, the authors demonstrate that site-specific histologic signatures can lead to biased estimates of accuracy for such models, and propose a method to minimize such bias.
- Frederick M. Howard
- James Dolezal
- Alexander T. Pearson