Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Single-cell image-based screens identify host regulators of Ebola virus infection dynamics

Abstract

Filoviruses such as Ebola virus (EBOV) give rise to frequent epidemics with high case fatality rates while therapeutic options remain limited. Earlier genetic screens aimed to identify potential drug targets for EBOV relied on systems that may not fully recapitulate the virus life cycle. Here we applied an image-based genome-wide CRISPR screen to identify 998 host regulators of EBOV infection in 39,085,093 cells. A deep learning model associated each host factor with a distinct viral replication step. From this we confirmed UQCRB as a post-entry regulator of EBOV RNA replication and show that small-molecule UQCRB inhibition reduced virus infection in vitro. Using a random forest model, we found that perturbations on STRAP (a spliceosome-associated factor) disrupted the equilibrium between viral RNA and protein. STRAP was associated with VP35, a viral RNA processing protein. This genome-wide screen coupled with 12 secondary screens including validation experiments with Sudan and Marburg virus, presents a rich resource for host regulators of virus replication and potential targets for therapeutic intervention.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Genome-wide optical pooled screening reveals regulators of multiple responses to Ebola virus infection.
Fig. 2: A deep neural network model reveals regulators of Ebola virus VP35 protein subcellular localization.
Fig. 3: Phenotypic profile clustering and matching on infection level reveal relationships between Ebola virus infection modulators.
Fig. 4: Targeted follow-up screens identify concordant and cell- and virus-specific Ebola virus regulators.
Fig. 5: STRAP KO impacts virus RNA expression levels and virus infectivity.

Similar content being viewed by others

Data availability

Raw screen images and single-cell features are available in their entirety on Google Cloud Storage at https://console.cloud.google.com/storage/browser/opspublic-east1/EBOVOpticalPooledScreen. A curated subset of raw images from the genome-wide screen in HeLa cells (https://doi.org/10.7910/DVN/YHVWXY), and the targeted screens in HeLa and Huh7 cells at 16 h post infection (https://doi.org/10.7910/DVN/6FQNUA) and at 24 h post infection (https://doi.org/10.7910/DVN/W9WVHG) are available at Harvard’s Dataverse37,38,39. Additional tables are available on Zenodo at https://doi.org/10.5281/zenodo.14741479 (ref. 40). Source data are provided with this paper.

Code availability

Code is available on Zenodo at https://zenodo.org/records/15725070 (ref. 41).

References

  1. Hoenen, T., Groseth, A. & Feldmann, H. Therapeutic strategies to target the Ebola virus life cycle. Nat. Rev. Microbiol. 17, 593–606 (2019).

    Article  CAS  PubMed  Google Scholar 

  2. Hoenen, T. et al. Inclusion bodies are a site of Ebolavirus replication. J. Virol. 86, 11779–11788 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Nanbo, A., Watanabe, S., Halfmann, P. & Kawaoka, Y. The spatio-temporal distribution dynamics of Ebola virus proteins and RNA in infected cells. Sci. Rep. 3, 1206 (2013).

    Article  PubMed  PubMed Central  Google Scholar 

  4. Bruchez, A. et al. MHC class II transactivator CIITA induces cell resistance to Ebola virus and SARS-like coronaviruses. Science 370, 241–247 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Carette, J. E. et al. Ebola virus entry requires the cholesterol transporter Niemann-Pick C1. Nature 477, 340–343 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Cheng, H. et al. A parallel genome-wide RNAi screening strategy to identify host proteins important for entry of Marburg virus and H5N1 influenza virus. Virol. J. 12, 194 (2015).

    Article  PubMed  PubMed Central  Google Scholar 

  7. Martin, S. et al. A genome-wide siRNA screen identifies a druggable host pathway essential for the Ebola virus life cycle. Genome Med. 10, 58 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  8. Feldman, D. et al. Optical pooled screens in human cells. Cell 179, 787–799.e17 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Carlson, R. J., Leiken, M. D., Guna, A., Hacohen, N. & Blainey, P. C. A genome-wide optical pooled screen reveals regulators of cellular antiviral responses. Proc. Natl Acad. Sci. USA 120, e2210623120 (2023).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Funk, L. et al. The phenotypic landscape of essential human genes. Cell 185, 4634–4653.e22 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Galão, R. P. et al. TRIM25 and ZAP target the Ebola virus ribonucleoprotein complex to mediate interferon-induced restriction. PLoS Pathog. 18, e1010530 (2022).

    Article  PubMed  PubMed Central  Google Scholar 

  12. Hölzer, M. et al. Differential transcriptional responses to Ebola and Marburg virus infection in bat and human cells. Sci. Rep. 6, 34589 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  13. Wynne, J. W. et al. Comparative transcriptomics highlights the role of the activator protein 1 transcription factor in the host response to Ebolavirus. J. Virol. 91, e01174-17 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Filone, C. M., Dower, K., Cowley, G. S., Hensley, L. E. & Connor, J. H. Probing the virus host interaction in high containment: an approach using pooled short hairpin RNA. Assay Drug Dev. Technol. 13, 34–43 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Flint, M. et al. A genome-wide CRISPR screen identifies N-acetylglucosamine-1-phosphate transferase as a potential antiviral target for Ebola virus. Nat. Commun. 10, 285 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  16. Gong, M. et al. Genome-wide CRISPR/Cas9 screen identifies SLC39A9 and PIK3C3 as crucial entry factors for Ebola virus infection. PLoS Pathog. 20, e1012444 (2024).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Fang, J. et al. Functional interactomes of the Ebola virus polymerase identified by proximity proteomics in the context of viral replication. Cell Rep. 38, 110544 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Bray, M.-A. et al. Cell Painting, a high-content image-based assay for morphological profiling using multiplexed fluorescent dyes. Nat. Protoc. 11, 1757–1774 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Baldi, P. Autoencoders, unsupervised learning, and deep architectures. In Proc. ICML Workshop on Unsupervised and Transfer Learning 37–49 (JMLR, 2012).

  20. Moon, K. R. et al. Visualizing structure and transitions in high-dimensional biological data. Nat. Biotechnol. 37, 1482–1492 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Luthra, P. et al. Inhibiting pyrimidine biosynthesis impairs Ebola virus replication through depletion of nucleoside pools and activation of innate immune responses. Antiviral Res. 158, 288–302 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Batra, J. et al. Protein interaction mapping identifies RBBP6 as a negative regulator of Ebola virus replication. Cell 175, 1917–1930.e13 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Woolsey, C. et al. A VP35 mutant Ebola virus lacks virulence but can elicit protective immunity to wild-type virus challenge. Cell Rep. 28, 3032–3046.e6 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Jung, H. J. et al. Terpestacin inhibits tumor angiogenesis by targeting UQCRB of mitochondrial complex III and suppressing hypoxia-induced reactive oxygen species production and cellular oxygen sensing. J. Biol. Chem. 285, 11584–11595 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Traag, V. A., Waltman, L. & van Eck, N. J. From Louvain to Leiden: guaranteeing well-connected communities. Sci. Rep. 9, 5233 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Söderberg, O. et al. Direct observation of individual endogenous protein complexes in situ by proximity ligation. Nat. Methods 3, 995–1000 (2006).

    Article  PubMed  Google Scholar 

  27. Replogle, J. M. et al. Mapping information-rich genotype-phenotype landscapes with genome-scale Perturb-seq. Cell 185, 2559–2575.e28 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. Gentili, M. et al. Classification and functional characterization of regulators of intracellular STING trafficking identified by genome-wide optical pooled screening. Cell Syst. 15, 1264–1277.e8 (2024).

    Article  CAS  PubMed  Google Scholar 

  29. Feldman, D. et al. Pooled genetic perturbation screens with image-based phenotypes. Nat. Protoc. 17, 476–512 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Ramezani, M. et al. A genome-wide atlas of human cell morphology. Nat. Methods 22, 621–633 (2025).

  31. Köster, J. & Rahmann, S. Snakemake—a scalable bioinformatics workflow engine. Bioinformatics 28, 2520–2522 (2012).

    Article  PubMed  Google Scholar 

  32. Singh, S., Bray, M.‐A., Jones, T. R. & Carpenter, A. E. Pipeline for illumination correction of images for high-throughput microscopy. J. Microsc. 256, 231–236 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).

    Google Scholar 

  34. Kuleshov, M. V. et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res. 44, W90–W97 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Fang, Z., Liu, X. & Peltz, G. GSEApy: a comprehensive package for performing gene set enrichment analysis in Python. Bioinformatics 39, btac757 (2023).

    Article  CAS  PubMed  Google Scholar 

  36. Ronneberger, O., Fischer, P. & Brox, T. U-Net: convolutional networks for biomedical image segmentation. Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Proc. Part III 18 (pp. 234–241) (Springer International Publishing, 2015).

  37. Carlson, R. Ebola virus optical pooled screen | genome-wide screen images. Harvard Dataverse 10.7910/DVN/YHVWXY (2025).

  38. Carlson, R. Ebola virus optical pooled screen | secondary screen early timepoint (16 hours) images. Harvard Dataverse 10.7910/DVN/6FQNUA (2025).

  39. Carlson, R. Ebola virus optical pooled screen | secondary screen late timepoint (24 hours) images. Harvard Dataverse 10.7910/DVN/W9WVHG (2025).

  40. Carlson, R. Additional data for single-cell image-based screens identify host regulators of Ebola virus infection dynamics [Data set]. Zenodo https://doi.org/10.5281/zenodo.14741479 (2025).

  41. Stefanakis, G. & Carlson R. beccajcarlson/EBOVOpticalPooledScreen: v01. Zenodo https://zenodo.org/records/15725070 (2025).

  42. Carroll, S. A. et al. Molecular evolution of viruses of the family Filoviridae based on 97 whole-genome sequences. J. Virol. 87, 2608–2616 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We thank members of the Blainey, Davey, Uhler and Hacohen labs for critical feedback and discussions; C. Diaz and J. Bauman in the lab of J. T. Neal at the Broad Institute for assistance in developing custom antibody conjugations; and Y. Qin for assistance with data management. The HeLa cell line was used in this research. Henrietta Lacks, and the HeLa cell line that was established from her tumour cells without her knowledge or consent in 1951, have made significant contributions to scientific progress and advances in human health. We thank Lacks, now deceased, and the Lacks family for their contributions to biomedical research. This work was supported by the Broad Institute through startup funding (to P.C.B.) and the BN10 programme, and two grants from the National Human Genome Research Institute (HG009283 and RM HG006193). P.C.B. was supported by a Career Award at the Scientific Interface from the Burroughs Wellcome Fund. R.J.C. was supported by a Fannie and John Hertz Foundation Fellowship and an NSF Graduate Research Fellowship. C.F.B. was supported by R01AI148663 and P01AI120943. A.R. was supported by a George F. Carrier Postdoctoral Fellowship. A.R. and C.U. acknowledge support from the Eric and Wendy Schmidt Center at the Broad Institute, NCCIH/NIH (1DP2AT012345), and ONR (N00014-22-1-2116). G.K.A., D.W.L, C.F.B and R.A.D. are supported by NIH P01AI120943.

Author information

Authors and Affiliations

Authors

Contributions

R.J.C. and J.J.P. designed the approach with input from all authors. R.J.C, J.J.P., B.Y.S. and N.T. performed experiments. R.J.C. performed analysis aside from developing the deep learning model. A.R. and G.S. developed the deep learning methodology and designed the architecture for it with input from R.J.C., J.J.P. and C.U., and G.S. trained the model to obtain the single-cell embeddings. A.S. provided critical feedback and performed custom antibody conjugation. D.W.L. and K.C.F.S. developed the VP35 antibody. G.K.A., K.C.F.S., D.W.L., C.F.B., N.H., C.U., R.A.D. and P.C.B. supervised the research. R.J.C. and J.J.P. wrote the manuscript with contributions from all authors.

Corresponding authors

Correspondence to Robert A. Davey or Paul C. Blainey.

Ethics declarations

Competing interests

P.C.B. is a consultant to or holds equity in 10X Genomics, General Automation Lab Technologies/Isolation Bio, Next Gen Diagnostics, Cache DNA, Concerto Biosciences, Stately, Ramona Optics, Bifrost Biosystems, and Amber Bio. His laboratory received research funding from Calico Life Sciences, Merck, and Genentech for work related to genetic screening. N.H. holds equity in and advises Danger Bio/Related Sciences, owns equity in BioNtech and receives research funding from Bristol Myers Squibb. C.U. serves on the Scientific Advisory Board of Immunai, Relation Therapeutics, and Focal Biosciences, and receives research funding from AstraZeneca and Janssen Pharmaceuticals. The Broad Institute and MIT may seek to commercialize aspects of this work, and related applications for intellectual property have been filed. A.S. is an employee at Genentech and R.J.C. is an employee at Flagship Pioneering. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Microbiology thanks Stefan Bonn, Thomas Hoenen and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Table 1 Statistics for fraction and number of cells post initial filtering steps based on in situ sequencing data
Extended Data Table 2 Statistics on cell coverage for targeting and non-targeting sgRNA post filtering

Extended Data Fig. 1 A genome-wide optical pooled screen identifies genes affecting VP35 protein, RNA, and c-Jun.

(a) Integration of optical pooled screening workflow with RNA FISH detection using HCR amplification. (b) Histograms of intensity features in five channels for non-targeting controls cells that were infected or not infected in the genome-wide optical pooled screen at 28 h. (c) Top 40 hits with increased or decreased VP35 protein levels and the number of non-Ebola virus genetic screens or Ebola-specific genetic screens they scored in. Genes not previously associated with Ebola in the literature are marked with an orange asterisk. (d) Gene set enrichment analysis of genes with significantly decreased (purple) or increased (gold) Ebola virus VP35 protein levels. (e) Volcano plot showing genes that scored significantly for changes in VP35 RNA levels by FISH. (f) Enrichr analysis of gene ontology terms significantly enriched in genes that reduced VP35 RNA levels. (g) Volcano plot showing genes that scored significantly for changes in c-Jun levels. (h) Enrichr analysis of gene ontology terms significantly enriched in genes that reduced or enhanced c-Jun levels.

Source data

Extended Data Fig. 2 Quality control and filtering information for genome-wide optical pooled screen.

(a) Per-channel and per-plate mean intensities across 2562 fields of view for two out of eight plates in the genome-wide screen. (b) Same as (a) but post illumination-correction. (c) Same as (b) but after averaging across all channels for each plate and removing the bottom 10th percentile of each image. (d) Scatterplots of i and j coordinates for VP35 median intensity per-cell (top) and vimentin median intensity per-cell (bottom) for 10,000 randomly selected cells across the entire genome-wide screen post post-illumination correction. (e) Frequency of i and j coordinates for cells pre-filtering for duplicate phenotyping cells and cells at field of view edges and post-filtering. (f) Pre-normalization mean per-field of view FISH median intensity (plate I, well A1) pre-normalization and post-scaling to mean and unit variance based on features for non-targeting cells within -/+ the width of a field of view of each cell of interest. (g) Performance for the random forest classification of apoptotic, mitotic, and interphase cells trained on manual annotations.

Source data

Extended Data Fig. 3 Additional unsupervised and fine-tuned autoencoder metrics.

(a) Fully unsupervised autoencoder reconstruction losses for training and test sets across 25 epochs. (b) Examples of manually labelled faint, punctate, cytoplasmic, and peripheral input cell images with accompanying unsupervised autoencoder reconstructions. (c) Fine-tuned autoencoder trained using negative log likelihood loss with balanced validation accuracy also reported across 50 epochs of training. (d) Best model train and test set accuracies for the VP35 protein localization prediction task using SVMs on latent embeddings from the unsupervised autoencoder, predefined features, a Resnet-50 architecture trained on the prediction task, or the fine-tuned autoencoder. Predefined features include intensity, correlation, and texture morphological features similar to those previously described for Cell Painting18. (e) Confusion matrix of model predictions vs manually labelled classifications on model test set. (f) Proportion of cells in each VP35 localization category for non-targeting controls and the genes with the largest proportion of faint (NPC1), punctate (UQCRB), and peripheral (ITGB1) cells. Error bars indicate SEM across sgRNAs targeting the same gene.

Source data

Extended Data Fig. 4 Clustering and dimensionality reduction identify information.

(a) Adjusted Rand score for Leiden clustering at different resolutions for 50 folds of 90% of the input data. Box plots here and in (d) indicate median (middle line), 25th, 75th percentile (box) and 1.5 times the IQR (whiskers) as well as outliers (single points). (b) Additional single-cell images of select genetic knockouts from the genome-wide optical pooled screen. (c) Correlation between the PHATE potential distance from the clustering using the fine-tuned model and the adjusted FDR p-value from the Kotliar study, noting genes whose expression significantly increased or decreased along with infection. (d) Correlation between the PHATE potential distance from the supervised clustering and the number of mass spectrometry studies that identified the genes as an interactor with an Ebola virus protein. (e) Venn diagram showing overlap between top optical pooled screen hits, genes that were present in at least one mass spectrometry study, and differentially expressed genes from Kotliar et al’s single-cell RNA sequencing study. (f) Correlation between the PHATE potential distance from the supervised clustering and the 95th percentile z-score for each gene in other virus genetic screens.

Source data

Extended Data Fig. 5 Summary of analytical approaches used in the manuscript.

(a) Workflow of analytical approaches used in this manuscript.

Extended Data Fig. 6. Additional secondary screen metrics.

(a) Fraction of non-significant secondary screen hits (p >=0.05) at varying thresholds of the genome-wide FDR-adjusted p-value for the HeLa late infection timepoint VP35 protein intensity. (b) Correlation between genome-wide c-Jun median nuclear delta AUC scores and secondary screen delta AUC scores; black lines indicate standard deviation for non-targeting control sgRNAs in each screen centred around the mean value for non-targeting sgRNAs in the screen. (c) Secondary screen mean viral protein (VP35 for EBOV and SUDV or VP40 for MARV) and RNA intensities in non-targeting control sgRNAs relative to HeLa cells infected with EBOV. (d) Volcano plots for VP35 (EBOV, SUDV) or VP40 (MARV) protein expression in each of the twelve screening conditions. (e) Volcano plot for viral VP35 RNA levels in HeLa cells at the late timepoint condition. (f) Heatmaps showing the difference between HeLa cell and Huh7 cell z-scored delta AUCs for members of the GARP, retromer, and the Sec61 complex. Hierarchical clustering performed using Euclidean distance. (g) Heatmap showing z-scored delta AUC values for genes identified as enriched for a punctate phenotype in the genome-wide screen and also included in secondary screens (white cells indicate conditions where p > 0.05). z-scored dAUC values for VP35 or VP40 protein were calculated on delta AUC values for all genes in each screen condition relative to means and standard deviations for non-targeting sgRNAs. White cells indicate conditions where p > 0.05 relative to non-targeting controls in the same condition. Hierarchical clustering performed using Pearson correlations.

Source data

Supplementary information

Reporting Summary

Supplementary Tables 1, 3–6

Supplementary Table 1. Per-gene mean cumulative ΔAUC scores and P values for VP35 protein, VP35 RNA FISH and c-Jun channels in genome-wide screen. Table 3. Per-gene mean deep learning predictions of VP35 subcellular protein localization and associated ordinal chi-square statistics and P values. Table 4. Summary of PHATE coordinates and gene cluster membership for supervised or unsupervised clustering, as well as the top 15 significant terms for each cluster. Table 5. Per-gene mean ΔAUC scores and P values for VP35 protein and VP35 RNA FISH channels in all secondary screen conditions. Table 6. Summary of key results across all multiple screens and analyses for all genes in the study.

Supplementary Table 2

Per-gene mean random forest regression results for VP35 RNA FISH and c-Jun predictions in genome-wide screen.

Source data

Source Data Fig. 1

Statistical source data.

Source Data Fig. 2

Statistical source data.

Source Data Fig. 3

Statistical source data.

Source Data Fig. 4

Statistical source data.

Source Data Extended Data Fig. 1

Statistical source data.

Source Data Extended Data Fig. 2

Statistical source data.

Source Data Extended Data Fig. 3

Statistical source data.

Source Data Extended Data Fig. 4

Statistical source data.

Source Data Extended Data Fig. 6

Statistical source data.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Carlson, R.J., Patten, J.J., Stefanakis, G. et al. Single-cell image-based screens identify host regulators of Ebola virus infection dynamics. Nat Microbiol 10, 1989–2002 (2025). https://doi.org/10.1038/s41564-025-02034-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue date:

  • DOI: https://doi.org/10.1038/s41564-025-02034-3

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing