Fig. 1: The National Human Genome Research Institute (NHGRI) archive chronicles the Human Genome Project and subsequent genomics projects. | Nature Communications

From: A digital archive reveals how a funding agency cooperated with academics to support the nascent field of genomics

(A) Overview of the digitization, curation, and digital enrichment of the NHGRI archive, guided by ethical considerations described in our companion manuscript (ref. 32). PSS refers to page stream segmentation of collated scanned files. Founding vision refers to three themes of genomics from Collins et al., 2003 (ref. 28). (B) Breakdown of the number of files, documents, and pages of archival materials inside the Core Collection. MS refers to Microsoft Office suite files. (C) The majority of documents whose dates were extracted from the Core Collection fall between 1988 and 2012. (D) The timeline of major genomics projects chronicled by the Core Collection. PAGE refers to the Population Architecture Using Genomics and Epidemiology Consortium, eMERGE refers to the Electronic Medical Records and Genomics Network, and H3Africa refers to Human Heredity and Health in Africa. (E) t-SNE projection of an unsupervised representation of all text documents (dots) from the Core Collection. Colors correspond to folders assigned by the History of Genomics Program before computational analysis (Supplementary Data S1).
