Table 2 Dataset details.

From: HALD, a human aging and longevity knowledge graph for precision gerontology and geroscience analyses

| File | Objects | Articles | Variables | Short Description |
| --- | --- | --- | --- | --- |
| Literature_Info.json | 339,145 | 339,145 | PMID, title (TI), abstract (AB), IF (Journal Impact Factor), IF5 (Five-year Journal Impact Factors), author (AU), full author (FAU), affiliation (AD), publication type (PT), date of publication (DP), place of publication (PL), journal title (JT), journal title abbreviation (TA), and source (SO). | JSON file containing the information of human aging and longevity-related literature with abstracts |
| Entity_Info.json | 12,227 | 181,924 | entity, type, official full name, PMID, sentence, number of articles, JT, TA, IF, IF5, years, alias names, description, url, mutation position, mutation alleles, MeSH ID, relation, external links, aging biomarker, and longevity biomarker. | JSON file containing the information of the entities appearing in the literature |
| Relation_Info.json | 115,522 | 50,191 | source entity, relationship, target entity, method, sentence, source, target, source type, target type, PMID, DP, date, TI, TA, IF, and IF5. | JSON file containing the triples information |
| Aging_Biomarkers.json | 1,855 | 1,502 | source entity, relationship, target entity, sentence, source, target, source type, target type, PMID, DP, date, TI, TA, IF, and IF5. | JSON file containing the aging biomarkers information |
| Longevity_Biomarkers.json | 525 | 494 | source entity, relationship, target entity, sentence, source, target, source type, target type, PMID, DP, date, TI, TA, IF, and IF5. | JSON file containing the longevity biomarkers information |
| Entities.csv | 6,906 | 50,191 | ID, name, type, frequency, label | CSV file containing the entities information for Neo4j |
| Roles.csv | 115,514 | 50,191 | start_ID, end_ID, relation, weight, method, type | CSV file containing the relations information for Neo4j |
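
As an illustration of how the two Neo4j CSV files fit together, the following minimal Python sketch joins relation rows to entity rows by ID without needing a database. It assumes that the CSV headers match the variable names listed in Table 2 exactly (ID, name, type, frequency, label; start_ID, end_ID, relation, weight, method, type), that the files are comma-delimited, and that weight is numeric. The sample rows and the `load_graph` helper are hypothetical and exist only to make the sketch runnable; they are not taken from the dataset.

```python
import csv
import io
from collections import defaultdict

# Made-up sample rows for illustration only; just the column names
# come from Table 2. Real files may use different header spellings.
ENTITIES_CSV = """ID,name,type,frequency,label
1,SIRT1,Gene,10,Gene
2,aging,Disease,25,Disease
3,resveratrol,Compound,4,Compound
"""

ROLES_CSV = """start_ID,end_ID,relation,weight,method,type
3,1,activates,2,example_method,Compound-Gene
1,2,associated_with,5,example_method,Gene-Disease
"""


def load_graph(entities_file, roles_file):
    """Hypothetical helper: index entities by ID and group edges by start_ID."""
    entities = {row["ID"]: row for row in csv.DictReader(entities_file)}
    outgoing = defaultdict(list)
    for row in csv.DictReader(roles_file):
        # Keep only edges whose two endpoints both exist in the entity file.
        if row["start_ID"] in entities and row["end_ID"] in entities:
            outgoing[row["start_ID"]].append(
                (row["relation"], row["end_ID"], float(row["weight"]))
            )
    return entities, dict(outgoing)


# With real data, pass open("Entities.csv", newline="") and
# open("Roles.csv", newline="") instead of the in-memory samples.
entities, outgoing = load_graph(io.StringIO(ENTITIES_CSV), io.StringIO(ROLES_CSV))

for start_id, edges in outgoing.items():
    for relation, end_id, weight in edges:
        print(entities[start_id]["name"], relation, entities[end_id]["name"], weight)
```

Filtering out edges with unknown endpoints is a design choice made here because the table lists slightly different object counts for the CSV and JSON files; whether such dangling edges actually occur in the released files is not stated.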