Fig. 1 | Scientific Data

Fig. 1

From: Petagraph: A large-scale unifying knowledge graph framework for integrating biomolecular and biomedical data

Fig. 1

Petagraph Data Ingestion Workflow. The data ingestion workflow starts with Step 1 by downloading the UBKG base CSVs and the dataset(s) that will be integrated. Then in Step 2, the raw dataset (or ontology) must be formatted into nodes and edges files according to guidelines found in the UBKG user guide. If the import file is in OWL format then the user can jump to Step 3 which involves running the OWLNETS Python script to convert the edges and nodes files into the UBKG format and simply appended to the base UBKG files. Lastly, for Step 4k,Neo4js command-line bulk import tool is used to build the graph database.

Back to article page