Fig. 5: Schema development.

Phases of schema development leveraging the curated data resources of the PULSE program library. Every schema request is registered and assigned a unique ID for data lineage tracking and versioning. The PULSE data analyst works with the designated domain expert(s) to define the use case(s) before entering the design phase. In the design phase, relevant variables are selected from the existing library and any required transformations are identified (each establishing a new variable in the library); new variables are then obtained from other sources and likewise added to the library. Schemas constructed by the data engineer are passed back to the data analyst, who validates them against real-world data and assesses the feasibility of deploying the use case. Upon validation, the schema configuration is locked, versioned, and set to run on the desired data generation schedule.