Fig. 2: Overview of PULSE program data extraction, transfer, and loading (ETL) process.

Architectural design for automated and iterative extraction, transformation, and loading of structured and unstructured cardiovascular data resources for consented program participants using centralized pseudonymization. Following patient self-guided program enrollment and consent in clinics, pan-institutional and service-specific patient-reported outcome measures (PROMs) are collected in addition to social determinants of health (SDOH) using point-of-service and/or remote (e-link) electronic surveys. Patient consent status is monitored by a Level 4 (PHI identified) server for automated query-retrieve of eligible data resources from institutional data repositories. Resource-specific pipelines for data pseudonymization are then executed prior to their migration to a cloud data lake for authorized end-user activities.