Table 7 Data quality check criteria
From: Continuous multimodal data supply chain and expandable clinical decision support for oncology
Quality Indicators | Detailed Quality Indicators | Diagnostic Targets | Remarks |
---|---|---|---|
Completeness | Individual Completeness | Columns or input values defined to exist but are Null | |
Completeness | Conditional Completeness | Checking for NOT NULL constraints | |
Completeness | Structural Completeness | Implementation based on the physical model designed from the schema, including data types | Verified at the DB design stage |
Validity | Code Validity | Whether codes defined in the common code are used | |
Validity | Format Validity | Errors in data format | Verified at the DB design stage |
Validity | Boolean Validity | Diagnosis based on columns with Y/N, 0/1 criteria | |
Validity | Date Validity | Errors based on date formats | |
Validity | Range Validity | Diagnosis based on Min, Max, and Normal range of the column | |
Validity | Temporal Relationship Validity | Diagnosis of data that deviates from predetermined sequential relationships | |
Consistency | Referential Integrity | Diagnosis of operation rules for PK (Primary Key) items | Verified at the DB design stage |
Accuracy | Logical Relationship Accuracy | Data diagnosis according to logical relationships, e.g., when item A is n, item B should be at least m | |
Accuracy | Derived Item Accuracy | Diagnosis of derived data, e.g., whether the sum of item A and item B is equal | |
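
Several of the criteria in the table can be expressed as simple record-level rules. The following is a minimal sketch, not the implementation used in the paper: the field names (`patient_id`, `smoker`, `diagnosis_date`, `surgery_date`, `age`, `nodes_left`, `nodes_right`, `nodes_total`), the ISO date format, and the 0 to 120 age range are all hypothetical choices made for illustration.

```python
from datetime import datetime

# Hypothetical sample records; none of these fields or values come from the source.
RECORDS = [
    {"patient_id": "P001", "smoker": "Y", "diagnosis_date": "2021-03-04",
     "surgery_date": "2021-04-01", "age": 57,
     "nodes_left": 2, "nodes_right": 1, "nodes_total": 3},
    {"patient_id": "P002", "smoker": "X", "diagnosis_date": "2021-13-40",
     "surgery_date": "2021-01-15", "age": 214,
     "nodes_left": 1, "nodes_right": 1, "nodes_total": 5},
    {"patient_id": None, "smoker": "N", "diagnosis_date": "2022-05-10",
     "surgery_date": "2022-05-01", "age": 63,
     "nodes_left": 0, "nodes_right": 0, "nodes_total": 0},
]


def parse_date(value):
    """Return a date for a 'YYYY-MM-DD' string, or None if it does not parse."""
    try:
        return datetime.strptime(value, "%Y-%m-%d").date()
    except (TypeError, ValueError):
        return None


def check_record(rec):
    """Return the names of the quality indicators that a record fails."""
    failed = []

    # Individual Completeness: a value defined to exist is Null.
    if rec["patient_id"] is None:
        failed.append("Individual Completeness")

    # Boolean Validity: the column must hold Y or N.
    if rec["smoker"] not in {"Y", "N"}:
        failed.append("Boolean Validity")

    # Date Validity: both dates must parse in the assumed format.
    diagnosed = parse_date(rec["diagnosis_date"])
    operated = parse_date(rec["surgery_date"])
    if diagnosed is None or operated is None:
        failed.append("Date Validity")

    # Range Validity: assumed plausible range for age.
    age = rec["age"]
    if not isinstance(age, int) or not 0 <= age <= 120:
        failed.append("Range Validity")

    # Temporal Relationship Validity: surgery should not precede diagnosis.
    # Only evaluated when both dates are valid.
    if diagnosed and operated and operated < diagnosed:
        failed.append("Temporal Relationship Validity")

    # Derived Item Accuracy: the stored total must equal the sum of its parts.
    if rec["nodes_left"] + rec["nodes_right"] != rec["nodes_total"]:
        failed.append("Derived Item Accuracy")

    return failed


if __name__ == "__main__":
    for rec in RECORDS:
        print(rec["patient_id"], check_record(rec))
```

Criteria marked "Verified at the DB design stage" (structural completeness, format validity, referential integrity) are left out of the sketch, since the table assigns them to the schema rather than to record-level checks.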