Table 7 Data quality check criteria

From: Continuous multimodal data supply chain and expandable clinical decision support for oncology

Quality Indicators

Detailed Quality Indicators

Diagnostic Targets

Remarks

Completeness

Individual Completeness

Columns or input values defined to exist but are Null

 

Conditional Completeness

Checking for NOT NULL constraints

 

Structural Completeness

Implementation based on the physical model designed from the schema,

including data types

Verified at the DB design stage

Validity

Code Validity

Whether codes defined in the common code are used

 

Format Validity

Errors in data format

Verified at the DB design stage

Boolean Validity

Diagnosis based on columns with Y/N, 0/1 criteria

 

Date Validity

Errors based on date formats

 

Range Validity

Diagnosis based on Min, Max, and Normal range of the column

 

Temporal Relationship Validity

Diagnosis of data that deviates from predetermined sequential relationships

 

Consistency

Referential Integrity

Diagnosis of operation rules for PK (Primary Key) items

Verified at the DB design stage

Accuracy

Logical Relationship Accuracy

Data diagnosis according to logical relationships,

e.g., when item A is n, item B should be at least m

 

Derived Item Accuracy

Diagnosis of derived data, e.g., whether the sum of item A and item B is equal

Â