Table 2 Time taken taken to perform reads of a number of columns of the Patient table from the Covid Symptom Study 2021/05/23 snapshot.

From: Accessible data curation and analytics for international-scale citizen science datasets

Field count

Time to read patient fields (seconds)

ExeTera

Pandas

Dask

PostgreSQL

2 fields

0.068

(142.56)

NA

3.22

4 fields

0.075

(143.21)

NA

8.24

8 fields

0.084

(142.35)

NA

9.71

  1. Figures in parentheses denote that the read required more than 32 GB of memory to succeed. NA denotes that the operation could not be performed due to the dataset not being successfully imported. Figures in bold indicate the best read time.