Table 1 The heterogeneous nature of the multimodal data acquired in AMP SCZ.

From: Enabling FAIR data stewardship in complex international multi-site studies: Data Operations for the Accelerating Medicines Partnership® Schizophrenia Program

| Domain | Frequency | Type | Format | Size |
|---|---|---|---|---|
| Form data (interviews, runsheet, cognition, etc.) | monthly | tabular | JSON, CSV | 10 MB/participant |
| EEG | baseline & month 2 | binary | BrainVision Core Data Format 1.0 | 1 GB/session |
| MRI | baseline & month 2 | binary | DICOM | 5 GB/session |
| Smartphone sensor | dailyᵃ | tabular | JSON | 100 MB/day collected |
| Actigraphy watch | monthlyᵃ | binary | CWA | 200 MB/month |
| A/V | daily diaries (phone); baseline & month 2 (open); monthly (PSYCHS)ᵇ | binary | WAV, MP3, M4A | 200 MB/hour for video; 20 MB/hour for audio only; 200 MB/WAV file |

ᵃ Smartphone and actigraphy watch data are sampled multiple times per second but transferred as a single data file daily (smartphone) or monthly (watch).

ᵇ Audio/video recordings of open interviews are collected twice (baseline and month 2); audio/video recordings of PSYCHS are collected up to 10 times (screening, baseline, month 1, 2, 3, 6, 12, 18, 24, and conversion); audio diaries can be recorded on the smartphone at least daily.
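
To give a feel for how the per-unit sizes in Table 1 combine, the following is a minimal sketch of a per-participant volume estimate for the modalities with fixed rates. The function name `estimate_participant_mb`, its two parameters, and the conversion of 1 GB to 1000 MB are assumptions made for illustration; they are not part of the AMP SCZ pipeline. A/V data are left out because their size depends on recording length, which the table does not fix.

```python
# Illustrative only: per-unit sizes are copied from Table 1.
MB_PER_GB = 1000  # assumption: decimal gigabytes


def estimate_participant_mb(days_of_phone_data, months_of_watch_data):
    """Rough data volume (MB) for one participant, excluding A/V recordings.

    days_of_phone_data and months_of_watch_data are hypothetical inputs:
    the number of days/months for which data were actually collected.
    """
    sizes = {
        "form": 10,                                  # 10 MB/participant
        "eeg": 2 * 1 * MB_PER_GB,                    # baseline & month 2, 1 GB/session
        "mri": 2 * 5 * MB_PER_GB,                    # baseline & month 2, 5 GB/session
        "smartphone": 100 * days_of_phone_data,      # 100 MB/day collected
        "actigraphy": 200 * months_of_watch_data,    # 200 MB/month
    }
    sizes["total"] = sum(sizes.values())
    return sizes


if __name__ == "__main__":
    # Example: 30 days of smartphone data and 1 month of watch data.
    for name, mb in estimate_participant_mb(30, 1).items():
        print(f"{name}: {mb} MB")
```

Under these assumptions the two imaging modalities (12 GB for two EEG and two MRI sessions) dominate early on, while the smartphone stream grows linearly with the number of days collected.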