A video-based deep-learning system was trained to understand the spectrum of human cardiovascular disease through contrastive learning, a self-supervised method, using pairs of cardiac MRI scans and the corresponding text reports generated during routine clinical practice.
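
To make the training objective concrete, the following is a minimal sketch of a symmetric contrastive loss over paired scan and report embeddings. It assumes a CLIP-style InfoNCE formulation with cosine similarity and a temperature of 0.07; the source does not state the exact loss, encoders, or hyperparameters, so the function names (`normalize`, `cross_entropy_diagonal`, `contrastive_loss`) and the toy random embeddings are hypothetical and stand in for the outputs of a video encoder and a text encoder.

```python
import math
import random


def normalize(vec):
    # Scale a vector to unit length so dot products become cosine similarities.
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def cross_entropy_diagonal(logits):
    # Mean of -log softmax(row_i)[i]: row i is rewarded for scoring column i highest.
    total = 0.0
    for i, row in enumerate(logits):
        peak = max(row)  # subtract the max for numerical stability
        log_norm = peak + math.log(sum(math.exp(x - peak) for x in row))
        total += log_norm - row[i]
    return total / len(logits)


def contrastive_loss(video_emb, text_emb, temperature=0.07):
    # Assumption: temperature=0.07 is an illustrative value, not taken from the source.
    v = [normalize(e) for e in video_emb]
    t = [normalize(e) for e in text_emb]
    # logits[i][j] = similarity between scan i and report j; the diagonal holds true pairs.
    logits = [[sum(a * b for a, b in zip(vi, tj)) / temperature for tj in t] for vi in v]
    transposed = [list(col) for col in zip(*logits)]
    # Average the scan-to-report and report-to-scan directions.
    return 0.5 * (cross_entropy_diagonal(logits) + cross_entropy_diagonal(transposed))


# Toy data: 4 hypothetical report embeddings and near-identical scan embeddings.
random.seed(0)
reports = [[random.gauss(0, 1) for _ in range(8)] for _ in range(4)]
scans = [[x + 0.01 * random.gauss(0, 1) for x in row] for row in reports]

matched = contrastive_loss(scans, reports)           # each scan paired with its own report
mismatched = contrastive_loss(scans[::-1], reports)  # pairing deliberately scrambled
print(matched, mismatched)
```

In this sketch the loss is low when each scan embedding lies closest to its own report embedding and high when the pairing is scrambled, which is the signal that lets such a system learn from routine clinical reports without manual labels.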