Extended Data 1: Unsupervised clustering of NLP extracted textual features from pediatric diseases. | Nature Medicine