Extended Data 1: Unsupervised clustering of NLP-extracted textual features from pediatric diseases.
From: Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence

The diagnostic system analyzed the electronic health records (EHRs) without a predefined classification system. The resulting grouping structure reflects trends detected in clinical features without predefined labels or human input. Clustered blocks are marked with grey-lined boxes.