Fig. 3: Scatter plots of predicted vs. actual neuropsychiatric state score.

The gray dotted line represents perfect correlation. a Shows the results of the semantic search approach based on the similarities between speech transcriptions. b Reports the score estimates of the machine learning model (Random Forest) applied to gte-Qwen2-1.5B-instruct embeddings. c Illustrates the predictions computed by Gemma 2-9b. The colored dots represent the average estimates of 9 predictions. RMSE Root Mean Squared Error, R2 coefficient of determination.