Table 4 Evaluation results of the DAIC-WOZ dataset
From: Depression detection methods based on multimodal fusion of voice and text
| Modality | Metric | MCN | Bi-LSTM | MCN+Bi-LSTM |
|---|---|---|---|---|
| Audio | Accuracy | 0.9434 | 0.9496 | – |
| | Precision | 0.8603 | 0.9608 | – |
| | Recall | 0.9277 | 0.9157 | – |
| | F1 score | 0.8928 | 0.9021 | – |
| Text | Accuracy | 0.7021 | 0.7471 | – |
| | Precision | 0.6506 | 0.6723 | – |
| | Recall | 0.7020 | 0.7470 | – |
| | F1 score | 0.6753 | 0.7077 | – |
| Audio+Text | Accuracy | 0.9343 | 0.9274 | 0.9794 |
| | Precision | 0.8170 | 0.7926 | 0.9702 |
| | Recall | 0.9548 | 0.9669 | 0.9631 |
| | F1 score | 0.8806 | 0.8711 | 0.9666 |