Fig. 13
From: A comprehensive multimodal dataset for contactless lip reading and acoustic analysis

Classification performance for human speech across 10 sentences, shown as confusion matrices for (a) UWB signals, (b) the fusion of video and UWB, and (c) the fusion of audio and UWB.