Table 1 Summary of the datasets. The “Mean” column gives the average number of videos per class (gloss).
From: A deep learning-based method combines manual and non-manual features for sign language recognition
| Dataset | Language | Sensor | Gloss | Videos | Mean | Signers |
|---|---|---|---|---|---|---|
| WLASL300 | ASL | RGB | 300 | 5,117 | 17.1 | 109 |
| AUTSL | TSL | RGB+D | 226 | 38,336 | 169.6 | 43 |
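
The “Mean” values appear to be the number of videos divided by the number of glosses, rounded to one decimal place. The following minimal sketch checks that reading against the table; the dictionary layout and the helper name `mean_videos_per_gloss` are illustrative assumptions, not part of the original work.

```python
# Counts copied from Table 1 (gloss and video totals per dataset).
datasets = {
    "WLASL300": {"glosses": 300, "videos": 5117},
    "AUTSL": {"glosses": 226, "videos": 38336},
}


def mean_videos_per_gloss(videos: int, glosses: int) -> float:
    """Average number of videos per gloss, rounded to one decimal place."""
    return round(videos / glosses, 1)


# Hypothetical helper applied to every dataset in the table.
means = {
    name: mean_videos_per_gloss(d["videos"], d["glosses"])
    for name, d in datasets.items()
}

for name, value in means.items():
    print(f"{name}: {value}")
```

Under this assumption the computed values match the “Mean” column, which suggests that AUTSL provides roughly ten times as many examples per class as WLASL300.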