Fig. 1: Reliable evaluation of the capabilities of AI models on shortcut-free datasets.
From: Mitigating data bias and ensuring reliable evaluation of AI models with shortcut hull learning

a When individuals or AI models learn from datasets that contain shortcuts, they may rely on features other than the intended ones to recognize the same samples, which makes evaluation results misleading. In contrast, when they learn from shortcut-free datasets, different individuals or AI models can rely only on the intended feature to recognize the same samples, so evaluation results are reliable.

b The curse of shortcuts comprises two challenges. The first is covering all possible shortcut features, because the number of features in high-dimensional data grows exponentially with the data dimension. The second is intervening on the covered shortcut features: the overall label is coupled with local features, so intervening on local features inevitably affects the overall label.

c Shortcut hull learning (SHL) uses a model suite composed of models with different inductive biases and learns the shortcut hull (SH) of a high-dimensional dataset through the intersection of the features learned by each model. The more models the suite contains, the more accurately the SH is learned. Diversity in the models' inductive biases significantly accelerates the learning of the SH, and learning the SH directly avoids intervening on the features of the data, thereby addressing both challenges described in (b).
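
The intersection idea in panel (c) can be illustrated with a toy sketch. It is not the authors' implementation: it assumes, purely for illustration, that the features each model learns can be written as a plain Python set of named features. The function name `estimate_shortcut_hull` and all feature names are hypothetical.

```python
def estimate_shortcut_hull(learned_feature_sets):
    """Return the features shared by every model in the suite.

    Toy assumption: each model's learned features are given as a set of
    hypothetical feature names. The intersection stands in for the SH estimate.
    """
    sets = [set(s) for s in learned_feature_sets]
    if not sets:
        raise ValueError("the model suite must contain at least one model")
    return set.intersection(*sets)


# Hypothetical features picked up by three models with different inductive biases.
cnn_features = {"texture", "local_edges", "global_shape"}
transformer_features = {"global_shape", "texture", "color_histogram"}
mlp_features = {"color_histogram", "texture", "pixel_position"}

suite = [cnn_features, transformer_features, mlp_features]

# Adding models to the suite can only keep or shrink the shared set.
for k in range(1, len(suite) + 1):
    print(k, sorted(estimate_shortcut_hull(suite[:k])))
# 1 ['global_shape', 'local_edges', 'texture']
# 2 ['global_shape', 'texture']
# 3 ['texture']
```

In this toy setting each added model can only narrow the shared set, which mirrors the caption's statement that a larger suite yields a more accurate estimate. Models with dissimilar feature sets narrow it in fewer steps, which mirrors the role of diverse inductive biases.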