Artificial intelligence approaches inspired by human cognitive function have usually single learned ability. The authors propose a multimodal foundation model that demonstrates the cross-domain learning and adaptation for broad range of downstream cognitive tasks.
- Nanyi Fei
- Zhiwu Lu
- Ji-Rong Wen