Fig. 1

Framework of the bipartite graph neural network model for AD classification. The image modality (red) and the text modality (blue) are connected through bipartite graphs (violet), following the design of a vision language model (VLM). As in a VLM, the image-text similarity serves as the edge weight of each bipartite graph.
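
The edge-weight construction described in the caption can be illustrated with a minimal sketch. The caption does not state which similarity measure is used, so cosine similarity between embeddings is assumed here; the function names and the toy embeddings are hypothetical and are not taken from the model itself.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors (assumed measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # define similarity with a zero vector as 0
    return dot / (norm_u * norm_v)


def bipartite_edge_weights(image_embeddings, text_embeddings):
    """Return W where W[i][j] is the weight of the edge between
    image node i and text node j of the bipartite graph."""
    return [
        [cosine_similarity(img, txt) for txt in text_embeddings]
        for img in image_embeddings
    ]


# Hypothetical toy embeddings: 2 image nodes, 3 text nodes, dimension 2.
image_embeddings = [[1.0, 0.0], [0.0, 2.0]]
text_embeddings = [[1.0, 0.0], [1.0, 1.0], [0.0, 1.0]]

weights = bipartite_edge_weights(image_embeddings, text_embeddings)
for row in weights:
    print([round(w, 4) for w in row])
```

Each row of `weights` corresponds to one image node and each column to one text node, so the matrix is the weighted biadjacency matrix of the bipartite graph. Edges exist only between the two modalities, never within one.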