Fig. 5: The framework of our FedKD approach. | Nature Communications


From: Communication-efficient federated learning via knowledge distillation


Local data is used to train both the local mentor model and the global mentee model. Each model learns from the local labeled data and from the other's predictions and hidden results. The local gradients are decomposed before being uploaded to the server, where they are reconstructed for aggregation. The aggregated global gradients are then decomposed again and distributed to the clients for local updates.
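The decompose, upload, reconstruct, aggregate, and redistribute cycle described in the caption can be sketched as follows. This is a minimal illustration rather than the authors' implementation: it assumes the decomposition is a truncated SVD of a 2-D gradient matrix and that aggregation is a plain unweighted mean. The function names (`decompose`, `reconstruct`, `server_round`) and the `rank` parameter are hypothetical and chosen only for this example.

```python
import numpy as np

def decompose(grad, rank):
    """Assumed decomposition: truncated SVD keeping the top `rank` singular values."""
    u, s, vt = np.linalg.svd(grad, full_matrices=False)
    return u[:, :rank], s[:rank], vt[:rank, :]

def reconstruct(factors):
    """Rebuild an (approximate) gradient matrix from its SVD factors."""
    u, s, vt = factors
    return (u * s) @ vt  # scales column i of u by s[i], then multiplies by vt

def server_round(client_grads, rank):
    """One communication round over a list of per-client gradient matrices."""
    uploads = [decompose(g, rank) for g in client_grads]  # done on each client
    rebuilt = [reconstruct(f) for f in uploads]           # done on the server
    global_grad = np.mean(rebuilt, axis=0)                # assumed: unweighted mean
    return decompose(global_grad, rank)                   # sent back to the clients

rng = np.random.default_rng(0)
# Three synthetic 8x6 gradients, each of rank 2, standing in for real client gradients.
client_grads = [rng.normal(size=(8, 2)) @ rng.normal(size=(2, 6)) for _ in range(3)]

broadcast = server_round(client_grads, rank=6)
local_update = reconstruct(broadcast)  # what each client would apply locally
```

With `rank` equal to the smaller matrix dimension, nothing is discarded and the round reproduces the exact mean gradient. Choosing a smaller `rank` sends fewer numbers per matrix (`rank * (rows + cols + 1)` instead of `rows * cols`) at the cost of an approximation error.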
