Figure 6

The structure of Multitask Learner, an LSTM autoencoder followed by multi-heads output. All the group layers have their own parameters, and they share the LSTM autoencoder along with the bottle-neck features. It is equivalent to CALM-NET if each group layer contains exactly one student.