Fig. 6: ASR and accuracy on GLUE benchmarks with varying hyperparameters on LLaMA-2-7b.
From: Nexus scissor: enhance open-access language model safety by connection pruning

ASR is evaluated against the BDFinetune attack.