Fig. 5

The mechanism of Low-Rank Adaptation (LoRA) for parameter-efficient fine-tuning. Instead of updating the original pretrained weights (W), LoRA introduces two small, low-rank matrices (A, B). The input (x) passes through both the frozen original path and the trainable “side branch” (A, B). The outputs are then added together. Only matrices A and B are updated during training, drastically reducing the number of trainable parameters.