Low-Rank Adaptation (LoRA) has become a widely adopted parameter-efficient fine-tuning (PEFT) technique for large language models (LLMs). LoRA's appeal stems from its lightweight, modular adapters. Standard LoRA applies adapters uniformly across all Transformer layers, implicitly assuming that each layer contributes equally to task adaptation. However, LLMs have been shown to contain internal substructures that contribute disproportionately to task performance. In this work, we provide a theoretical analysis of how LoRA weight updates are influenced by a layer's activation magnitude. We propose Act-LoRA, a simple activation-guided layer selection strategy for selective Low-Rank Adaptation. We evaluate this strategy for both encoder-only and decoder-only architectures using the GLUE benchmark. Our method achieves a 20% GPUh saving with a 1% drop in GLUE score using DeBERTaV3-Base on a single-instance GPU with 50% fewer LoRA parameters. It also achieves a 2% GPUh saving with a less than 0.15% drop in GLUE score with the Llama-3.1-8B model in Distributed Data Parallel mode with 25% fewer LoRA parameters. Our experiments and analysis show that the compute and memory requirements of LoRA adapters increase linearly with the number of selected layers. We further compare activation-guided selection against gradient-guided importance metrics and show that activation norms yield more stable and reproducible layer rankings across seeds and datasets. Overall, our results demonstrate that activation-guided layer selection is a practical and effective way to improve the efficiency of LoRA fine-tuning, making it immediately compatible with existing PEFT techniques and distributed training frameworks.
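To make the selection idea concrete, here is a minimal, hypothetical sketch of activation-guided layer selection. The function name, the per-layer norm values, and the keep-fraction default are all illustrative assumptions, not the paper's actual implementation; in practice, the norms would be measured by running a calibration batch through the frozen base model and recording the magnitude of each layer's output activations.

```python
def select_layers(activation_norms, keep_fraction=0.5):
    """Rank layers by mean activation norm and keep the top fraction.

    activation_norms: per-layer activation magnitudes measured on a
    calibration batch (values below are made up for illustration).
    Returns the sorted indices of layers that receive LoRA adapters.
    """
    ranked = sorted(range(len(activation_norms)),
                    key=lambda i: activation_norms[i], reverse=True)
    k = max(1, round(len(activation_norms) * keep_fraction))
    return sorted(ranked[:k])

# Made-up norms for a 12-layer encoder (e.g. DeBERTaV3-Base has 12 layers).
norms = [3.1, 4.7, 5.2, 6.0, 5.8, 4.9, 4.4, 3.9, 3.5, 3.2, 2.8, 2.5]
print(select_layers(norms, keep_fraction=0.5))  # -> [1, 2, 3, 4, 5, 6]
```

Keeping half the layers halves the LoRA parameter count, which matches the abstract's reported 50% adapter-parameter reduction for the encoder setting; the remaining layers stay frozen with no adapters attached.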