Command Palette
Search for a command to run...
GIFT-SW: Gaussian noise Injected Fine-Tuning of Salient Weights for LLMs
Maxim Zhelnin Viktor Moskvoretskii Egor Shvetsov Egor Venediktov Mariya Krylova Aleksandr Zuev Evgeny Burnaev

Abstract
Parameter Efficient Fine-Tuning (PEFT) methods have gained popularity anddemocratized the usage of Large Language Models (LLMs). Recent studies haveshown that a small subset of weights significantly impacts performance. Basedon this observation, we introduce a novel PEFT method, called Gaussian noiseInjected Fine Tuning of Salient Weights (GIFT-SW). Our method updates onlysalient columns, while injecting Gaussian noise into non-salient ones. Toidentify these columns, we developeda generalized sensitivity metric thatextends and unifies metrics from previous studies. Experiments with LLaMAmodels demonstrate that GIFT-SW outperforms full fine-tuning and modern PEFTmethods under the same computational budget. Moreover, GIFT-SW offers practicaladvantages to recover performance of models subjected to mixed-precisionquantization with keeping salient weights in full precision.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| parameter-efficient-fine-tuning-on-boolq | LLaMA2-7b | Accuracy (% ): 82.63 |
| parameter-efficient-fine-tuning-on-hellaswag | LLaMA2-7b | Accuracy (% ): 76.68 |
| parameter-efficient-fine-tuning-on-winogrande | LLaMA2-7b | Accuracy (% ): 70.80 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.