Taiqiang Wu Jiahao Wang Zhe Zhao Ngai Wong

Abstract
In this paper, we introduce a subspace-inspired Low-Rank Adaptation (LoRA) method, which is computationally efficient, easy to implement, and readily applicable to large language, multimodal, and diffusion models. Initially, we equivalently decompose the weights of LoRA into two subspaces, and find that simply mixing them can enhance performance. To study this phenomenon, we revisit it through a fine-grained subspace lens, showing that such a modification is equivalent to employing a fixed mixer to fuse the subspaces. To be more flexible, we jointly learn the mixer with the original LoRA weights, and term the method Mixture-of-Subspaces LoRA (MoSLoRA). MoSLoRA consistently outperforms LoRA on tasks in different modalities, including commonsense reasoning, visual instruction tuning, and subject-driven text-to-image generation, demonstrating its effectiveness and robustness. Code is available at https://github.com/wutaiqiang/MoSLoRA.
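The mixer idea in the abstract can be illustrated with a small numerical sketch. This is not the authors' implementation: it assumes the common LoRA convention of a down-projection `A` (r × d_in) and an up-projection `B` (d_out × r), and the function names, shapes, and random values are hypothetical. In MoSLoRA the mixer is trained jointly with the LoRA weights; here it is a random matrix, used only to show the algebra.

```python
import numpy as np


def lora_delta(A, B):
    """Vanilla LoRA weight update: delta_W = B @ A."""
    return B @ A


def mixed_delta(A, B, mixer):
    """Update with an (r x r) mixer inserted between B and A: delta_W = B @ mixer @ A."""
    return B @ mixer @ A


rng = np.random.default_rng(0)
d_in, d_out, r = 8, 6, 4          # illustrative sizes, not taken from the paper
A = rng.normal(size=(r, d_in))    # assumed down-projection
B = rng.normal(size=(d_out, r))   # assumed up-projection

# With an identity mixer, the mixed update reduces to vanilla LoRA.
identity_update = mixed_delta(A, B, np.eye(r))

# A dense mixer weights every pairing of a column of B with a row of A,
# which is one way to read "mixing subspaces".
mixer = rng.normal(size=(r, r))   # stands in for a learned mixer
dense_update = mixed_delta(A, B, mixer)

# The same update written as an explicit weighted sum of rank-1 terms.
pairwise_sum = sum(
    mixer[i, j] * np.outer(B[:, i], A[j, :])
    for i in range(r)
    for j in range(r)
)

# The mixed update still has rank at most r, so it can be merged into
# the frozen weight just like a plain LoRA update.
update_rank = np.linalg.matrix_rank(dense_update)
```

Under these assumptions the mixer adds only r × r parameters on top of LoRA, which is small relative to `A` and `B` whenever r is much smaller than d_in and d_out.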
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| common-sense-reasoning-on-arc-challenge | LLaMA-3 8B + MoSLoRA (fine-tuned) | Accuracy: 81.5 |
| common-sense-reasoning-on-arc-easy | LLaMA-3 8B + MoSLoRA (fine-tuned) | Accuracy: 90.5 |
| common-sense-reasoning-on-winogrande | LLaMA-3 8B + MoSLoRA | Accuracy: 85.8 |
| question-answering-on-boolq | LLaMA-3 + MoSLoRA | Accuracy: 74.6 |
| question-answering-on-openbookqa | LLaMA-3 8B + MoSLoRA | Accuracy: 86.8 |
| question-answering-on-piqa | LLaMA-3 8B + MoSLoRA | Accuracy: 89.7 |
| question-answering-on-social-iqa | LLaMA-3 8B + MoSLoRA (fine-tuned) | Accuracy: 81.0 |
| visual-question-answering-on-mm-vet | InternLM2+ViT (QMoSLoRA) | GPT-4 score: 35.2 |
| visual-question-answering-on-mm-vet | LLaVA-InternLM2-7B-ViT + MoSLoRA | GPT-4 score: 35.2 |
| visual-question-answering-on-mmbench | LLaVA-InternLM2-ViT + MoSLoRA | GPT-3.5 score: 73.8 |
| visual-question-answering-on-mmbench | LLaVA-LLaMA3-8B-ViT + MoSLoRA | GPT-3.5 score: 73.0 |