HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

MERGE: Fast Private Text Generation

Zi Liang; Pinghui Wang; Ruofei Zhang; Nuo Xu; Lifeng Xing; Shuo Zhang

MERGE: Fast Private Text Generation

Abstract

The drastic increase in language models' parameters has led to a new trend of deploying models in cloud servers, raising growing concerns about private inference for Transformer-based models. Existing two-party privacy-preserving techniques, however, only take into account natural language understanding (NLU) scenarios. Private inference in natural language generation (NLG), crucial for applications like translation and code completion, remains underexplored.In addition, previous privacy-preserving techniques suffer from convergence issues during model training and exhibit poor inference speed when used with NLG models due to the neglect of time-consuming operations in auto-regressive generations. To address these issues, we propose a fast private text generation framework for Transformer-based language models, namely MERGE.MERGE reuses the output hidden state as the word embedding to bypass the embedding computation and reorganize the linear operations in the Transformer module to accelerate the forward procedure. Extensive experiments show that MERGE achieves a 26.5x speedup to the vanilla encrypted model under the sequence length 512, and reduces 80\% communication cost, with an up to 10x speedup to state-of-the-art approximated models.

Code Repositories

liangzid/MERGE
Official
jax
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
multi-task-language-understanding-on-mmlu-5-1Sakalti/ultiima-78B
MMLU (5-shot): 89.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
MERGE: Fast Private Text Generation | Papers | HyperAI